#### Abstract

A multilayer learning network assisted with frequency offset cancellation is proposed for modulation classification in the satellite to ground link. Carrier frequency offset greatly reduces modulation classification performance, so it must be cancelled before classification. Frequency offset cancellation weights are established through the multilayer learning network based on the MSE criterion. The weights and hidden layers of the multilayer learning network are then established for modulation classification, and they are trained and tuned to combat the interference introduced by frequency offset. Compared with current modulation classification algorithms, the proposed multilayer learning network greatly improves the Probability of Correct Classification (PCC). Simulation results show that the proposed multilayer learning network assisted with frequency offset cancellation achieves higher modulation classification performance with the same training sequence.

#### 1. Introduction

Satellite to ground links adopt different modulation technologies to satisfy different requirements. With the rapid development of communication technology, modulation classification for satellite communication has become an important research topic in signal recognition, especially in environments with larger frequency offset [1–3].

A Doppler frequency offset exists between users and the satellite; it greatly damages link performance and especially affects modulation classification. As the satellite moves at high speed, the frequency offset also changes. Modulation classification must therefore overcome frequency offsets over a large SNR dynamic range, so it is very important to adopt a frequency offset cancellation method for modulation classification.

Early research focused on cancellation algorithms in which frequency offsets were estimated at the receivers and then sent back to the respective transmitters. This is not suitable for long-distance transmission in the satellite to ground link. Reference [4] analyzed the single-user case in the earliest frequency offset cancellation algorithm. Least squares (LS) and minimum mean-square error (MMSE) methods were applied in [5]; the LS method required the frequency offset, while the MMSE algorithm required the noise power as well. Reference [6] proposed an iterative algorithm for cancelling frequency offset interference, which needs a large number of complex iterations. Huang gave a frequency offset cancellation method using circular convolution in [7], which has relatively high complexity.

Reference [8] proposed an alternative time-domain method for frequency offset compensation, but the precision was not very satisfactory. Reference [9] proposed a joint iterative detection algorithm, which requires an ICI matrix inversion in each iteration. In [10], a successive interference cancellation (SIC) algorithm was proposed to cancel frequency offset interference; the interference was cancelled by reconstructing the signals. An SIC algorithm depending on the power order was proposed in [11]. Reference [12] gave an SIC algorithm based on the signal-to-interference-plus-noise ratio (SINR) order. Reference [13] proposed an iterative parallel interference canceller (PIC) solution. Based on this, a selective parallel interference cancellation (SPIC) algorithm was presented in [14], but it had significant implementation complexity. Based on signal-to-interference ratio (SIR) analysis, a parallel interference canceller for mitigating interference due to CFOs was presented in [15].

Modulation classification for communication signals in the satellite link is a multivariate pattern classification problem with multiple unknown parameters. Modulation classification algorithms must extract features from the sampled signal. These algorithms can be classified into the following five types.

*Type 1 Histogram Features*. In 1984, Liedtke et al. proposed a pattern recognition method to classify digitally modulated signals, using the amplitude histogram, frequency histogram, and phase histogram as characteristic parameters. Too many feature dimensions lead to larger computational complexity. However, reducing the feature dimensions would impair the ability to distinguish modulations with similar distribution functions.

*Type 2 Statistical Cumulant Features*. In view of the large number of histogram features, literature [16] proposed a sixth-order cumulant feature in order to improve the performance. However, the sixth-order cumulant algorithm was based only on the energy model, which was not effective against frequency offset. In [17–19], cyclic high-order cumulants were obtained for modulation classification, and the performance in the Gaussian white noise channel was given, but the performance under large frequency offset was not considered. References [20, 21] used high-order cumulant features for modulation classification. These features were relatively easy to extract; however, the computational complexity increased for higher-order cumulants. Because these modulation classification algorithms obtained their features from cumulants, classification performance under larger frequency offset was not taken into consideration.

Nandi and Azzouz used the statistical features of the instantaneous envelope, phase, and frequency of the signal and applied decision theory for modulation classification [22, 23]. A similar algorithm was proposed by Chan and Gadbois: according to the features of the signal envelope, the ratio of the signal envelope variance to the signal mean square was used as the decision criterion [24]. Louis classified digital signals such as 2FSK, 4FSK, 8FSK, OQPSK, MSK, BPSK, QPSK, 8PSK, 16QAM, and 64QAM based on instantaneous frequency, phase, and amplitude [17]. Swami adopted the normalized cumulative power of the symbol-synchronous sampling sequence to classify QAM, PSK, ASK, and other signals [20]. In 2001, Wong and Nandi [25] used statistical cumulant and spectral features in classification.

*Type 3 Transformation Feature Criterion*. Cumulant features for satellite signals can identify only several common modulations. In addition to directly using the statistical parameters and the histogram as classification features, the signal can be transformed into other feature spaces, and the satellite data analysis can be accomplished by using the feature parameters in the new feature space. With the development of signal processing theory, the signal can be transformed into various forms. A wavelet transformation feature for modulation classification was proposed in [26]. The Haar wavelet feature transform had better antinoise properties than the high-order cumulants, but it had weak resistance to frequency offset. In [27], a modulation classification algorithm based on cyclic spectrum features was proposed. Extending this algorithm, a modulation classification algorithm based on the bispectrum feature was used to classify MPSK signals in [28]. In [29], a wavelet packet method was proposed in which the signal was decomposed by the wavelet packet, the average energy of part of the decomposition vectors was obtained, and the average energies were arranged in a certain order to construct the classification feature. In [30], MFSK and MPSK were classified through the wavelet transform, and better PCC was obtained when the SNR was greater than 6 dB. In [31], an automatic classification algorithm based on spectral analysis was proposed, which used statistical characteristics assisted with a maximum likelihood estimator.

*Type 4 Bayesian Classification Criterion*. A classical method for modulation classification is the maximum likelihood method. Modulation classification is obtained through maximum likelihood function. In [32], MPSK signal based on phase maximum likelihood function was proposed. In [33], a joint likelihood function using amplitude and phase was proposed within larger SNR, and it was on the premise that the amplitude and phase were assumed to be independent. In [34], classification for MFSK signal was discussed. After averaging the unknown parameters in the average likelihood function, the integral expression of zero-order modified Bessel function was obtained. The high-order correlation analysis based on Bessel function was derived for modulation classification. In literature [35], a generalized likelihood ratio function for classification framework was proposed, which firstly expanded the power for the likelihood ratio function and then made the expected average processing for the unknown parameters and then formed the classification statistics based on higher correlation order.

The advantage of the likelihood-based approach to modulation classification is that it theoretically guarantees an optimal result under the Bayesian cost criterion. The classification performance curve can be obtained by theoretical analysis, and the likelihood ratio performance can be used as a theoretical upper bound. The maximum likelihood algorithm also has limitations and shortcomings. Firstly, compared with the other modulation classification methods, it requires more a priori knowledge, such as the form of the distribution function, including the mean, variance, and signal-to-noise ratio. Secondly, the existence of unknown parameters makes the calculation complex, which is difficult for real-time processing.

In order to cancel frequency offset, the eigenvalue is very limited to suppress the noise based on Bayesian classification criterion. Current methods such as feature extraction, classification, and regression can be regarded as shallow-structure algorithms. Their ability to represent complex functions is limited when samples and computational units are finite, and their generalization ability is restricted for complex classification problems.

*Type 5 Multilayer Neural Network*. With improvement of artificial intelligence, researchers introduced multilayer neural network algorithms for modulation classification. Multilayer learning could represent the input data distribution by learning a multilayer nonlinear network structure and implementing complex function approximation. The multilayer learning motivation lies in the establishment of a neural network, which could simulate the human brain for signal analysis and mimic human brain mechanisms to interpret the data.

Multilayer learning algorithms obtain data features by building models with multiple hidden layers to improve classification or prediction accuracy. Multilayer learning algorithm is different from the traditional shallow learning structure; the difference in multilayer learning lies in the following: (1) emphasizing the depth of the model structure usually with many hidden nodes and (2) the importance of feature learning being clearly highlighted. By layer-by-layer feature transformation, the feature of data in the original space is transformed into a new feature space to make classification or prediction easier. Compared with the method of constructing data features manually, the use of data to learn features can better express intrinsic information. Identification part needs to determine the appropriate decision rules and classifier structure.

Common classifiers include tree structure classifiers [36, 37], neural network classifiers [38, 39], and support vector machine (SVM) classifiers [40, 41]. The tree structure classifier uses a multilevel structure in which each level decides according to one or more characteristic parameters. The neural network classifier uses different artificial neural network structures, trained and tested in a variety of ways. The SVM classifier maps the feature vector into a high-dimensional space and constructs the optimal separating hyperplane in that space to classify the data. In [42], a classifier based on the tree structure was adopted. In [43], a multilayer clustering algorithm was used for modulation classification. In [44], a k-nearest neighbor (KNN) classifier was used to identify the modulation mode. Literature [45] used the distance distribution function for optimization and obtained better classification performance; however, this algorithm was not analyzed under larger frequency offset. In [46], an improved KNN multilayer learning architecture was proposed to classify modulation. However, this algorithm was designed for the Gaussian white noise channel, and its performance was limited under larger frequency offset.

#### 2. System Model and Problem Formulation

##### 2.1. Satellite to Ground Link Model

Due to high-speed movement of the satellite, the satellite to ground link is established between station and satellite. In this process, it is necessary to accurately calculate the constellation. The process has been shown in Figure 1. In this process, frequency offset of the satellite to ground link is changing, so it is necessary to accurately cancel the interference of frequency offset before modulation classification.

We establish the satellite to ground link communication model. Ossanna first proposed a probability statistical model of the satellite to ground link in 1964 [47]. Its principle is based on reflected wave interference, so the angle of reflection is limited, and the model is imprecise when used to describe rural and urban areas. In 1968, Clarke proposed a statistical model [48] based on the principle of scattered signal strength, but the model assumed that the incident waves were formed in the same way, so its application was limited. After the 1970s, Loo proposed a statistical model based on data measured in Canada [49]. If the shadow fading follows the Nakagami distribution, the Abdi star fading model is formed [50]. It is therefore common for satellite links to be described by probability density functions such as the Rician model, the Loo model, and the Rician-Lognormal model. The scenario considered for the satellite to ground communication system is mainly the urban environment, and most of the signal transmission has a direct path.

##### 2.2. Signal Processing Model

The th received frame can be written as where represents channel impulse response at instant . At the receiver, frequency offset is induced into the baseband signal due to Doppler frequency and the clock oscillator misalignment between the satellite and user. The baseband signal could be given aswhere denotes frequency offset (FO) normalized by the subcarrier spacing and is the additive white Gaussian noise with zero mean and variance.
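
The signal model described above can be illustrated numerically. The following is a minimal sketch, not the paper's exact expression: it assumes the common form in which the channel output is multiplied by a phase ramp of 2πεn/N radians at sample n (ε being the normalized frequency offset and N the number of subcarriers) before complex white Gaussian noise is added. The function name `apply_cfo_awgn` and all of its parameters are hypothetical.

```python
import numpy as np

def apply_cfo_awgn(x, eps, n_sub, snr_db=None, h=(1.0,), rng=None):
    """Channel filtering, normalized CFO rotation, and optional AWGN (assumed model)."""
    x = np.asarray(x, dtype=complex)
    # Linear convolution with the channel impulse response, truncated to the frame length.
    r = np.convolve(x, np.asarray(h, dtype=complex))[: len(x)]
    n = np.arange(len(x))
    # The CFO appears as a phase ramp of 2*pi*eps/n_sub radians per sample.
    y = r * np.exp(1j * 2 * np.pi * eps * n / n_sub)
    if snr_db is not None:
        rng = np.random.default_rng() if rng is None else rng
        noise_var = np.mean(np.abs(r) ** 2) / 10 ** (snr_db / 10)
        # Zero-mean complex Gaussian noise, variance split between real and imaginary parts.
        noise = np.sqrt(noise_var / 2) * (
            rng.standard_normal(len(x)) + 1j * rng.standard_normal(len(x))
        )
        y = y + noise
    return y
```

In the noise-free case the output is the input rotated by a linearly growing phase, which is the rotation that the cancellation network must undo.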

As shown in Figure 2, the multipath fading satellite communication signal passes through the downconversion and sampling block. The signal-to-noise ratio is estimated first, and then the signal enters the frequency offset cancellation and modulation classification block based on the multilayer learning network. After the corresponding demodulation and decoding, the information is finally recovered.

#### 3. Multilayer Learning Network for Modulation Classification Assisted with Frequency Offset Cancellation

##### 3.1. Motivation of Multilayer Learning Network

The optimization goal of modulation classification for satellite communication signals with frequency offset is the minimum mean square error (MSE) between the received signal with carrier frequency offset and the expected output signal, which could be expressed as .

This equation implies that the optimization cost function exhibits strongly nonlinear behavior, so the problem is hard to solve with a linear method. Reference [16] also involved a nonlinear regression problem that is hard to solve. Therefore, in the present work, a nonlinear weight solution method based on a multilayer learning network is proposed to solve the nonlinear problem.

Carrier frequency offset interference seriously degrades modulation classification for satellite communication signals. In accordance with the signal processing flow, the proposed multilayer learning network is divided into two parts, as shown in Figure 3. The first part is the carrier frequency offset cancellation module, and the second part is the modulation classification module. The carrier frequency offset cancellation module can greatly improve the probability of correct classification.

The multilayer neural network mechanisms for frequency offset interference cancellation and for modulation mode classification are trained separately.

Because of the way frequency offset interference is formed in the satellite communication system, the two tasks require different multilayer neural network mechanisms, and the weights required by the frequency offset interference cancellation network and the modulation mode classification network differ greatly. Therefore, the two cascaded neural networks are separately trained and reversely tuned, using the same training sequence.

##### 3.2. Framework for Communication Signals

Before introducing the multilayer learning network, the framework for the communication signal is defined. Figure 4 shows the typical framework, which includes training symbols and data symbols. The training symbols are known by both the transmitter and the receiver and serve as reference symbols. The multilayer learning network can be trained to convergence with the training symbols, and the trained network with the optimal weights can then be used for frequency offset cancellation and modulation classification.

#### 4. Multilayer Learning Network for Frequency Offset Cancellation

##### 4.1. Processing Flow for Carrier Frequency Offset Cancellation

It is effective to obtain optimum weights of multilayer learning network through iterative training symbols. Figure 5 is obtained by passing through the two-layer learning network with weight , where represents the input signal for the first layer learning network and represents the input signal for the second layer network. Here, we use two layers for frequency offset cancellation. For current analysis, more layers are not able to obviously improve the performance. The relevant theoretical analysis will be given later. is defined as the length of symbols for input. is defined as the training symbols, which can be known by the transmitter and the receiver. After multilayer learning networks are trained adaptively through these symbols, weights will converge. In practice, .

##### 4.2. First Layer for Carrier Frequency Offset Cancellation

We define cost function as where is exponential weighting factor and is defined as a positive constant value less than unit. represents the weight for the first layer learning network and represents the weight for the second layer network. represents the output for the first layer network and represents the output for the second layer network.

We further could obtain through the first layer

Optimization goals for weight can be expressed as

is defined as the reverse tuning error for the first layer, which is between the th reference symbol and th received symbol. We could obtain The gradient vector for cost function with respect to can be derived asAfter some simple mathematic manipulation, we could obtainDefineNote that (9) may be rewritten into the iterative form:We could also obtain in iterative form:Substituting into (6), we finally getwhereAfter manipulation, we could obtain that is also a constant value.

Updating , we obtainTaking the limit, we could obtainAfter mathematical operation, the first layer output will be obtained aswhereFrom (17), we can obtain that the output has the product of and , and the input symbol can be compensated with weights of multilayer learning network. When , we could obtain that . So we have that the interference induced by could be ignored. If the frequency offset is so large, would induce the residual frequency offset, which could also destroy the output signal . So we establish the second layer for residual frequency offset cancellation.

is defined as the residual interference, which is caused by interference part . We could also get this result from constellation simulation as shown in Figure 7. This disturbance also causes the phase rotation of ideal signal, so signal passing through the first layer could be expressed aswhere is the residual frequency offset; could only induce the phase rotation for .

##### 4.3. Second Layer for Carrier Residual Frequency Offset Cancellation

The second layer for signal processing is similar to the first layer.

Updating ,whereAfter manipulation, we could obtain that is also a constant value.Taking the limit, we will obtainAt the high SNR, the noise can be ignored; the output signal will be

From (23), we can obtain that the output has the product of and ; we could get the signal with non-frequency offset. We therefore conclude that the symbol with residual frequency offset can be compensated with second layer of multilayer learning network completely.

The frequency offset cancellation performance of the multilayer learning network can be examined by constellation simulation. The SNR is 20 dB and the number of subcarriers is set to 256. Figure 6 gives the constellation of the baseband signal with the frequency offset . Figure 7 presents the constellation after cancellation by the first layer. Figure 8 presents the constellation after cancellation by the second layer. As these figures show, the first layer may cancel the frequency offset completely when the frequency offset value is small, but for a larger frequency offset value a residual frequency offset remains. The second layer then cancels this residual, so the frequency offset may be cancelled completely even when its value is larger.
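
The behaviour described in this section can be approximated with a small numerical experiment. The sketch below is not the paper's derivation, whose exact update equations are not reproduced here. It assumes a single complex weight per layer, adapted on known training symbols by a standard exponentially weighted recursive least squares (RLS) rule, with the second layer fed by the output of the first. The forgetting factor 0.95, the normalized offset 0.5, the QPSK training symbols, and all function names are assumptions; the SNR of 20 dB and the 256 subcarriers follow the text.

```python
import numpy as np

def rls_single_tap(x, d, lam=0.95, delta=1e-2):
    """Single complex weight adapted by exponentially weighted RLS (a priori output)."""
    x = np.asarray(x, dtype=complex)
    w = 0j          # weight, started at zero
    r = delta       # exponentially weighted input energy, small positive start
    y = np.empty_like(x)
    for n in range(len(x)):
        y[n] = w * x[n]                 # output uses the weight from the previous step
        e = d[n] - y[n]                 # error against the known training symbol
        r = lam * r + abs(x[n]) ** 2    # update weighted energy
        w = w + np.conj(x[n]) * e / r   # RLS weight update
    return y

def two_layer_demo(n_train=2000, eps=0.5, n_sub=256, snr_db=20.0, seed=0, skip=200):
    """Return the MSE before cancellation, after layer 1, and after layer 2."""
    rng = np.random.default_rng(seed)
    bits = rng.integers(0, 2, size=(n_train, 2)) * 2 - 1
    s = (bits[:, 0] + 1j * bits[:, 1]) / np.sqrt(2)      # unit-power QPSK training symbols
    n = np.arange(n_train)
    noise_var = 10 ** (-snr_db / 10)
    noise = np.sqrt(noise_var / 2) * (
        rng.standard_normal(n_train) + 1j * rng.standard_normal(n_train)
    )
    x = s * np.exp(1j * 2 * np.pi * eps * n / n_sub) + noise  # symbols with CFO and AWGN
    y1 = rls_single_tap(x, s)    # first layer tracks the rotating phase with a lag
    y2 = rls_single_tap(y1, s)   # second layer removes the remaining rotation

    def mse(y):
        # Skip the initial samples so the adaptation transient is not counted.
        return float(np.mean(np.abs(y[skip:] - s[skip:]) ** 2))

    return mse(x), mse(y1), mse(y2)

if __name__ == "__main__":
    print(two_layer_demo())
```

In this sketch the first layer follows the rotating phase with a lag, which leaves an approximately constant rotation of the constellation; the second layer sees that constant rotation as a fixed target and removes it. This mirrors the qualitative progression described for Figures 6 to 8, under the stated assumptions.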

#### 5. Multilayer Learning Network for Modulation Classification

##### 5.1. Multilayer Learning Network for Modulation Classification

Define as the prediction output for the network. is defined as the ideal output value for the th-order network.

The output estimator value of the th layer network can be reconstructed with th hidden layer unit and the weight ; the th layer network can be expressed aswhere is defined as the weight for the th hidden layer.

exists as the th hidden layer activation function, which is obtained by the following activation function:

From the above, we can obtain that the output response can be obtained through hidden layer unit, which can be expressed as

Define , , as training, is the ideal for the th training unit, and is the target output of the th training unit.

In order to improve prediction and classification efficiency for the multilayer learning network, the expected forecast direction is through th hidden layer unit and th layer network training; signal output error is .

Multilayer network input sequence is the multipath fading signal. After the th depth learning network training, the th hidden layer for unit can be expressed as

and expanded as

By substitution, we could obtain

Obtaining for the gradient from the above equation, we could obtain

Then the available is simplified:

The amplitude weights can be expressed as the above formula and can be represented by the matrix, which can be simplified:where
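
The forward pass and gradient-based weight update outlined in this subsection can be sketched as follows. The activation function and the exact update expressions are not recoverable from the text, so this sketch assumes a sigmoid activation, one hidden layer, an MSE loss, and plain gradient descent. The two-dimensional toy features stand in for real signal features, and all names (`init_params`, `forward`, `mse_step`, `train_toy`) are hypothetical.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def init_params(n_in, n_hidden, n_out, rng):
    """Small random weights and zero biases for one hidden layer."""
    return {
        "W1": rng.normal(0.0, 0.5, (n_in, n_hidden)), "b1": np.zeros(n_hidden),
        "W2": rng.normal(0.0, 0.5, (n_hidden, n_out)), "b2": np.zeros(n_out),
    }

def forward(p, X):
    """Hidden-layer response H and network output Y."""
    H = sigmoid(X @ p["W1"] + p["b1"])
    Y = sigmoid(H @ p["W2"] + p["b2"])
    return H, Y

def mse_step(p, X, T, lr=0.5):
    """One gradient-descent step on the MSE between output Y and target T."""
    H, Y = forward(p, X)
    err = Y - T
    loss = float(np.mean(np.sum(err ** 2, axis=1)))
    dZ2 = 2.0 * err * Y * (1.0 - Y) / len(X)       # gradient at the output pre-activation
    dZ1 = (dZ2 @ p["W2"].T) * H * (1.0 - H)        # back-propagated to the hidden layer
    p["W2"] -= lr * (H.T @ dZ2)
    p["b2"] -= lr * dZ2.sum(axis=0)
    p["W1"] -= lr * (X.T @ dZ1)
    p["b1"] -= lr * dZ1.sum(axis=0)
    return loss

def train_toy(steps=300, seed=0):
    """Train on two well-separated toy classes; returns the loss history and accuracy."""
    rng = np.random.default_rng(seed)
    X = np.vstack([rng.normal(-2.0, 0.5, (100, 2)), rng.normal(2.0, 0.5, (100, 2))])
    labels = np.repeat([0, 1], 100)
    T = np.eye(2)[labels]                           # one-hot target outputs
    p = init_params(2, 8, 2, rng)
    losses = [mse_step(p, X, T) for _ in range(steps)]
    _, Y = forward(p, X)
    acc = float(np.mean(np.argmax(Y, axis=1) == labels))
    return losses, acc
```

Calling `train_toy()` returns the per-step loss and the final training accuracy on the toy data; in a real experiment the inputs would be features of the frequency-offset-compensated signal rather than synthetic points.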

##### 5.2. Reverse Parameter Adjustment

A parametric model for forward training networks should be adjusted as reversed parameter. The energy model is used to obtain the gradient for parameters . Through Contrastive Divergence method, we can obtainwhere indicates the expectation of the data distribution and indicates the input data after Gibbs sampling.

The parameters can be adjusted by the following steps:where is the learning rate.
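
A single Contrastive Divergence update can be sketched as below. This is a minimal illustration assuming a binary restricted Boltzmann machine as the energy model and one Gibbs sampling step (CD-1); the paper does not state these details, and the names `cd1_update`, `W`, `a`, and `b` are hypothetical. The update is the learning rate times the difference between the data-driven statistics and the statistics after Gibbs sampling.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def cd1_update(W, a, b, v0, lr=0.1, rng=None):
    """One CD-1 step. W: (visible, hidden) weights, a: visible bias, b: hidden bias."""
    rng = np.random.default_rng() if rng is None else rng
    # Positive phase: hidden probabilities driven by the data.
    ph0 = sigmoid(v0 @ W + b)
    h0 = (rng.random(ph0.shape) < ph0).astype(float)   # sample binary hidden states
    # Negative phase: one Gibbs step back to the visible layer and up again.
    pv1 = sigmoid(h0 @ W.T + a)
    ph1 = sigmoid(pv1 @ W + b)
    n = v0.shape[0]
    dW = (v0.T @ ph0 - pv1.T @ ph1) / n                # data term minus reconstruction term
    da = (v0 - pv1).mean(axis=0)
    db = (ph0 - ph1).mean(axis=0)
    return W + lr * dW, a + lr * da, b + lr * db
```

The function returns new parameter arrays instead of modifying its inputs, so it can be called repeatedly inside a training loop over mini-batches.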

#### 6. Experimental Classification Results and Analysis

The modulation modes commonly used in satellite communication, QPSK, 8PSK, and QAM, are defined as the test sets. These modulated signals are narrowband signals corrupted by Gaussian white noise. The signal carrier frequency is 1000 kHz, the symbol rate is 180 kHz, and the sampling rate is 4000 kHz.

The digital signal modulation mode is generated by random generator. Multilayer learning network is simulated by Monte Carlo simulation for 20,000 tests. is the maximum normalized frequency offset, which is defined as the ratio between frequency offset and bandwidth.
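
One way such test frames could be generated is sketched below, using the carrier frequency, symbol rate, and sampling rate stated above and a frame length of 2048 samples. Several details are assumptions not given in the text: rectangular pulses without pulse shaping, 16QAM as the QAM order, the SNR defined against the measured signal power, and the names `CONSTELLATIONS` and `passband_frame`.

```python
import numpy as np

# Unit-average-power constellations; the QAM order (16) is an assumption.
CONSTELLATIONS = {
    "QPSK": np.exp(1j * (np.pi / 4 + np.pi / 2 * np.arange(4))),
    "8PSK": np.exp(1j * np.pi / 4 * np.arange(8)),
    "16QAM": np.array(
        [i + 1j * q for i in (-3, -1, 1, 3) for q in (-3, -1, 1, 3)]
    ) / np.sqrt(10),
}

def passband_frame(mod, n_samples=2048, fc=1000e3, rs=180e3, fs=4000e3,
                   snr_db=10.0, rng=None):
    """Real passband frame of randomly drawn symbols plus white Gaussian noise."""
    rng = np.random.default_rng() if rng is None else rng
    points = CONSTELLATIONS[mod]
    t = np.arange(n_samples) / fs
    # fs / rs is not an integer, so each sample is mapped to its symbol index directly.
    sym_idx = np.floor(t * rs).astype(int)
    symbols = rng.choice(points, size=sym_idx[-1] + 1)
    baseband = symbols[sym_idx]                      # rectangular pulses
    signal = np.real(baseband * np.exp(1j * 2 * np.pi * fc * t))
    noise_var = np.mean(signal ** 2) / 10 ** (snr_db / 10)
    return signal + np.sqrt(noise_var) * rng.standard_normal(n_samples)
```

Repeating the call with independent random generators for each modulation type yields the groups of frames used for training and for Monte Carlo testing.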

Typical multipath model is established in Table 1. The main channel adopts the Rice channel model. The second-order model is Rayleigh channel model.

The measured results for urban environment were proposed in [51] with signal carrier frequency of 1.82 GHz. Table 1 shows the measured parameters of the urban environment, rural environment, and suburban environment.

##### 6.1. BER Performance versus SNR

Firstly, we simulate BER versus SNR through frequency offset cancellation performance. The bit-error-rates (BER) versus average SNR for and are shown in Figures 9 and 10, respectively. The results with no frequency offset, no compensation, MMSE equalization algorithm [3], iterative algorithm proposed in [15], and the proposed compensation method are presented. By investigating these figures, it is clearly shown that no cancellation suffers from ICI and has a high error floor. MMSE algorithm performs well, since ICI and MUI can be removed. MMSE-SIC algorithm performs better than MMSE algorithm as the effect of noise enhancement can be significantly reduced. It is noticed that the more precise frequency offset can be obtained by the iterative algorithm at the cost of the complex iterative computation and estimation. However, the two layers of multilayer learning network exhibit the best performance.

##### 6.2. PCC Performance versus SNR at Different Modulation

Secondly, we simulate PCC performance versus SNR. Figure 11 shows the PCC of the multilayer learning network at SNR from −5 dB to 20 dB. The frequency offset is defined as . Within the same training sequence for each modulation mode, 2048 sampling points constitute a frame. 100 groups are set as training set; that is, . The remaining 1900 groups are test set. It can be seen from the figure that when the SNR is greater than 0 dB, the PCC of the multilayer learning network is above 93% for the above modulation scheme. At SNR greater than 5 dB, the PCC is more than 96%. Therefore, it is obtained from the simulation that the proposed multilayer learning network assisted with frequency offset cancellation has a better PCC for different signal modulation types.
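
The PCC metric and the split into 100 training groups and 1900 test groups can be made concrete with a short sketch. The helper names `pcc` and `split_groups` are hypothetical, and the random assignment of groups to the two sets is an assumption; the text does not say how the groups were selected.

```python
import numpy as np

def pcc(true_labels, predicted_labels):
    """Probability of Correct Classification: fraction of correctly labelled test items."""
    t = np.asarray(true_labels)
    p = np.asarray(predicted_labels)
    return float(np.mean(t == p))

def split_groups(n_groups=2000, n_train=100, rng=None):
    """Randomly assign group indices to a training set and a test set."""
    rng = np.random.default_rng() if rng is None else rng
    perm = rng.permutation(n_groups)
    return perm[:n_train], perm[n_train:]
```

For example, three correct decisions out of four test frames give a PCC of 0.75.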

##### 6.3. PCC Performance versus SNR at Different Training Length

Figure 12 shows PCC of satellite to ground link signal modulation pattern under different training lengths. Under the different training sequence conditions for each modulation mode, 2048 data frames constitute a group. Frequency offset is defined as . For classification modulation within 2000 groups, we choose different lengths of training data, 20 groups, 40 groups, and 60 groups for training.

Using the above method, the PCC of the multilayer learning network is tested for BPSK, QPSK, and 8PSK modulated signals. It can be seen from Figure 12 that when the training length is greater than 20 groups, the PCC is above 93% for these modulation modes. When it is greater than 40 groups, the PCC is more than 96%. The simulation therefore indicates that the proposed multilayer learning network achieves a better PCC for different types of signal modulation and extracts deeper signal features.

##### 6.4. Probability of Correct Classification Simulation at Different SNR

Figure 13 shows the PCC of proposed multilayer learning network from −5 dB to 20 dB compared with different algorithms proposed in [20, 26, 46]. Define the same training sequence for each modulation mode. 2048 data points constitute a frame, and 1024 frames constitute a group. For 2000 groups, 100 groups are training sets; that is, . The remaining 1900 groups are defined as the test group. When SNR = 0 dB, PCC for communication signal modulation mode of all algorithms is below 90%, and the performance of the algorithm is affected by noise interference.

When SNR = 5 dB, the PCC of all algorithms increases. The SVM proposed in literature [26] is based on the wavelet packet decomposition criterion, and its PCC is higher because it makes full use of the wavelet packet features. The PCC of the SVM increases rapidly, so the SVM has better PCC in Gaussian white noise. However, in the presence of frequency offset, the performance of wavelet packet decomposition is constrained; thus the SVM algorithm does not show an advantage. For the modulation classification algorithm based on sixth-order cumulants proposed in literature [20], the performance is stable under different SNR conditions. This is because the cumulant features are constrained by the frequency offset; as the frequency offset increases, cumulant-based signal features are constrained. It can be seen from the simulation results that the proposed algorithm has high accuracy and stability under different SNR conditions. Compared with the KNN algorithm proposed in [46], the proposed multilayer learning network has higher PCC under the same SNR condition. This is because the proposed multilayer learning network assisted with frequency offset cancellation uses the multilayer network weights for signal reconstruction. When the SNR is not less than 10 dB, the Probability of Correct Classification can reach more than 95%.

Table 2 shows the PCC performance of proposed multilayer learning network from 0 dB to 20 dB, which is compared with different algorithms proposed in [20, 26, 46].

Table 3 shows the measured parameters for the proposed multilayer learning network, which is compared with different algorithms proposed in [20, 26, 46].

##### 6.5. Importance for Multilayer Learning Network with MSE Guidelines

For the frequency offset interference cancellation neural network, the interference is induced by the instantaneous linear exponential multiplicative interference, according to relevant proofs given about formulations (15)–(17), (19), (21), (22), and (23) in Sections 4.2 and 4.3.

The MSE criterion can obtain the optimal interference cancellation performance, and the multiplicative exponential linear interference can be eliminated. With only a few training samples, the weight network generated under the MSE criterion can be trained and the optimal interference cancellation performance can be obtained. Adopting the MSE criterion to establish the neural network is therefore effective.

In order to prove the effectiveness of the MSE criterion, Figure 14 shows PCC of satellite to ground link signal modulation classification assisted or not with MSE criterion. For classification modulation within 2000 groups, we choose 200 groups for training. 2048 data frames constitute a group. Frequency offset is defined as .

Using the above method, the PCC of the multilayer learning network is tested for BPSK, QPSK, and 8PSK modulated signals. It can be seen from Figure 14 that, for the same training length, the PCC is above 93% for modulation classification assisted with the MSE criterion, which can obtain the optimal interference cancellation performance. The simulation therefore indicates that the proposed multilayer learning network achieves a better PCC for different types of signal modulation and, based on the MSE criterion, extracts deeper signal features.

##### 6.6. Overfitting Analysis of Multilayer Learning Assisted with Frequency Offset Cancellation

Because of the characteristics of satellite link communication signals, modulation mode classification is mainly affected by the interference of larger carrier frequency offset. If the frequency offset interference is not eliminated, more training samples are needed to ensure correct fitting. In this paper, carrier frequency offset interference cancellation is performed with fewer samples under the MSE criterion, which improves the accuracy of modulation mode classification.

Once the related frequency offset interference is excluded, the classification features are obtained from the modulation mode, and the model can be established from a small amount of data while ensuring the PCC and preventing overfitting.

We also perform simulations with different numbers of training samples. Under frequency offset interference, more training samples are needed to ensure a reasonable PCC. Using a small number of training samples to precancel the frequency offset interference makes it possible to obtain a higher classification rate with fewer training sequences.

In order to prove that the small training set can be potentially overfit for the dataset, Figure 15 shows PCC of satellite to ground link signal modulation classification assisted or not with frequency offset cancellation. For classification modulation within 2000 groups, we choose 100 groups and 200 groups for training. 2048 data frames constitute a group. Frequency offset is defined as .

Using the above method, the PCC of the multilayer learning network is tested for BPSK, QPSK, and 8PSK modulated signals. It can be seen from Figure 15 that when the training length is small, the PCC is above 97% for modulation classification assisted with frequency offset cancellation, which can obtain the ideal performance. The simulation therefore indicates that the proposed multilayer learning network achieves a better PCC for a small training set.

#### 7. Conclusion

A multilayer learning network assisted with frequency offset cancellation is proposed for modulation classification in the satellite to ground link. Larger carrier frequency offset greatly degrades the performance of modulation classification. Firstly, we establish cancellation weights for frequency offset through the multilayer learning network. Secondly, we establish the weights and hidden layers for modulation classification through the multilayer learning network. We also train the hidden layers and weights using positive fine-tuning to cancel the interference introduced by frequency offset in the satellite to ground link. Compared with current modulation classification algorithms, the proposed multilayer learning network greatly improves the PCC. Simulation results show that the multilayer learning network model assisted with frequency offset cancellation achieves higher performance for satellite communication signal classification with the same training sequence.

#### Conflicts of Interest

The author declares that there are no conflicts of interest.

#### Acknowledgments

This work is supported by the National Natural Science Foundation of China (no. 61501306), the Natural Science Foundation of Liaoning Province of China (no. 2015020026), and the Natural Science Foundation of Liaoning Provincial Department of Education (no. L2015402).