Abstract

Atomic clock is the core component of navigation satellite payload, playing a decisive role in the realization of positioning function. So the monitoring for anomalies of the satellite atomic clock is very important. In this paper, a complete autonomous monitoring method for the satellite clock is put forward, which is, respectively, based on Phase-Locked Loop (PLL) and statistical principle. Our methods focus on anomalies in satellite clock such as phase and frequency jumping, instantaneous deterioration, stability deterioration, and frequency drift-rate anomaly. Now, method based on PLL has been used successfully in China’s newest BeiDou navigation satellite.

1. Introduction

The most important function of navigation satellite is to support its users to acquire their position through the satellite signal, during which satellite time is one of the most important factors. Because of the changes in temperature, humidity, radiation, and the aging of the satellite clock, the physical and electric part of clock may both have problems, which will bring anomaly in clock signal, resulting in large error in the prediction of satellite time or even unpredictability, which may lead to disastrous consequence. So the anomaly monitoring of satellite clock is very important.

So far, researchers have proposed schemes to monitor anomalies of clock, such as Interferometric Detection Method [1], Least Square (LS) Detection Method [2, 3], Generalized Likelihood Ratio Test (GLRT) [46], Kalman Filtering Method [710], and Dynamic Allan Variance (DAVAR) Method [5, 1113]. Though their schemes have been proved to be effective for some (not all) anomalies, some extra work is still needed to realize Self-Monitoring.

Under normal circumstance, ground station can evaluate the health condition and performance of the clock by continuously tracking satellite signals. But when the satellite flies beyond the ground station’s sight, or owing to some reasons, the satellite cannot contact with the ground station in a few hours or even days; the satellite needs to judge the status of the clock all by itself. Self-Monitoring for clock anomaly, which is in the absence of ground station, is that the satellite monitors its clock by itself to make a judgment on the satellite clock running state.

The common anomalies of the satellite clock are signal loss, phase jumping, frequency jumping, instantaneous deterioration, stability, and frequency drift-rate deterioration.

The contributions of this paper can be summarized as follows.

In this paper, a set of Self-Monitoring algorithms is proposed to improve the reliability of satellite. Two methods are put forward to monitor satellite clock anomalies. The first method is based on PLL, and it can detect signal loss and phase and frequency jumping. Based on the measurement data from intercomparison among three clocks, Modified DAVAR is used to detect phase and frequency jumping and instantaneous deterioration; we use windowed overlapping Hadamard variance to evaluate clock stability in real time and the three-state Kalman filter to detect large drift rate.

The method based on PLL has been proved effective and used in newest BeiDou satellite. And the other research on Self-Monitoring Method in this paper can be used in next generation navigation satellites.

2. Self-Monitoring Method for Anomaly of Satellite

Generally speaking, there are two methods to evaluate atomic clocks: () comparing the clock signal with standard reference whose stability is much better than the evaluated clock and () making intercomparison among three or more clocks whose stability is almost the same.

Because there is no standard reference in the satellite and the performance of satellite clocks is similar, we make use of the second method to realize Self-Monitoring for anomalies. The schematic diagram is shown in Figure 1.

Firstly, we define that is the time error of clock 1, which is the difference between clock time and the standard time. is the time difference between clock 1 and clock 2. As shown in Figure 1, three clocks are all powered up. Their 10 MHz signals act as the input of phase difference measurement module, through which we can get the time difference data , , and among them. The Signal Processing Module uses the time difference data , , and to evaluate the health state of three clocks with certain algorithm and then commands the master clock selector to choose suitable clock as the frequency and time source of the entire satellite.

In this paper, , , and are used in Modified DAVAR to monitor phase and frequency jumping, used to evaluate the stability of three clocks, and used to monitor drift-rate anomaly.

The detailed structure of Self-Monitoring Module based on PLL in Figure 1 can be described in Figure 2.

2.1. Self-Monitoring Method Based on PLL

Figure 2 shows the basic schematic diagram of this method. As a phase tracking system, PLL is used to adjust the phase of local signal to trace the reference signal. Voltage Controlled Oscillator (VCO) provides sampling clock and working clock for AD and FPGA, respectively. As the input of the Decision Module, the observed quantity of this method comes from the output of Phase Detector. Output of Decision Module will be sent to Signal Processing Module in Figure 1 to help choose the master clock. At the same time, 10 MHz signal from Figure 1 is sampled by AD in Figure 2. The working frequency of Phase Detector is 1000 Hz.

Once phase or frequency jumping occurs, the output of Phase Detector in PLL will follow. In this section, the response of Phase Detector to these two anomalies will be derived.

According to [14], assuming that the phase of reference signal of PLL is and that of Direct Digital Synthesizer (DDS) output is , then we getwhere is the transfer function of the two-order ideal loop filter, is the normalized transfer function of the DDS, and is the loop gain. From expression (1), we getwhere is the phase difference between the reference signal and the local signal, and the error transfer function of the loop can be expressed aswhere is the undamped oscillation frequency and is the damped coefficient.

In the following, the tracking property of Phase Detector for phase and frequency jumping will be deduced.

2.1.1. Phase Jumping

Supposing that the phase jumping can be written as , whose Laplace transform can be expressed as , then the error response is

Through factorization, (4) is equivalent towhere

Considering (6) and (7), the inverse Laplace transform of (5) can be expressed as

From (8) we notice that the phase difference at reaches its peak value which has nothing to do with the loop parameters. Figure 3 is the simulation result, the PLL is locked at the beginning, and phase of reference signal jumps at s which leads to obvious jumping in the output of Phase Detector. In Figure 3, the loop parameters of three PLLs are and and the amplitude of phase jumping is equal to period of reference signal. With different loop parameters, the relocking process is different. The narrower the loop bandwidth is, the slower the tracking will be.

2.1.2. Frequency Jumping

Assuming that frequency jumping is , whose Laplace transform is , then the error response can be expressed as

After factorization,

and can also be described by expression (6). According to (6) and (10), the inverse Laplace transform of (5) can be expressed as

As can be seen from (11), the maximum amplitude of the phase difference in tracking is inversely proportional to . Figure 4 is the simulation result, PLL is locked at the beginning, and frequency of reference signal jumps at  s, which leads to obvious jumping in the output of Phase Detector. In Figure 4, the loop parameters are the same as in Figure 3 and frequency jumping is equal to reference signal frequency. With different loop parameters, the relocking process is different. The narrower the loop bandwidth is, the slower the tracking will be; however the jumping is much more obvious.

2.1.3. Signal Loss

As shown in Figure 5, assuming that the PLL has been locked and reference signal lost at  s, the jumping amplitude of output of Phase Detector is far larger than threshold in the following section; then it turns to 0 immediately, which is easy to be detected.

2.1.4. Summary

It can be seen that, from Figures 3, 4, and 5, phase jumping, frequency jumping, and signal loss will all lead to obvious jumping in Phase Detector output, which provides us with chance to monitor anomalies of satellite clock signal.

2.1.5. Simulations and Detection Performance

In practice, Probability of False Alarm (PFA) and Detection Probability (PD) are usually used to evaluate the detection method. The basic principle in setting parameters (loop parameters and detection threshold) is to improve PD and minimize PFA at the same time.

The loop parameters and detection threshold are mainly determined by clock noise level and required resolution. We usually use Allan variance (12) to calculate stability to evaluate the size of noise. And resolution is the minimum range of phase and frequency jumping that algorithm can distinguish.

During the following simulations, we simulate 10000 realizations.

Simulation 1. During the first simulation with MATLAB, , , and we use two-order ideal loop filter. Assuming that the relative frequency deviation (12) of clock signal follows Gauss distribution, whose Allan deviation can be expressed as , the detection performance of the method for phase and frequency jumping is shown in Tables 1 and 2. They separately give the PD, PFA, and detection delay in period phase jumping and frequency jumping. Detection delay is defined as , where is the sampling interval and is the number of sampling points that lasted from the moment anomaly occurred to the moment they are detected by the algorithm. So it is in fact determined by the output frequency of VCO in Figure 2.

Simulation 2. During the second simulation, detection threshold and resolution change with the stability of atomic clock. Tables 3 and 4 show it.

Analysis. When , PLL is called underdamped system, in which phase and frequency jumping will result in drastic oscillation. If , PLL is overdamped and usually more stable and slow to anomaly. In practice, we often set , which is an acceptable compromise between stability and response speed. From expression (11) and Figure 4, we notice that detection for frequency jumping will become difficult when is too large, and suppression for noise will also become weaker. Conversely, if is too small, on one hand, the locking process will become difficult, and detection delay becomes longer; on the other hand, the loop will be too sensitive, which leads the Decision Module to regard bottom noise as jumping by mistake frequently, which results in rising in PFA. During simulation, , which is also a compromise between PD and detection delay and can be adjusted as required.

The noise level of atomic clock directly determines the detection resolution, which we can see from Tables 3 and 4. The relationship between resolution and stability can be described as and , while threshold can be set as . From Tables 1 and 2, we can see that PD will be more than and PFA less than with appropriate threshold for both phase and frequency jumping. Moreover, it should be noted that PD and PFA listed in the tables are for least phase and frequency jumping that the method can distinguish; the detection performance improves with jumping size increasing. In fact, before the method was used in the satellite, we have tested it in real circuit board for a long time and it works well. Detection delay depends on Phase Detecting frequency, which is 1000 Hz. Delay for phase jumping is 1 ms, and it is less than 0.5 s for frequency jumping.

The method based on PLL can realize Self-Monitoring for phase jumping, frequency jumping, and signal loss. The computation complexity is low and costs little time to detect anomaly. But if we want to enhance its weak anomaly detection performance, we need to lower the working frequency of Phase Detector, which will lead to longer detection delay. In practice, we pay more attention to large frequency jumping in satellite atomic clock, which will obviously affect the positioning accuracy and our PLL method is designed for it.

2.2. Self-Monitoring Method Based on Statistics
2.2.1. Allan Variance

We usually use Allan variance [15, 16] to evaluate the stability of atomic clock; it can be expressed as follows:where is the averaging time and is the amount of . What needs to be pointed out is that is the relative frequency deviation. is the instantaneous frequency, is the nominal frequency, and is the clock time error at the th measuring instant.

Power-law spectrum is used to analyze noise property in frequency domain:where is spectrum density for relative frequency deviation and is the amplitude corresponding to different noise type. The power-law spectrum model contains five kinds of noise (); they are RW FM, Flicker FM, White FM, Flicker PM, and White PM. is determined by these five kinds of noise:

The slope of Allan variance gives us a knowledge of noise distribution in different averaging time.

After obtaining a sufficient number of measurement data, Allan variance can be used to calculate the stability of different averaging time. But when anomaly occurs, the results given by Allan variance may lose practical significance. As shown in Figure 6(a), the frequency of clock signal jumped and then returned some time later. We cannot get correct judgment for noise distribution according to the computed result by (12) shown in Figure 6(b). Besides, we do not know the anomaly type and detection delay is also too long.

2.2.2. Dynamic Allan Variance (DAVAR)

As can be seen from Figure 6(a), the main noise type does not change, but Figure 6(b) gives wrong judgment. Hence the conclusion is not consistent with the actual situation, and we cannot find the sign of frequency jumping either. Therefore, the traditional Allan variance cannot give believable information on such anomaly. In view of this, Galleani and Tavella put forward DAVAR, which can be expressed as (15) and can be used to evaluate the performance of clock in real time:

When we calculate DAVAR, a sliding window is used to cut the data. The window length is , and will be updated when new measurement data comes, so it can tell us health condition of clock in real time. is the least measurement period, and is the averaging time.

2.2.3. Modified DAVAR

From expression (15), we know that DAVAR can be updated in real time, but in order to guarantee the reliability of long-term stability, must be large enough, which will greatly reduce the detection probability of instantaneous anomaly. Because when frequency jumping occurs, , only one factor is not 0, DAVAR is not sensitive enough to weak frequency jumping. In this paper, we modify Dynamic Allan Variance to improve detection sensitivity for small frequency jumping:

Expression (16) is the Modified DAVAR, and (17) is its iterative calculation method.

2.2.4. Detection Performance of Modified DAVAR

In this section, we will firstly analyze and compare the detection performances of DAVAR and Modified DAVAR when facing phase and frequency jumping and then show that Modified DAVAR is also effective in detecting instantaneous stability deterioration.

The monitoring method for phase and frequency jumping is based on statistics. , , and from Figure 1 will be used here. Assume that only one of the three clocks breaks down. If phase or frequency jumping occurs in clock 1, , will be abnormal, while is still normal. Because the anomaly in is the same as that in , we only need to analyze .

In Figure 7(a), the amplitude of phase jumping is 12 times the standard deviation of the relative frequency deviation data . In Figure 7(b), the frequency jumping is 4 times the standard deviation of .

We simulated 1000 sampling points and phase and frequency jumping occurred at the 500th point. We simulated one realization and saved the response data of DAVAR and Modified DAVAR to jumping at every time instant; then we repeated 10000 realizations in the same way. Of course the 1000 sampling points’ data is different in every realization. Then we got the average response at every time instant that is shown in Figures 8 and 9.

Because the peak value of response of DAVAR and Modified DAVAR to jumping determines if the jumping could be detected, we focus on the peak value in each realization. Assuming that, in one realization, the maximum value of response of DAVAR to frequency jumping is , the maximum value of response of Modified DAVAR to frequency jumping is , and then we saved and . We studied the saved data to give the minimum value, maximum value, mean value, and standard deviation of and in Figures 8 and 9.

To make the statistical result more clear, we list the statistical characteristic of and in frequency jumping in Table 5. From Table 5 and Figure 9 we know that is not big enough to be distinguished from the base noise, and Modified DAVAR is more sensitive to weak frequency jumping.

In Tables 6 and 7, the detection performance of two kinds of variance for phase and frequency jumping is given. Tables 8 and 9 tell us the detection performance for different clock stability.

What should be pointed out is that the unit of resolution in phase and frequency jumping in Tables 69 is the standard deviation of the relative frequency deviation data , namely, . Detection delay is the number of sampling points from anomaly occurring to be detected.

During the simulation, we simulate 10000 realizations and use same threshold for both DAVAR and Modified DAVAR. The window length , and  s. It should be noted that because  s, it is precise enough for us to do the approximation to only consider WFM in the simulation. In fact, the Modified DAVAR is still effective in detecting phase and frequency jumping in the presence of other noise types.

Table 6 tells us that PD of Modified DAVAR is almost as good as DAVAR and PFA is lower. From Table 7, we know that Modified DAVAR is more sensitive to weak frequency jumping but detection delay is longer.

From Tables 8 and 9, we notice that the resolution of Modified DAVAR for phase and frequency jumping is the same for different clocks. What we need to do is only to reset the threshold according to expression (18):

In addition, the window length is an important parameter for Modified DAVAR. The longer the window, the weaker the detection performance. However, if the window length is too short, PFA will rise and detection resolution will also deteriorate.

Figure 10 shows that the Modified DAVAR can also monitor instantaneous deterioration effectively.

Modified DAVAR can be considered as a statistical tool; it is effective to detect phase and frequency jumping. Compared with PLL method, we need to measure the time error data firstly and then calculate the statistical characteristics of clock. Modified DAVAR can monitor weaker frequency jumping compared to PLL method, but PLL method is independent of a second standard reference and time-comparison device, which will give us more flexibility. Taking their respective characteristics into account, cooperation between them may be a good choice to improve the Self-Monitoring reliability.

2.2.5. LS Method and Kalman Method on Detection for Frequency Jumping

In this section, we will introduce two existing methods, which are called LS method and Kalman filter method.

LS Method. We may make use of LS algorithm to calculate the averaging frequency deviation with newest saved sampling points and then predict the time error . After we can compare it with the real measurement , if the difference is beyond the threshold , we think that frequency jumping occurs.

Kalman Filter Method. We can make use of Kalman filter to predict the next state of clock and then compare it with the real measurement . If the difference is larger than the configurable threshold, we think that anomaly occurs.

Simulations. Because the sampling interval  s, we only consider WFM noise, whose standard deviation is . During the simulations for LS method, we choose , while, for Kalman filter method, the state transition matrix , observation matrix , system error covariance matrix , and observation error covariance matrix .

Figures 11 and 12 show the frequency jumping detection of the two methods.

After we have done numerical simulations, we give Table 10 to show the detection performance of the methods, in which PLL method, DAVAR method, and Modified DAVAR method are included.

Discussion. It should be noted firstly that the detection performance will be different with different parameters. We use the same noise level to test different methods to give Table 10, which can be a reference to show different characteristics of different methods.

Different observation quantity is needed for different methods. In our opinion, DAVAR, Modified DAVAR, LS, and Kalman filter are all effective in detecting weak frequency jumping. They need a second standard reference and time-comparison device to get the time error measurement to be used as observation quantity to run the algorithm. PLL method can realize Self-Monitoring for frequency jumping without standard source, which can give us more flexibility and has been used on BeiDou satellite. The computation complexity is also different among them. LS method, Kalman filter method, and PLL method should be of less computation quantity. The resolution of Modified DAVAR is the best but at the cost of longer delay, while the PLL method has the shortest delay but at the cost of worst resolution. Sometimes we may have to compromise between resolution and delay.

2.2.6. Stability Evaluation of Satellite Clock

In this section, we use the well-known “three-cornered hat” approach [1719] to evaluate the stability of atomic clocks.

According to the references, we can get , , and from , , and in Figure 1. For example, can be expressed as

Because Allan variance is convergent for the five kinds of noises at different averaging time, it is often used to evaluate the stability of clock. However Allan variance cannot rule out frequency drift. Especially when the drift is almost equal to Allan variance for certain averaging time, if we use Allan variance to calculate , , and and then calculate , , and , the result cannot reflect the real situation of atomic clock. In order to avoid the influence of frequency drift, two-order difference for frequency data or three-order difference for phase data is needed, which is just the definition of Hadamard variance.

In order to make full use of the measurement data and also track slow change of satellite clock timely, we use windowed overlapping Hadamard variance, as shown in (20); even though the additional overlapping differences are not all statistically independent, they nevertheless increase the number of degrees of freedom and thus improve the confidence in the estimation. Moreover, by using the latest data, the variance can evaluate the health condition of atomic clock in real time:where is the length of the window, namely, the amount of data used for each update, and is the time interval between and , which is defined as sampling period. is the measurement period, averaging time , and . We know that, from the characteristics of Hadamard variance, the longer the averaging time, the larger the amount of data needed.

Expression (21) can be used as recursive algorithm to reduce computation complexity in the updating of . Moreover, is a constant for each averaging time; division operation can be done only when necessary:where and .

As opposed to the Allan variance, which makes use of a second difference, the Hadamard variance employs a third difference that leads to reduction in the degrees of freedom by one. The Hadamard variance requires more data to produce a single stability calculation, as compared to the Allan variance, given equal averaging time . So it will be a better choice to use different statistical tool for different averaging time. When the averaging time is short, it is much more convenient to use an ADEV three-cornered hat method, while HDEV will be a better choice when the linear frequency drift is dominant.

2.2.7. Detection for Frequency Drift-Rate Anomaly of Clock

According to [2024], several drift-rate estimators are discussed and compared. We will firstly compare six different estimators by simulations in the following:(a)Two points: (b)Two groups of points: (c)LS: (d)Three points: (e): (f)Kalman filter: , ,where is the phase data, frequency deviation and drift rate, and observation matrix . The state transition matrix is and state error covariance matrix can be expressed as [25].

To compare these six estimators, we generate simulation data by well-known Stable 32 software. The parameters of phase data are shown in Table 11, and we consider three kinds of noise type which are WFM, FFM, and RWFM.

From Table 12 and Figures 13 and 14, we know that method and Kalman filter should be good choice for drift-rate estimation. To make sure that the simulation result is reliable, we make use of different simulation data to test the performance of the six estimators, and the result is just similar.

In fact, no matter which method we choose to evaluate the drift rate, we must know the time error data , which is equivalent to in this section.

When the satellite is within the sight of ground station, we can calculate the drift rate with certain estimator by comparing the satellite time with the timescale on the ground. But when the satellite cannot contact with the station, the only available data is the intercomparison data , , and .

In order to evaluate the drift rate, there may be two methods. () Three-state Kalman filter can be used to evaluate the drift rate directly with the observation quantity , , and , which is similar to the Kalman timescale algorithm on the ground. () Two steps are needed. The first step is to predict , , and from , , and . And the second step is to evaluate drift rate with , , and (, , and are the prediction values of , , and ).

In the following, we will compare these two methods. The three-state Kalman filter method is called method 1, and the combination of two-state Kalman filter with is called method 2.

The basic Kalman fiter equations are as follows.

System equation is

Observation equation is

For the two-state Kalman filter,The system error covariance matrix can be expressed as [2628].

For the three-state Kalman filter,and we can get the system error covariance matrix according to [29].

To compare these two methods, we make use of Stable 32 to generate simulation data. The parameters of the phase data are shown in Table 13.

Firstly, we will show the prediction performance of both two-state and three-state Kalman filter for , , and ; they are similar, which can be shown in Figure 15.

For method 2, after we have got the prediction of , , and , we will use to evaluate the drift rate.

Figure 15(a) is the comparison of the clock difference and its prediction . Figure 15(b) compares the real time errors and . From Figure 15, we know that the prediction error    will increase gradually, while is unbiased. In fact, this phenomenon is inevitable owing to the lack of absolute standard reference. No matter which method we choose, the prediction error will increase with time going. That also indicates that we cannot get precise drift rate, but it does not mean we can do nothing. Next we will prove that method 1 can give an alarm when the drift rate of one clock is much larger than the best one. It should be noted that when we make use of method to evaluate drift rate for method 2, a sliding window is used to evaluate the drift rate in real time instead of batch processing.

Figures 16 and 17 show the drift-rate estimation of these two methods for three clocks. From the simulation result, we know that if the drift rate of one clock is much larger than the best clock, its drift-rate estimation is very close to its real value, and its computed estimation is much larger than the best clock, which enables method 1 to give an alarm. Besides, method 1 gives more precise and stable computation result compared to method 2.

3. Conclusion

This paper puts forward a set of Self-Monitoring Methods for common anomalies. We use PLL to realize Self-Monitoring for signal loss and phase and frequency jumping. Based on the measurement data from intercomparison among three clocks, Modified DAVAR is used to detect phase and frequency jumping and instantaneous deterioration; we use windowed overlapping Hadamard variance to evaluate clock stability in real time and the three-state Kalman filter for large drift-rate anomaly.

The method based on PLL has been proved effective and used in newest BeiDou satellite. And the other research on Self-Monitoring Method in this paper can be used in next generation navigation satellites after year 2019.

Competing Interests

The authors declare that they have no competing interests.