Abstract

The microwave devices are usually optimized by combining the precise model with global optimization algorithm. However, this method is time-consuming. In order to optimize the microwave devices rapidly, the knowledge-based neural network (KBNN) is used in this paper. Usually, the a priori knowledge of KBNN is obtained by the empirical formulas. Unfortunately, it is difficult to derive the corresponding formulas for the most electromagnetic problems, especially for complex electromagnetic problems; the formula derivation is almost impossible. We use precise mesh model of EM analysis as teaching signal and coarse mesh model as a priori knowledge to train the neural network (NN) by particle swarm optimization (PSO). The NN constructed by this method is simpler than traditional NN in structure which can replace precise model in optimization and reduce the computing time. The results of electromagnetic band-gap (EBG) structures optimally designed by this kind of KBNN achieve increase in the bandwidth and attenuation of the stopband and small passband ripple level which shows the advantages of the proposed KBNN method.

1. Introduction

The electromagnetic band-gap (EBG) [1, 2] is a kind of artificial periodic structure that prohibits the propagation of electromagnetic waves in certain frequency bands at microwave frequencies. Because of the unique feature of EBG structures, they have been applied to microwave circuits such as filters [3], power amplifiers [4], and antennas [5] to improve their performance.

Recently, neural network (NN) is widely used in microwave modeling and design [68] for its good learning ability and generalization. Usually, NN suffers a large number of samples to ensure the accuracy, which greatly increases the workload to establish NN. In existing papers, one of the effective ways to solve this problem is knowledge-based neural network (KBNN) [9]. The a priori knowledge of KBNN is always the empirical formula which contains the basic information about the microwave circuits but cannot achieve the required precision. There are different kinds of the knowledge-based structures, such as difference method [10], the a priori knowledge input (PKI) method [11], and KBNN [12]. In the existing methods, the a priori knowledge is obtained by empirical formula or NN [13, 14]. If empirical formula is considered as a priori knowledge, the cost of calculation can be negligible, but not all microwave devices have equivalent circuit; if the a priori knowledge is obtained by NN, the training of NN requires a large number of samples: both of them are flawed. In this paper, the a priori knowledge is obtained by coarse mesh model and is used as knowledge neurons in the hidden layers of NN. The advantage of this approach is that it can be widely applied to quick electromagnetic optimization even for the complex microwave devices. The method to build the KBNN model is given in the second section, including the neural network structure, acquisition of samples, and training method. In the third section, the KBNN is applied to tapered dual-plane EBG and papilionaceous dual-plane EBG which shows the feasibility of the method. Summary and discussion of future work are given in Section 4.

2. The Proposed KBNN

Assume that the structure of KBNN is , shown in Figure 1. Let represent input vector containing physical parameters of a microwave device, and let represent output vector of the KBNN. The number of knowledge-based neurons in the hidden layer is ; represent vector of the knowledge-based neurons values. The number of traditional hidden neurons is ; represent output vector of traditional hidden neurons. Because no empirical formula is used here, there is no connection between input neurons and knowledge-based neurons. In short, the network combines the features of DC model and KBNN model; a part of the hidden neurons is used to generate the a priori knowledge and the other part is same as the traditional hidden neurons which are used to generate the difference between the coarse and fine models.

Given input , the output can be computed bywhere is the weight between knowledge-based neurons and output layer, is the weight between traditional hidden neurons and output layer, and is threshold. can be computed bywhere is the weight between input layer and traditional hidden neurons; is an activation function; we choose the sigmoid function (the function’s gain ) here.

In the simulation examples, both substrates have low loss. So we can approximatively consider the relationship between and as follows:Because of this relationship, the number of knowledge-based neurons in hidden layer can be set from 0 to 2. When the number is 0, it means there is no knowledge-based neuron. is used as knowledge-based neuron when the number is 1; while it becomes two, both and are used.

The design parameters are generated by partial orthogonal experimental design to reduce the number of training samples. Orthogonal experiment design is a kind of design method to study multifactors and multilevels. It is based on orthogonality to select some representative points which is uniformly dispersed from the comprehensive experiment. In this paper, 6-factor and 5-level orthogonal table is used, shown in Table 1. In the table, A~F represent 6 design parameters and 1~5 represent the levels of parameters. There are 25 groups of design parameters. If the training samples are too few, the KBNN cannot accurately map the relationship between input and output.

When the design parameters are confirmed, the output of KBNN is obtained by HFSS. Generally, the accuracy of HFSS simulation depends on adaptive analysis parameters, which are maximum number of passes and maximum delta S. The maximum number of passes value is the maximum number of mesh refinement cycles that you would like HFSS to perform. And delta S is the magnitude of the change of S-parameters between two consecutive passes. They are stopping criterion for the adaptive solution. Usually, the maximum number of passes is 6 and maximum delta S is 0.02 which can get accurate results. In the paper, delta S is 0.3 in this case; that is, the maximum number of passes is 1 or 2 for coarse mesh model. When delta S is about 0.02, the maximum number of passes is 6, which can be considered as fine mesh model. So we can find that if the mesh is not coarse enough, the simulation results will be closer to the accurate results. However, its analysis time will be much longer. So we just consider that the maximum number of passes of model is 1 or 2 which is suitable for a priori knowledge. VBScript is used here to provide an interface between HFSS and MATLAB. Thus, the data exchange can be realized which makes the acquisition of samples more automated and concise.

Particle swarm optimization (PSO), which is a kind of global optimization method [15], is chosen to train the network which can effectively avoid the local optimum of the NN. A flowchart of the proposed KBNN is shown in Figure 2. The number of particles is 80 with 1000 iterations for training the KBNN. We can judge the accuracy of the network by calculating the mean absolute error (MAE) of the sample and network correlation coefficient (NCC):where represents the corresponding result of HFSS simulation, is the total number of data samples, is mean of the desired outputs, and is mean of network outputs. NCC is an important standard to measure the rationality of the network. If it is closer to 1, the network is more reasonable. Otherwise, the network needs to be trained again. According to Figure 2, we can establish the KBNN and use the KBNN to optimize microwave devices. A flowchart of the optimization by PSO is shown in Figure 3. The criterion of terminating the PSO for training KBNN and optimization is same. When the iteration reaches the maximum or the fitness meets the required error, the PSO can be terminated.

3. Simulation Examples of EBG Structures

3.1. Tapered Dual-Plane EBG Structure

The proposed KBNN is applied to a tapered dual-plane EBG [16], illustrated in Figure 4. As can be seen in Figure 4, the tapered dual-plane EBG consists of two single-plane EBG structures, one of which is a modulated microstrip line, while the other is a ground plane with etched circles. Between these two planes, there is a substrate with thickness , relative dielectric constant , and loss tangent . The distance between the centers of two adjacent circles is ; the radius of the circle from the middle to the side is , , and . The distance between the centers of two adjacent squares is ; the length of the square from the middle to the side is , , and . Before optimization, and . The frequency range is 1.7 GHz~4.3 GHz with 56 sampled points.

The simulated parameters of the tapered dual-plane EBG before optimization are shown in Figure 5. As can be seen in Figure 5, the tapered dual-plane EBG shows a −10 dB bandwidth of 4.9029 GHz with attenuation of 33.5691 dB at 9.6 GHz. The ripple level is 1.8247 dB in the lower passband and 6.3265 dB in the higher passband. To achieve maximum band-gap, the tapered dual-plane EBG is optimized by proposed KBNN.

First, we need to obtain the training samples to train the KBNN. The design parameter is . The ranges of them are as follows: . The input of the proposed KBNN is . As it is said above, we sample 25 groups of by partial orthogonal experimental design. Hence, there are totally training samples.

To some degree, the reliability of proposed KBNN depends on the number of hidden neurons and knowledge-based neurons. So we train the same test sample in different number of hidden neurons and knowledge-based neurons; the results are listed in Table 2.

In Table 2, 2K4H represents the fact that the number of hidden neurons is 4, in which the number of knowledge-based neurons is 2, and 1K5H represents the fact that the number of hidden neurons is 6, in which the number of knowledge-based neurons is 1, and so on. It can be seen from Table 2 that when the number of knowledge-based neurons is 2, the number of hidden neurons has little influence on the results. When the number of knowledge-based neurons is 1, NCC decreases from 0.9944 to 0.9933, and MAE increases from 0.8040 to 1.1050. When there is no knowledge-based neuron in hidden layer, the NN can be regarded as a PSO-NN. NCC drops to 0.9268 and MAE increases to 1.6115; the NN cannot map the relationship between input and output accurately. As a result, the existence of knowledge-based neurons can significantly improve the reliability of the network and make it easy to get the precise output.

We generate 5 different groups of test samples randomly and test on the 2K5H KBNN; the test results are shown in Table 3 and Figure 6. The KBNN is trained by PSO with 1000 iterations, while the 10-hidden-neuron multilayer perceptron (MLP) is trained with same training samples and 1500 iterations. As can be seen in Figure 6, the MAE of proposed KBNN is 0.8040 and the MAE of 10-hidden-neuron MLP is 1.6733. Compared with MLP, the result of proposed KBNN is closer to the HFSS. In a short, in case of ensuring the accuracy, the proposed KBNN needs less number of training samples and has good generalization ability.

Use the trained KBNN with 2K5H structure to optimize the tapered dual-plane EBG. In the process of optimization, the precise model is replaced with KBNN which can reduce the time of optimization. The PSO algorithm to optimize has 20 particles with 80 iterations. The output of KBNN is used to calculate the fitness. The fitness function is calculated bywhere is every point of frequency; is the output of KBNN in corresponding frequency. Finally, we get the optimized sizes:The fitness curve is shown in Figure 7 and the results of optimization are shown in Table 4 and Figure 8.

It can be seen from optimization results that, after optimization, the maximum attenuation is from 33.5691 to 38.8096 dB. The 10 dB bandwidth is 1.19 times wider than that before optimization. The ripple level is lowered from 1.8247 to 0.6878 dB in the lower passband and from 6.3265 to 4.0484 dB in the higher passband. This proves that it is effective to use the proposed KBNN as precise model to optimize the tapered dual-plane EBG.

The average analysis time of coarse mesh model is about 35 s, while the time of precise model is about 69 s. When both coarse mesh model and precise model are analyzed, the time is about 86 s. In the condition of 20 particles with 80 iterations, we can roughly calculate the total time of optimization by precise model in . The training process includes obtaining 25 sets training samples and training the KBNN. So the time of training process is calculated by , and the total time of optimization by KBNN is 58846 s. As can be seen from Table 5, although the proposed KBNN takes short time to obtain the training samples and train the KBNN, the total time of optimization by KBNN is almost half of the time of optimization by precise model. As a result, using the KBNN based on coarse mesh to optimize the tapered dual-plane EBG, to a great degree, shortens the time of optimization and makes it efficient.

3.2. Papilionaceous Dual-Plane EBG Structure

The proposed KBNN is applied to a papilionaceous dual-plane EBG [17], illustrated in Figure 9. As can be seen from Figure 9, the papilionaceous dual-plane EBG also consists of two single-plane EBG structures, one of which is six cells of papilionaceous patches, while the other is a ground plane with six etched rings. Between these two planes, there is a substrate with thickness , relative dielectric constant , and loss tangent . The distance between the centers of two cells papilionaceous patches is . The length of papilionaceous patch is . The widths of papilionaceous patch are, respectively, and . Before optimization, the inner and outer radii . The frequency range is 3 GHz~7.5 GHz with 46 sampled points.

The simulated parameters of the papilionaceous dual-plane EBG before optimization are shown in Figure 10. As can be seen from Figure 10, the papilionaceous dual-plane EBG shows a −10 dB bandwidth of 2.4553 GHz with attenuation of 32.5114 dB at 5.2 GHz. The ripple level is 3.648 dB in the lower passband and 5.3396 dB in the higher passband.

The design parameter of papilionaceous dual-plane EBG is . The subscript numbers are set from center to side. The relationship of them is , and , The ranges of them are as follows: The input of the proposed KBNN is . has 25 groups. Hence, there are totally training samples.

The results on same test sample of different number of hidden neurons and knowledge-based neurons are listed in Table 6.

From the table, we know that when the number of knowledge-based neurons is 2, the number of hidden neurons has little influence on the results. When the number of knowledge-based neurons is 1, NCC decreases from 0.9880 to 0.9870, and MAE increases from 0.9398 to 1.0683. When there is no knowledge-based neuron in hidden layer, the NCC drops to 0.7471 and MAE increases to 9.2719. We can draw same conclusions that the proposed KBNN makes the NN structure simpler and has easy access to accuracy value.

The test results on 5 different test samples of 2K7H KBNN are shown in Table 7 and Figure 10. As can be seen from Figure 11, the MAE of the proposed KBNN is 0.9398 and the MAE of 10-hidden-neuron MLP is 1.2488. Similarly, the result of the proposed KBNN is closer to the HFSS compared with the MLP NN. The modeling of the papilionaceous dual-plane EBG shows same conclusion with tapered dual-plane EBG which verifies superiority of the proposed KBNN.

Same with the optimization of tapered dual-plane EBG, the papilionaceous dual-plane EBG is optimized by exploiting the trained KBNN with 2K7H structure. The fitness function is calculated by

The optimized size is . The fitness curve is shown in Figure 12 and the results of optimization are shown in Figure 13 and Table 8.

It can be seen from the optimization results that, after optimization, the maximum attenuation is from 32.5144 to 39.7025 dB. The 10 dB bandwidth is 1.268 times wider than that before optimization. The ripple level is significantly lowered from 3.6480 to 1.6889 dB in the lower passband and from 5.3396 to 2.0679 dB in the higher passband. In this example, the proposed KBNN is also proven to be a good way to optimize the papilionaceous dual-plane EBG.

The average analysis time of coarse mesh model is about 23.8 s, while the time of precise model is about 38.5 s. When both coarse mesh model and precise model are analyzed, the time is about 58 s. Similarly, we can calculate the total time of optimization by two ways, shown in Table 9. As can be seen from Table 9, using the KBNN based on coarse mesh to optimize the papilionaceous dual-plane EBG also shortens the time of optimization and makes it easy to optimize.

4. Conclusion

In this paper, we propose a new way to obtain the a priori knowledge by coarse mesh model which avoids complicated derivation of the formulas or the cost of obtaining a large number of samples. This method is also applied to those microwave devices which do not have empirical formulas. The modeling results of tapered dual-plane EBG and papilionaceous dual-plane EBG show that the knowledge-based neurons in the hidden layer can significantly reduce the number of hidden neurons which makes the structure simpler and the test results are in good accordance with the results of HFSS simulation which shows the strong generalization ability of the proposed KBNN. The optimizations of above examples indicate that using the proposed KBNN to optimize the microwave devices is feasible which can obviously reduce the optimization time. In other words, the proposed KBNN has good value in the optimization of microwave devices. The other way to obtain the a priori knowledge will be investigated in the future work and the new structure of KBNN will also be considered.

Competing Interests

The authors declare that they have no competing interests.

Acknowledgments

This work is supported by Key Research and Development Program Project of Social Development in Jiangsu Province, China (no. BE2016723).