Abstract

To address the severe spectrum scarcity problem and achieve efficient green communications, we propose a new and practical scheme to obtain electromagnetic data (ED) and a composite electromagnetic map reconstruction method (CEMRM). The scheme uses a small number of sensing nodes to obtain incomplete sampled ED, then uses CEMRM to reconstruct complete ED according to the incomplete ED and builds realistic electromagnetic maps (EMs) in various propagation scenarios. Specifically, we firstly adopt kriging (KG) method to obtain the geography-based ED (GED) according to geographical correlation of the locations of the sampled ED. Meanwhile, a novel algorithm, named filtered subdistrict sparsity adaptive matching pursuit (FSSAMP), is proposed to obtain the pure ED (PED) according to electromagnetic correlation of the sampled ED. Then, weight factors are mapped into the above two types of data and the fast gradient projection (FGP) method is employed to obtain the highly accurate combined ED (CED). Based on the CED, the accurate and practical EMs can be drawn. Simulation results demonstrate that the proposed scheme can provide more accurate ED and EMs than existing benchmark schemes in various propagation scenarios, and the built EMs can provide accurate information for the assessment of spectrum resources utilization to make spectrum resources efficiently used.

1. Introduction

The rapid development of wireless applications and blossoming growth of traffic demand lead to severe spectrum scarcity problem. To achieve efficient application of spectrum resources and green communications, it is necessary to obtain electromagnetic data (ED) first and then assess the spectrum resources utilization so as to adopt spectrum resources allocation method. In the existing methods to obtain ED, manual measurements cost highly especially in a large-scale area. Employing lots of human beings to obtain the ED is not realistic, and the obtained ED may be not accurate due to the human subjectivity. With the development of automation technology, sensing nodes replace human beings to obtain the ED [1]. However, the cost and effect are relative to the number of sensing nodes. If sensing nodes are deployed redundantly, the cost is high but accurate and complete ED can be obtained. If there are a small number of available sensing nodes deployed, the cost is greatly less than that when sensing nodes are deployed redundantly, and the obtained ED are accurate but incomplete [2]. However, the incomplete ED cannot provide sufficient information for assessing the usage of spectrum resources. The completeness degree of ED and cost are both proportional to the number of sensing nodes in a certain range. The mentioned three types of measurement methods cannot make a balance between the cost and effect. Therefore, it is necessary to study an effective and low-cost method to obtain ED.

An electromagnetic map (EM) [3] is a promising tool for assessing the usage of spectrum resources, which contains lots of information about electromagnetic spectrum resources. The EM can make spectrum resources efficiently used, which can solve the spectrum scarcity problem and achieve efficient green communications. The existing works on EMs are mainly about the constructing methods of radio environment maps (REMs), whose methods can be classified into three categories, which are called indirect methods, direct methods, and hybrid methods. Indirect methods are mainly based on propagation models or radiation geographical positions, direct methods are mainly interpolation methods, and hybrid methods are the combination of indirect methods and direct methods.

In indirect methods, the authors of [4] proved that the propagation model can be used to construct the REM. ED of unsampled positions can be inferred from the known or estimated radiation source locations and the propagation model, and it is the main method used by the indirect methods to obtain the ED. However, the single electromagnetic propagation model cannot fit the dynamic electromagnetic environment, and the selection of relevant parameters is difficult without relevant information of electromagnetic environment, so the effect of the method based on the propagation model cannot be ensured in the dynamic and noncooperative electromagnetic environment [5, 6].

On the other hand, the direct methods need no information about electromagnetic environment especially propagation models, and they are used to reconstruct the ED according to the geographical correlation of the sampled positions. The kriging (KG) [7] is the mostly used algorithm, and its accuracy is satisfactory [8]. However, the accuracy of KG is influenced by the outliers [9], and the calculation complexity is high [10]. Different from KG, other interpolation methods are used to construct the REM, such as the local polynomial (LP) algorithm [11], nearest neighbour (NN) algorithm [12], and inverse distance weighting (IDW) algorithm [13]. However, LP is sensitive to neighbourhood distance, and a small searching neighbourhood distance will cause an empty hole in the data reconstruction. The data reconstruction result of NN is not continuous. IDW is sensitive to isolating outliers, and a lot of isolating outliers cause downfall in the accuracy when IDW is used in large-scale data reconstruction. Generally, the REM construction effect of existing interpolation methods is relatively lower when the number of sensing nodes is small [8].

In hybrid methods, the mostly used combination scheme is to combine the KG and propagation models [14, 15]. In fact, the combination of a single propagation model and KG is suitable for the special scenario, and the accuracy is much better than KG and the propagation model. However, the effect of the method is relatively lower in scenarios of other kinds of propagation models, and the methods also need information about propagation models in a certain content.

In the case of noncooperation, the information of the electromagnetic environment like propagation models cannot be obtained, even though some methods like maximum likelihood estimation [16] can be used to estimate the information, the error is not acceptable; moreover, the method based on a single propagation model cannot do well in scenarios of different propagation models. In the case that a small number of sensing nodes are available, the interpolation methods cannot obtain good results, and the accuracy needs improvement. As for the hybrid methods, these kinds of methods do badly in the case of noncooperation or the case that a small number of sensing nodes are available, and the method has high complexity. In order to achieve the reconstruction of EMs in the case of noncooperation and the case that a small number of sensing nodes are available, it is necessary to study an effective method.

To address the above challenges, this paper proposed a novel scheme to obtain accurate ED, reconstruct the practical electromagnetic signal map (which we refer to as an EM), and take the reference signal receiving power (RSRP) as the research object of EMs.

The contributions of the paper are summarized as follows: (1)To achieve ED reconstruction, this paper proposes an improved sparsity adaptive matching pursuit (SAMP) algorithm, which reconstructs ED based on the electromagnetic correlation of the sampled ED. Besides, the processing time of the proposed improved SAMP is much less than that of the traditional SAMP algorithm(2)To achieve the reconstruction of EMs in the noncooperation situation with a small number of sensing nodes, this paper proposes a composite EM reconstruction method (CEMRM) to obtain the complete and accurate ED and reconstruct realistic EMs. Simulation results show that the obtained EMs reconstructed by the CEMRM are more accurate than those reconstructed by existing methods

The organization of this paper is as follows. Section 2 describes the system model of the proposed scheme. Section 3 presents algorithm description about the proposed method. In Section 4, simulation results are presented. Finally, the paper draws the conclusions in Section 5.

2. System Model

Assuming that there are three base stations in the target area as shown in Figure 1, a small number of sensing nodes are deployed randomly in the target area, and the ratio of sensing nodes to the total number of ED is 2% (smaller than that in existing works [17]). First of all, the target area is divided into grids to ascertain the total number of ED and obtain the corresponding geographic positions. The values of ED are RSRP values. We just consider the electromagnetic datum in each grid center. When reconstructing a large-area EM, the total number of ED is huge. If we study the relation between the change in the specific quantities of sensing nodes and the accuracy of reconstructed EMs, the workload is huge and results are not accurate. However, if we study the relation between the change in the ratio of sensing nodes to the total number of ED and the accuracy of reconstructed EMs, the useless work can be decreased and the results are accurate. Therefore, this paper uses the ratio to describe the number of sensing nodes instead of the specific quantities. The sampled ED must be incomplete. After sampling the ED [18, 19], the incomplete data are sent to the data processing center, where CEMRM is adopted to reconstruct the precise EMs. The data processing center assesses the usage of spectrum resources according to the reconstructed EMs and then controls the working status of the base stations with the adopted spectrum resources allocation method. If the data processing center breaks down, the calculating task will be undertaken by the distributed calculating platform in the sensing nodes.

3. Algorithm Description

This section proposes the CEMRM to achieve ED reconstruction, which is shown in Figure 2.

Specifically, CEMRM consists of four parts. In the first part, KG is used to reconstruct the ED according to the incomplete data obtained by sensing nodes, and the obtained geography-based ED (GED) contain strong geographical correlation. At the same time, the filtered subdistric sparsity adaptive matching pursuit (FSSAMP) is used to reconstruct the ED according to the incomplete data obtained by sensing nodes, and the obtained pure ED (PED) contain strong electromagnetic correlation. The ED are affected by geography just in a certain extent, so the GED obtained by KG can be more accurate after being processed with the electromagnetic correlation. In order to make obtained ED containing both geographical and electromagnetic correlation, weight allocation is used to balance the two kinds of correlation.

After weight allocation, the primary ED, is obtained. The fast gradient projection (FGP) is an algorithm to improve the data accuracy according to the correlation of data, so the FGP is promising to help improve the accuracy of the primary ED according to the weight allocation of the geographical and electromagnetic correlation. After with strong correlation being processed by the FGP, more accurate combined ED (CED) can be obtained. Finally, the EM is drawn with isomagnetic lines according to the CED.

The proposed CEMRM needs no electromagnetic information about the electromagnetic environment, and it just needs a small number of sensing nodes. With the help of electromagnetic correlation extracted by the FSSAMP, and accuracy improvement by the FGP, it can overcome the weakness that KG is sensitive to outliers and obtain realistic results.

3.1. Kriging

KG is an accurate interpolation algorithm and can reconstruct ED according to geographical correlation, and the obtained ED contain geographical correlation. The principal of KG to predict the RSRP is given by [7]. where is the value of the RSRP, and is the location. is the vector containing selected functions that consist of the regression model. is the parameter vector of the model. is a stochastic process satisfying the following conditions: where is an operation that calculates the expectation, , are both input locations, is the correlation model, is the variance of , and is the parameter of the correlation model. The algorithm defines as a set of sampling point locations whose value is . Then, where and are the predicted values of the corresponding variables and . is a matrix containing , for . is a correlation matrix of the sampling points containing . The correlation function is defined as follows: where is the dimension of sampling points. Note that the following condition should be ensured: where calculates the determinant of a matrix. Here, the spherical model is adopted. where . Then, the predicted RSRP of the unsampled location is shown as follows:

The complete ED obtained by the KG are , where is the union of .

3.2. Improved Sparsity Adaptive Matching Pursuit

The procedure of sampling ED with sensing nodes can be described as where is the sampled ED containing non-zero values, is the real complete ED, , and need to be met. Here, is the observation matrix. where is the total number of ED.

Since it is difficult to determine directly, the problem is converted to where is the sensing matrix, is the sparse base matrix, is calculating the -norm of a matrix, and is the projection result of in the sparse base matrix , which is given by where .

In order to solve Eq. (11), we improve the SAMP [20]. Firstly, the target area is divided into four parts, which are numbered as , and a certain number of sensing nodes are randomly deployed in four subareas, respectively. The number of sensing nodes in each subarea is same. Then, the algorithm reconstructs ED of each part parallelly or serially, and then they are integrated into a set of complete ED. Finally, the obtained complete ED are filtered by a median filter. The improved algorithm is summarized as follows.

Needed parameter: the sensing matrix , sampling ED of the subarea, the step size , which is the number of subareas, and the observation matrix that being changed into the column vector format .

Step 1. Initialize parameters, the residual error , index matrix , the element number , , is the iteration, is the th column of , stage index , .

Step 2. is calculated, and the algorithm chooses maximum values, and obtains a union set containing the column codes of chosen values in .

Step 3. , .

Step 4. Least squares solution .

Step 5. The elements with the biggest absolute values are chosen from , and the corresponding column vectors of the chosen values in make up , the corresponding column codes of the chosen values are.

Step 6. The new residual error is updated. , .

Step 7. If which is in the original algorithm, , and do Step 8 directly. If , , , ,, and return Step 2; at the same time, if the algorithm reaches the maximum iterations, and do Step 8. Or , , and return Step 2.

Step 8. Update , the values whose positions are in is .

Step 9. The corresponding ED of the sub area are , is the column vector made up of 1.

Step 10. A set of complete ED are obtained by data fusion, which are described as , . The complete ED obtained by the FSSAMP are , where means the union of the data, is the ED of the corresponding subarea, calculates the 2-norm of a matrix, and is the median filter.

The median filtering is a kind of nonlinear filtering, and its principle is like where is set as the median value of , , , , , , , , and ; the values of ~ are known.

The median filtering can protect the edge information of ED and reduce the influence of isolating outliers, which can help to obtain more accurate results.

3.3. Fast Gradient Projection

The FGP is a good method using the data correlation to increase the precision of data. After the data obtained by Eq. (1) being processed by the FGP, more precise data can be obtained. The accuracy improvement of the FGP is described as a minimum optimization problem, which satisfies

represents calculating the 2-norm of the matrix, is the ED obtained by weight allocation, is more precise ED which is the final result, and is the total variation regularizer of , is the regularization parameter. The dimensions of and are both .

In order to solve Eq. (14), we adopt the FGP, which is proposed in [21]. The procedure is given as follows.

First of all, parameters are initialized as , and .

Then, it is an iterative procedure, in the step , the algorithm calculates where represents the total number of iterations; represents the in the th iteration; represents the in the th iteration; represents the in the th iteration; and represents the in the th iteration.

Finally, the more precise result can be obtained. where and are obtained in the final iteration.

As a set of matrix pairs, consists of . The dimension of is , is , and they satisfy where is an operation that satisfies , and . is an operation that satisfies , where .

means doing an orthogonal operation on the set , and the meaning of is similar to . For example, if , the calculation is where is defined as

4. Simulation Results

4.1. Indicator Description

We use four indicators to demonstrate advantages of the proposed method, which are the root mean square error (RMSE), determination coefficient , robustness, and quality of the reconstructed EM (QoM).

RMSE represents the average error between the reconstructed ED and real ED. If the RMSE is small, the result is accurate. Equation (20) shows the calculating procedure of RMSE. represents the similarity of the data distribution between the reconstructed ED and real ED. If the value is high, the data distribution of the reconstructed ED is similar to that of the real ED. That is, the reconstructed EMs are realistic. Equation (21) shows the calculating procedure of . What we study is the EM reconstruction with a small number of sensing nodes. Therefore, if the available sensing nodes decrease, the effect of the method should not decrease too much. That is, the method needs a good robustness. The robustness is defined as the changing range of the RMSE when the ratio of sensing nodes to the total number of ED decreases from 10% to 2%. QoM is mainly judged according to the bull’s eye, the similarity of isomagnetic lines between the reconstructed result and the real EM. The RMSE and are given by where is the average value of the real ED, is the th real electromagnetic datum, is the corresponding reconstructed electromagnetic datum, is the residual sum of squares, and is the total sum of squares.

4.2. Results and Analyses

Three base stations are arranged in an area of , and the reference signal receiving power data of the long-term evolution (LTE) mobile communication network in this area are used as the experimental data. The data are generated by the currently highly recognized software Atoll, and the landform of Brussels are taken into account. The Okumura-Hata model, Longley-Rice model, and Erceg-Greenstein model are adopted as the propagation models. The frequency of electromagnetic signal is 2110 MHz, and the bandwidth is 10 MHz. The equivalent antenna height of base stations is set to 30 m, and that of detection equipments is 1.5 m. The extended matrix of the Longley-Rice model is set to 5500 m, and that of other models is set to 7000 m. All main matrices of three models are set to 4000 m. Other settings remain the original settings. The total number of the ED is 40000. The area is divided into four parts, and 200 low-cost sensing nodes are randomly deployed in each part, the ratio of sensing nodes to the total number of ED is 2%. The experimental platform is based on core i9, the software used is MATLAB R2020b, Atoll 2.8.0, Surfer 14.

If the terrain of the target area doesn't change drastically, the Okumura-Hata model is applicable. If the terrain of the target area changes drastically, the Longley-Rice model is applicable; the Erceg-Greenstein model is applicable for scenarios with shadowing. To obtain convincing results and decrease the redundant work, the abovementioned models are used.

Table 1 gives the average running time of the proposed FSSAMP and SAMP. It is seen that the average running time of FSSAMP is 7.08% accounting for that of SAMP, the proposed FSSAMP is faster than SAMP.

In order to prove the advantages of the proposed CEMRM, we adopt the modified shepard’s method (MSM), KG, IDW, LP, and NN to reconstruct the EMs. What we study is the EM reconstruction in the noncooperation environment, the information of electromagnetic environment, especially the propagation models, cannot be obtained, so the indirect methods are not achievable.

Figures 35 show the reconstruction EM results in the different scenarios of three typical propagation models. From the three figures, it is seen that the results of IDW have lots of bull’s eyes, the isomagnetic lines are greatly different from those of the real EMs; the results of MSM have fewer bull’s eyes than those of IDW, there are two big white holes in the result; the results of NN have few bull’s eyes, but the results are not continuous, which are different from the real EMs; the results of LP have few bull’s eyes, but the results contain little changing information of electromagnetic wave, which are different from the real EMs; the results of KG have few bull’s eyes, but the results cannot express the information at the locations close to base stations; the results of CEMRM have few bull’s eyes, the results can express the information at the locations close to base stations, the results of CEMRM are more similar to real EMs than KG. To sum up, the QoM of CEMRM is better than that of any other algorithm mentioned above.

Figure 6 shows the changing situation in RMSE when the ratio of sensing nodes to the total number of ED decreases from 10% to 2%. It is obviously that the proposed CEMRM has the lowest RMSE in all situations. The average changing range in RMSE of the proposed CEMRM is 0.4264, the second smallest. The average changing range in RMSE of KG is 0.3887, which is the smallest. Compared with KG, the change of the proposed CEMRM in RMSE is more stable than that of KG. After comprehensive consideration, the proposed CEMRM has the better robustness than any other five algorithms.

Table 2 gives the result of between the reconstructed ED and real ED, and the ratio of sensing nodes to the total number of ED is 2%. From Table 2, it can be clearly seen that the of the results obtained by CEMRM is the largest, so the distribution of the reconstructed ED obtained by CEMRM is the most similar to the real ED.

From the comprehensive consideration of RMSE, , robustness, QoM, it can be concluded that the effect of CEMRM is better than those of the existing KG, IDW, NN, LP, and MSM. The proposed CEMRM is promising to achieve the efficient application of spectrum resources and green communications in various fields [23, 24].

5. Conclusions

In order to obtain the complete and accurate ED with limited sensing nodes in the noncooperation situation, we proposed a new and practical scheme using EMs and a small number of sensing nodes, and an accurate EM reconstruction method, called CEMRM. After obtaining the incomplete sampled ED from sensing nodes, the proposed CEMRM can reconstruct the complete and accurate ED and build realistic EMs. The required minimum number of sensing nodes accounting for the total number of ED is just 2%. Compared to the existing methods of reconstructing the EMs, the proposed CEMRM remains better robustness against the decreasing of sensing nodes and obtains more accurate ED and EMs. The proposed scheme makes a good trade-off between the number of sensing nodes and accuracy of the obtained ED, and the obtained EMs can provide great support for assessing spectrum resources utilization. What we studied is helpful for green communications, high-efficiency spectrum resources utilization, and so on.

Data Availability

There are no available data.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This work was supported in part by the National Natural Science Foundation of China under grant 61971473.