Abstract

In this paper, a parallel group detection (PGD) algorithm is proposed in order to address the degradation in the bit error rate (BER) performance of linear detectors when they are used in high-load massive MIMO systems. The algorithm is constructed by converting the equivalent extended massive MIMO system into two subsystems, which can be simultaneously detected by the classical detection procedures. Then, using the PGD and the classical ZF as well as the QR-decomposition- (QRD-) based detectors, we proposed two new detectors, called ZF-based PGD (ZF-PGD) and QRD-based PGD (QRD-PGD). The PGD is further combined with the sorted longest basis (SLB) algorithm to make the signal recovery more accurate, thereby resulting in two new detectors, namely, the ZF-PGD-SLB and the QRD-PGD-SLB. Various complexity evaluations and simulations prove that the proposed detectors can significantly improve the BER performance compared to their classical linear and QRD counterparts with the practical complexity levels. Hence, our proposed detectors can be used as efficient means of estimating the transmitted signals in high-load massive MIMO systems.

1. Introduction

In recent years, massive multiple-input-multiple-output (massive MIMO) systems have been proposed to improve the quality of signal transmission in wireless communications. In a massive MIMO system, each cell site is equipped with very large number of antennas. Therefore, massive MIMO systems can provide not only high energy efficiency but also very high spectral one [1, 2]. Currently, the system with 128 antennas deployed at the BS, which simultaneously serves 8 single-antenna users, has been built successfully in laboratory [3]. Consequently, massive MIMO is expected to be one of the most important technologies for next generation cellular networks.

In massive MIMO, all complex signal processing, including signal detection for the uplink, precoding for the downlink, and channel estimation for both, should be implemented at the BS due to large dimension of the system [1]. For uplink scenario, all active users transmit their signals to the BS using the same time-frequency resources. These transmitted signal symbols are recovered at the BS by adopting suitable detectors. The detectors used in massive MIMO systems must satisfy the following requirements: (1) they should provide good BER performance or high spectral efficiency and (2) they should have low complexities. Low complexity linear detectors, such as zero-forcing (ZF) [47] or minimum mean square error (MMSE), can provide near-optimal bit error rate (BER) performance when they are used in massive MIMO systems [1]. In [8], the authors proved that if the system uses the Bell Laboratory Space Time (BLAST) detector, it will obtain a huge energy efficiency compared to that of the classical MMSE.

It is worth noting that the BER performance of the system depends on the so-called load factor β, defined by the ratio of the total number of antennas equipped at the users’ side and the ones at the BS. In [9], Björnson et al. declared that there exists no specific value of system’s load factor and the system can be defined unconventionally with arbitrary number of antennas and users. This means, massive MIMO can be defined as either a high-load system (i.e., ) or a low-load one (i.e. ). When β is high, the BER performance given by linear detectors presented in [1] becomes worse. Thus, they are not suitable detectors for signal recovery in high-load scenarios. In spite of the fact that it can provide high BER performance, the BLAST detector in [8] is also not applicable to massive MIMO systems with hundreds of antennas due to its high complexity.

In order to improve BER performance of linear detectors in high-load massive MIMO systems, Post Detection Sparse Error Recovery (PDSR) algorithm [10] can be applied. Nevertheless, this algorithm only uses binary phase shift keying (BPSK) or quadrature phase shift keying (QBSK). As a result, the overall system throughput is limited. Another way to improve BER performance of the system is to artificially reduce load factor of the system. In [11], Nguyen et al. proposed group detection (GD) algorithm and the efficient low complexity detectors called the ZF-GD and ZF-IGD. The algorithm is built by dividing a high-load massive MIMO system to two separate subsystems with smaller load factors. As a consequence, these detectors significantly outperform their classical ZF counterpart. Unfortunately, in full-load systems (), they underperform than the classical MMSE detector, thereby making them less attractive.

In this paper, we develop our idea called the parallel group detection (PGD) algorithm in [12] for high-load massive MIMO systems. The algorithm is built by dividing the equivalent extended form of high-load massive MIMO system into two smaller load subsystems, which can be detected in parallel by utilizing the classical detectors. Based on the PGD, the classical ZF, and the QRD detectors, we are able to create two new detectors, namely, ZF-PGD and QRD-PGD. In order to improve the accuracy of the signal detection process, the shortest longest basis (SLB) algorithm [13], an efficient lattice reduction technique, is further combined with the PGD to generate two other new detectors, called ZF-PGD-SLB and QRD-PGD-SLB. The empirical simulations and complexity evaluations show that the proposed detectors can significantly improve the BER performance of massive MIMO systems compared to those of the classical linear detectors and the QRD while their complexities are at the practical levels. Consequently, they are the good candidates for signal recovery in high-load massive MIMO systems.

: denotes set of complex numbers; is the slicing operation, which slices the received signals to the nearest values in the set of integer numbers corresponding to the QAM constellation; is a identity matrix, and is a one vector, whose elements are all 1; and are the transpose and Hermitian transpose operations; and denote the expectation operation and Kronecker product, respectively; and is the pseudoinverse of matrix and is the round operation.

Let us consider an uplink scenario of a single-cell massive MIMO system, as depicted in Figure 1. In the system, the BS with antennas, located at the cell’s origin, communicates with K multiple-antenna users using the same frequency and time resources. Each user has antennas. In order to obtain high spectral efficiency, each user is assumed to use spatial division multiplexing (SDM) scheme. Using this scheme, the serial data stream of each user is first converted into substreams, and then they are transmitted simultaneously by this user’s antennas to the BS. Under the above assumptions, the overall transmit signal vector from K users, , can be expressed aswhere is the transmit signal vector of the ith user satisfying and is the average energy of -QAM signals.

The received vector at the BS, can be modeled as follows:where denotes the transmit power of each user; and are, respectively, small-scale fading channel matrix and noise vector, whose entries are independent identically distributed (i.i.d) random variables with zero mean and unit variance; and is a diagonal matrix including the large-scale fading coefficients. It is reasonable to assume that the large-scale fading coefficients are the same for one user and different from those of the others because the distances between the antenna elements are very much smaller than those from the users to the BS. The large-scale fading coefficient corresponding to ith user is given by [14]where is the ith diagonal element of ; is a random variable, representing the shadowing, with zero mean and variance ; and , respectively, denote the distance from the ith user to the BS and the reference distance; and γ is the path loss factor.

For the sake of simplicity, let us define . Then, the system model in (2) can be rewritten asor equivalently aswhere , , and , respectively, represent the received signal vector, the channel matrix, and the noise vector of the extended massive MIMO system [15]. Normally, the receiver needs to get the channel state information (CSI), i.e., to estimate the channel matrix , through the adoption of some estimator, e.g., the MMSE estimator. In this paper, it is assumed that the CSI is perfectly known at the BS for the sake of simplicity.

3. Linear Detection and Its Drawbacks

Linear detectors, such as the ZF or the MMSE, have low complexity. Therefore, it is suitable for signal detection in massive MIMO systems. When the load factor of the system is sufficiently small (i.e., ), the BER performances of the linear detectors become near optimal [1]. However, as the load factor approaches unit, performance of the linear detectors degrades noticeably. By applying the ZF detector to the original system in (4), all N transmitted symbols from K users can be simultaneously detected as follows:

The weighted vector is then sliced to obtain the final output of the ZF detector as

It is easy to observe from (6) that there is an error vector in estimation given by . Therefore, the error covariance matrix is determined as follows:

The average mean square error (MSE) occurred when recovering one symbol can be determined by

Shown in Figure 2 is the empirical cumulative distribution function (ECDF) of the realized with iterations when antennas, or equivalently, the load factor ). The large-scale fading coefficients are determined by setting the SNR dB (where is the noise power),  m,  m,  dB, and . The results in Figure 2 clearly show that the MSE significantly reduces as the load factor gets smaller. In the worse case, when or , the MSE becomes very large. This indicates that the BER performance of the system will severely degrade. The reasons of this phenomenon can be two-fold: (1) the ZF detector suffers from the noise amplification effect and (2) the diversity order provided by the ZF detector reduces from (for ) to 1.

It is remarkable that when the ZF detection procedures are utilized on extended system in (5), the resultant detector is called the MMSE detector. Therefore, MMSE detection and its MSE can be extended from the ZF ones by substituting channel matrix, , by its extended form, .

4. Proposed Detectors with Parallel Group Detection

4.1. Parallel Group Detection Approach

In order to address the problem of performance degradation in very high-load massive MIMO systems, we propose the so-called parallel group detection (PGD) algorithm. In the proposed approach, a massive MIMO system is divided into several parallel subsystems with lower load, which are then detected separately. As a result, the overall system performance can be noticeably improved. In general, the number of parallel branches can be from 2 to N. Nevertheless, the more the number of subsystems generated, the higher the detection complexity required. Therefore, in this paper, a system with two branches, i.e., two subsystems, is considered to keep the complexity at a reasonable level.

Figure 3 shows the detection scheme using the proposed PGD algorithm when number of parallel branches equals 2. The transmit signal vector is estimated by the PGD algorithm through the following steps:Step 1: convert the massive MIMO system into its equivalently extended form, as in (5)Step 2: generate 2 subsystems in parallelStep 3: estimate the transmitted subvectors (i.e., and ) by applying some classical detector on each subsystemStep 4: rearrange the estimated subvectors to get the estimated signal vector as

In this section, Step 2 is described in detail. Steps 3 and 4 are presented in the following sections.

First of all, we rewrite the extended massive MIMO system in (5) aswhere and are the subchannel matrices obtained by, respectively, taking first L and the remaining columns of the extended channel matrix . Similarly, and denote the transmitted subvectors generated by taking the first L and the remaining elements of the transmitted signal vector . The value of L should be selected in such a way that the PGD-based detectors meet the following requirements: (1) the BER performances of users belonging to different groups are the nearly same and (2) the total complexities of proposed detectors get as low as possible. In [11], the authors pointed out that the group detection algorithm (GD) is able to achieve the best BER performance at the lowest possible computational cost when the sizes of both subsystems are identical. Following the method in [11], we can find out that is the best value of L that satisfies both of the above requirements. Therefore, throughout the paper, we select as the optimal value.

In order to generate the first subsystem, the interference term in (10) needs to be eliminated. Let be the projection term. Then, multiplying both sides of (10) by , we get

Now, define , , as received vector, channel matrix, and noise vector of the first subsystem; then, (11) is further rewritten as

Similarly, the second subsystem is generated in parallel with the first one as follows:where , and .

It is noteworthy that the load factor given by each subsystem is , which is a half of that of the original system. Therefore, the BER performance of the system is expected to improve as the PGD algorithm is adopted. The PGD algorithm is summarized as in Algorithm 1.

Input: , , K,
Output:
(1)Convert the system to equivalent extended form with ; .
(2)Set and define .
(3)Compute and .
(4)Compute received vectors and channel matrices of the two parallel subsystems as , , , .
(5)Estimate both transmitted subvectors , by applying a suitable subdetector to the kth subsystem having , .
(6)Rearrange the recovered subvectors to get the overall estimated vector as .
4.2. Proposed ZF-PGD and QRD-PGD Detectors

Theoretically, any classical detectors can be used as subdetectors in the PGD algorithm to estimate and . However, low-complexity detectors should be adopted to keep the complexity at practical levels. In this section, we adopt the classical QRD and the ZF in the PGD to generate two new detectors, called the ZF-PGD and QRD-PGD, as described below.

4.2.1. ZF-PGD Detector

In the ZF-PGD algorithm, are estimated by the ZF detection procedure to both subsystems in the PGD as follows:

Similarly to the ZF detector, the error covariance matrices and MSEs for each branch in the ZF-PGD can be determined, respectively, as follows:

Shown in Figure 4 is the ECDF of the classical linear detectors and the ones based on PGD algorithm when the parameters are set almost the same as in Figure 2 except antennas. The illustration shows that the MSEs of the two subsystems detected by the ZF-PGD are equal to each other and much smaller than that of the classical ZF detector. They are even comparable with the MSE of the MMSE detector. This implies that the ZF-PGD can improve the system performance significantly compared to the ZF one.

4.2.2. QRD-PGD Detector

In order to further improve the BER performance of the PGD algorithm, the conventional QRD detector is applied to the subsystems to generate the QRD-PGD detector as follows.

First, using the QR decomposition to decompose subchannel matrices, , we getwhere is unitary matrices satisfying that and be upper triangle matrices, which can be presented in the matrix form asfollows:where denotes the ith row and jth column entry of . Next, multiplying both sides of (17) by , we obtain

Finally, ignoring the noise term in (19), all entries of (denoted by , ) are determined layer by layer using the following rule:where is ith entry of and is the sliced value of .

Note that the BER performance given by the QRD-PGD is usually better than that of its ZF-PGD counterpart due to the advantage of the successive interference cancelation (SIC) technique. However, it suffers from the larger delay of the SIC technique. The higher the dimensions of the system are, the higher delay the QRD-PGD suffers. Therefore, the QRD-PGD detector is suitable to be used in small- or medium-size systems.

5. Proposed ZF-PGD-SLB and QRD-PGD-SLB Detectors

5.1. Lattice Reduction-Aided PGD Detection Approach

In this section, a new detection approach, called PGD-LR, is considered by combining the lattice reduction (LR) techniques with the PGD to further enhance the BER performance of massive MIMO systems.

First of all, the subsystems, , are converted into the ones in LR domain as follows:where , is the channel matrix in the LR domain and is a unimodular matrix (i.e., ), whose entries are integer numbers. and are determined by applying LR techniques on the channel matrix .

Next, we apply some classical detectors to (21) to estimate the transmitted vector in LR domain, . It is worth noting that the entries of must be selected from a set of consecutive integer numbers. Therefore, if the signal is modulated by -QAM method, it needs to be converted to the set of consecutive integer number by the shift-and-scaling operation. Let us define , where . Then, the shift-and-scaling transmitted signal vector is , thereby leading to the shift-and-scaling signal vector in the LR as follows:

Therefore, the hard decision of is given by

Once is determined, the estimated signal subvector, , is obtained as follows:

Finally, is sliced to get the final output of subsystem as .

5.2. Proposed ZF-PGD-SLB and QRD-PGD-SLB Detectors

In practice, and can be carried out by utilizing LR techniques such as Lenstra–Lenstra–Lovasz (LLL) algorithm [16], Seysen algorithm (SA) [17], or element-based lattice reduction (ELR) [13] on the subchannel matrix, . Among these LR techniques, ELR [13] is the most suitable to be used in massive MIMO systems because of its high performance and low complexity. The ELR finds out and matrices from by minimizing the diagonal entries of error covariance matrix in (15). This technique includes two versions called shortest longest vector (SLV) algorithm and shortest longest basis (SLB). While the SLB minimizes all diagonal entries of , the SLV only investigates the maximum one. As a sequence, BER performance given by the SLB is better than that of the SLV, yet at the cost of higher complexity. In this work, we use the SLB as the LR technique and combine it with the proposed ZF-PGD and QRD-PGD to create the ZF-PGD-SLB and QRD-PGD-SLB ones. Due to space limitation, we just summarize the SLB adopted on channel matrix in Algorithm 2. For more details of this technique, the readers can find them in [13].

5.2.1. ZF-PGD-SLB Detector

In the ZF-PGD-SLB detection algorithm, the transmitted symbols in LR domain are determined by utilizing the ZF detection procedure as follows:

Once is obtained, the estimated subvector, can be easily determined using (23) and (24).

Similar to the ZF detector, the error covariance matrix and the MSE for the kth subsystems in the LR domain are, respectively, given by

Note that the error in estimating depends on the accuracy of in (25). Hence, (27) can be used as the MSE for estimating It can be seen from Figure 4 that the MSEs of the subsystems detected by the ZF-PGD-SLB are almost identical and much smaller than those of the ZF and the ZF-PGD counterparts. This indicates that the ZF-PGD-SLB detector is able to noticeably outperform the ZF and the ZF-PGD ones.

5.2.2. QRD-PGD-SLB Detector

In the QRD-PGD-SLB detection algorithm, the transmitted symbols in LR domain is determined by adopting the SIC detection technique as follows.

First, use QR decomposition to decompose aswhere and are, respectively, the unitary matrix and upper triangle one.

Next, multiplying both sides of (28) by , we get

Let be the entry at the ith row, jth column of and and , are respectively, the ith entries of and . Applying SIC detection technique, the last entry of (i.e., ) is determined aswhere and is Lth row of . Once is determined, the th entry of will be estimated by canceling the interference of out of . In general, The ith entry of is determined asand its hard-decision value is given by

This process is repeated until all entries of are obtained.

6. Complexity Analysis

In this section, we evaluate the complexity of the classical linear detectors, the QRD, the BLAST, and all the proposed ones. In order to do so, we follow the method in [11, 18, 19] to count the number of necessary floating point operations (flops) to estimate a transmitted signal vector. It is assumed that each real arithmetic operation such as a real addition, a real subtraction, a real multiplication, a real division, or a square root of a real number is counted as a flop. It is worth noting that a multiplication of two matrices with dimensions of and needs a total of multiplications and additions. In addition, the inversion of matrix requires multiplications and additions [20]. Furthermore, we would like to note that the complexity of a detector can be computed using either the complex or the equivalent real system. However, as shown in [21], the real system requires more flops than the complex one.

Under the above assumptions, the complexity of the classical ZF, the MMSE, the QRD, and the BLAST detectors in terms of flops is computed directly on the complex system as follows:

Because the proposed detectors are built based on the extended form of the massive MIMO system with the channel matrix including both real and complex values, to evaluate the complexity more exactly, we represent as follows:where both and are complex matrices; and are diagonal real matrices; and and denote zeros matrices, .

6.1. Complexity Evaluation of the ZF-PGD and QRD-PGD Detectors

It can be seen that the computational costs of the ZF-PGD and the QRD-PGD detectors comprised of two parts: (1) generating the two subsystems and (2) utilizing detection procedures to detect and . Because the subsystems in the PGD algorithm are created exactly the same way and their dimensions are equal to each other, the complexity of the ZF-PGD and QRD-PGD detectors can be computed by using the following equation:where is the total complexity of the ZF-PGD/QRD-PGD, denotes the computational cost of generating a subsystem, and is number of flops given by applying the classical D detector (D is either the ZF or the QRD) to each subsystem. From the PGD algorithm, is evaluated aswhere , , and are, respectively, the numbers of flops to compute the projection term , the sub-channel matrix , and the received vector ; .

Because the dimensions of the subsystems are , can be obtained based on (33) and (35) by, respectively, replacing and N with and as follows:

6.2. Complexity Evaluation of the ZF-PGD-SLB and the QRD-PGD-SLB Detectors

From the signal detection procedures of the ZF-PGD-SLB and the QRD-PGD-SLB, we can easily observe that the complexity of these detectors are different from those of their ZF-PGD and the QRD-PGD counterparts by three aspects: (1) converting the subsystems to the ones in the LR domain; (2) utilizing hard decision to estimate transmitted symbols as in (23); and (3) converting the estimated vector in the LR domain to in the transmit constellation. Besides, the dimensions of subsystems in the PGD and the ones in the LR domain are exactly the same. Therefore, the computational complexity of the ZF-PGD-SLB and the QRD-PGD-SLB detectors can be determined based on the following equation:where denotes the complexity of either the ZF-PGD-SLB or the QRD-PGD-SLB detector, is the number of flops to covert one subsystem to the one in the LR domain (e.g., convert to ), and are, respectively, the total complexity of hard decision operation in (23) and of converting to in (24).

Following Algorithm 2 step by step, can be evaluated to bewhere , and are, respectively, the average number of times needed to compute and and to update and so that and are generated successfully.

Input:
Output: ,
(1)Set ; ; and compute .
(2)Do
(3)Find j index such that is maximum reducible value of .
(4)
(5)Compute and chose
(6)If go to (10)
(7); % is jth column of
(8); % is jth column of
(9); % is jth row of
(10)While (true)
(11);

From (23) and (24), the remaining two terms in (42) can be evaluated to be

Note that the complexity of in (23) is already included in the SLB algorithm.

The computational complexities of all the detectors are summarized in Table 1. The results in Table 1 show that all proposed detectors have the same complexity order as those of linear detectors (i.e., ).

The computational costs of all the detectors with different antenna’s configurations are further illustrated in Figures 5 and 6.

It can be observed from Figures 5 and 6 that the complexities of the detectors increase proportionally with the number of antennas. Although the computational costs of the proposed detectors are higher than those of the classical MMSE and the ZF-SLB, they are very much smaller than those of the BLAST one in high-load systems. It is worth emphasizing that the higher complexities of our proposed detectors than those of the MMSE/ZF-SLB are rewarded by huge SNR gains as illustrated in the following section.

It can also be seen from Figure 6 that when the load factor reduces from to (or equivalently, the number of data streams reduces from 96 to 32), the gaps between the complexities of the detectors under consideration get much smaller. This implies that the computational costs of proposed detectors and the BLAST one become comparable in very low load systems. Therefore, our proposed detectors are only suitable to use in the systems with sufficiently high load factors.

7. Simulation Results

In this section, the BER performances of all the proposed detectors as well as the classical linear detectors, the QRD, and the MMSE-BLAST (in the paper, the detector is referred to as the BLAST for short) ones are investigated. The proposed detectors are also compared with the ZF-SLB one proposed in [13] regarding the BER performance. In the simulation, the channel is assumed to be block fading, which is unchanged within the signal block and changed from one block to another. The main simulation parameters and assumptions are set up as follows: the cell radius, , and reference distance, , are 1000 meters and 100 meters, respectively; all active users are uniformly distributed in the cell such that the distance from the ith user to the BS satisfying ; the large-scale fading coefficients are generated according to (3), where dB and . All BER curves are drawn versus SNR defined by in dB.

Figures 7 and 8 illustrate the BER curves of the aforementioned detectors for 2 different system configurations: (1) and and (2) and antennas. The results in both figures show that all the proposed detectors significantly outperform their classical ZF and the QRD counterparts. In addition, the ZF-PGD-SLB and the QRD-PGD-SLB detectors provide much better BER performance than the ZF-SLB one. More specifically, BER performance of the ZF-PGD is much better than that of the classical ZF and even comparable to the MMSE detector’s performance (it is well understood that the MMSE detector can be implemented by applying the ZF detector to the extended system defined in equation (5); therefore, the ZF-PGD is able to approach the performance of the MMSE detector, and due to the effects of the projection terms and , when generating the two subsystems, the ZF-PGD slightly underperforms than its MMSE counterpart). This result confirms our analysis in Section 3. The illustrations in both these figures show that there are big gaps between the BER curve of the MMSE and those of the remaining proposed detectors. For example, for the first antenna configuration, i.e., , the gaps between the MMSE curve and the ones of the QRD-PGD, the ZF-PGD-SLB, and the QRD-PGD-SLB are about 7.4 dB, 11.9 dB, and 13.4 dB at BER, respectively. These gaps reduce to 6.52 dB, 9.02 dB, and 10.82 dB for and . These results imply that in full-load scenarios, as the system size increases, the BER performance gaps between our proposed detectors and those of the MMSE get smaller. This is probably due to the fact that the columns of the channel matrix becomes more orthogonal in pairs when the system size increases, thereby reducing the effect of cochannel interference at the BS.

It is worth noting that the BER performance of the ZF detector is worse than that of the MMSE even if the SNR is very high. The reason is that there always exists a significant gap between the output SNRs of these two detectors when the SNR reaches to infinity [6]. The results in Figures 7 and 8 also show that the BLAST detector gives the best BER performance among all the detectors under consideration. However, its excessively high complexity prevents it from being practically adopted in massive MIMO systems.

Shown in Figure 9 are the BER curves of the detectors under consideration when the load factors of the system are and . The system parameters are set as follows: antennas, , and antennas, leading to the equivalent dimensions as and . It can be seen from Figure 9 that when the load factor of the system reduces, the BER performances of all considered detectors improve significantly. Besides, the SNR gains between our proposed detectors and the classical ones get much smaller when the load factor reduces. Specifically, when , the ZF-PGD, the ZF, and MMSE detectors have nearly the same BER performance, but they underperform than the ZF-SLB, the proposed ZF-PGD-SLB, and QRD-PGD-SLB detectors by about 2 dB, 2.2 dB, and 3.2 dB at BER = , respectively. When the load factor reduces to , performances of all considered detectors are just slightly different from each others. This implies that when the system’s load factor is sufficiently small, the proposed detectors are no longer more advantageous than the classical linear detectors with respect to the BER performance. If the detection complexity is taken into account (as illustrated in Figure 6), then the classical ZF and MMSE detectors are the most efficient for signal recovery in massive MIMO systems with low load factors.

8. Conclusion

In this paper, the parallel group detection algorithm has been proposed to enhance BER performance of high-load massive MIMO systems. By combining the PGD, the classical ZF, and the QRD detectors, we are able to construct the ZF-PGD and QRD-PGD detectors. Moreover, the PGD is further combined with shortest longest basis (SLB) algorithm to generate two other new detectors, called ZF-PGD-SLB and QRD-PGD-SLB. Various simulation and analytical results show that the proposed detectors remarkably outperform their classical linear detectors and the QRD counterparts, yet at the cost of higher detection complexities. Fortunately, the complexities of the proposed detectors are very much smaller than those of the BLAST in high-load scenario. Consequently, our proposed detectors are suitable for signal detection in high-load massive MIMO systems.

Data Availability

All data included in this paper are the results of our research and are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.