Abstract

With the substantial growth in number of wireless devices, future communication demands overarching research to design high-throughput and efficient systems. We propose an intelligent Convergent Source Mapping (CSM) approach incorporating Differential Space-Time Spreading (DSTS) technique with Sphere Packing (SP) modulation. The crux of CSM process is assured convergence by attaining an infinitesimal Bit-Error Rate (BER). Data Partitioning (DP) H.264 video codec is deployed to gauge the performance of our intelligent and efficient system. For the purpose of efficient and higher data rates, we have incorporated compression efficient source encoding along with error resiliency and transmission robustness features. The proposed system follows the concept of iterations between the Soft-Bit Source-Decoder (SBSD) and Recursive Systematic Convolutional (RSC) decoder. Simulations of the DSTS-SP-assisted CSM system are presented for the correlated narrowband Rayleigh channel, using different CSM rates but constant overall bit-rate budget. The SP-assisted DSTS systems are mainly useful in decoding algorithms that operate without requiring Channel State Information (CSI). The effects of incorporating redundancy via different CSM schemes on the attainable performance and convergence of the proposed system are investigated using EXtrinsic Information Transfer (EXIT) charts. The effectiveness of the proposed system is demonstrated through IT++ based proof-of-concept simulations. The Peak Signal-to-Noise Ratio (PSNR) analysis shows that using Rate-2/6 CSM with minimum Hamming distance () of 4 offers about 5 dB gain, compared to an identical overall system code rate but with Rate-2/3 CSM and of 2. Furthermore, for a consistent value of and overall rate, the Rate-2/3 CSM scheme beats the Rate-5/6 CSM by about 2 dB at the PSNR degradation point of 2 dB. Moreover, the proposed system with Rate-2/3 CSM scheme furnishes an gain of 20 dB when compared with the uniform-rate benchmarker. Clearly, we can say that higher and lower CSM values are favourable for our proposed setup.

1. Introduction

Evolution in Internet of Things (IoT) and recent developments in ultrafast cellular technologies resulted in a constriction of available bandwidth [1, 2]. It is estimated that escalation in the demand of higher data rates and bandwidth efficient systems will continue to grow with the advent of Fifth-Generation (5G) wireless technology. 5G is expected to revolutionize the outlook of mankind in medical [3] and intelligent communication technologies [4]. Thus, further researches need to be carried out to better cope with the future demands of wireless and multimedia services.

Traditionally, source coding standards and channel coding schemes have been incorporated in multimedia systems for the transmission of voice and video streams. Source coding is an important approach to compress the original stream of data, focusing on the removal of redundancy from the stream for transmission over band-limited channels. The unwanted concomitant is that removal of redundancy results in the hostile happening of unreliability in the transmitted data. The main reason for such unreliability is that transmission over a nonideal (practical) channel triggers distortions in the transmitted signal. Very few or no-redundant bits schemes neither detect nor correct any potential error arising from unreliable transmissions. Therefore, multimedia streams present an exacting research topic for wireless channels [5]. To address the issue of unreliability, channel coding is utilized. Channel coding focuses on the addition of redundant bit(s) to make the transmission scheme reliable. Consequently, the concept of Joint Source and Channel Decoding (JSCD), making the most of residual redundancy phenomenon, has remarkably attracted wide attention [6].

The approach of Soft-Bit Source-Decoder (SBSD) was proposed [7] and effectively deployed in Iterative Source and Channel Decoding (ISCD) [8] for better convergence. More specifically, SBSD operates by gleaning the residual redundancy in the source coded stream for yielding extrinsic information, thus transforming the stream into short frames. The authors in [9] proposed two techniques, namely, I-frame and P-frame error concealment methods, for robust video transmission over noisy channels. Several other methods are presented in [5] to overcome the exigent tasks of susceptible transmission and channel errors due to the limited residual redundancy and predictive coding. Advanced Video Coding (AVC) or H.264 video codec is currently the most popular compression standard among the researchers for mobile video systems. The strategy of Data Partitioning (DP) [10] is advantageously subsumed in the H.264 standard, reducing the unfriendly channel effects corrupting the data stream. DP splits the stream in line with the level of significance of data into three substreams, each having specific level of importance. Another approach known as Irregular Variable Length Coding (IrVLC) was efficiently utilized in schemes with joint source and channel coding [11]. In short, researchers continuously strive and have put persistent efforts to hone the overall wireless communication infrastructure by making it incrementally closer to Shannon’s capacity [12] pinnacle.

Multiple-Input Multiple-Output (MIMO) system technologies are growingly becoming popular in bandwidth efficient systems supporting ultrafast rates. Such technologies will be of great significance in the future wireless systems such as 5G and 6G [13]. One of the prestigious classes of MIMO systems is Space-Time Block Codes (STBCs), offering fairly acceptable performance with a much easier approach of encoding and spreading [14, 15]. Space-Time Spreading (STS) technique was presented by Hochwald et al. [16] attaining the best possible transmit diversity gain. The authors in [17] developed an orthogonal transmit diversity technique using Sphere Packing (SP) modulation and demonstrated that the SP-assisted STBC exceeds the conventional STBCs in performance. The STBC and STS approaches rely mainly on the technique of coherent detection, which in turn requires Channel State Information (CSI) at the receiver [18]. Besides this, CSI burdens the overall system which dissipates power due to the enormous transmission of training symbols. An alternate to CSI based systems, known as Differential Space-Time Block Coding (DSTBC), was proposed by Tarokh et al. for the cases of two and more transmit antennas [19, 20]. DSTBC greatly reduced the complexity but the price paid was a slight loss in performance. SP-assisted DSTS approach was deployed to enhance the performance of adaptive multirate wideband speech coding [21], Irregular Variable Length Coding with iterative detection [22, 23], cooperative communication [24], self-concatenated coding [25], and turbo detection [18] systems. The performance analysis of convergent and nonconvergent two-stage DSTS-SP schemes based on video resolution and motion contents was done in [26]. The three-stage performance investigation of compressed video using SP modulation was done for the combinational gain technique of layered steered space-time coding in [27], whereas that for the noncoherent DSTS scheme was presented in [28].

In this letter, we apply the technique of DSTS-SP to CSM and investigate its effects on the Bit-Error Rate (BER) performance and EXtrinsic Information Transfer (EXIT) curves. The main contributions of our research work are as follows:(i)An instructive preamble to the concept of CSM and its linkage with RSC, devised for attaining assured convergence.(ii)Incorporating the technique of DSTS-SP in CSM scheme and investigating its effects using two Transmitters (Txs) and a simple Receiver (Rx) with no CSI requirements(iii)Analysing the effects of alterations in CSM rates and Hamming distance () on the EXIT convergence and, hence, attainable performance of the proposed system(iv)Quantifying the video performance of the presented system with the H.264 standard

The remainder of this article is organized as follows. Section 2 gives details about the parametric parlance used in this discourse. An overview of the DSTS-SP based CSM approach is provided in Section 3. The proposed system with the parameter settings is further described and made plain in Section 4. Simulated results are discussed in Section 5. We succinctly conclude the article in Section 6.

2. Parametric Terminologies

Logarithmic-Likelihood Ratio (LLR) or L-value is an important parameter based on the Soft Decision (SD) output and is mainly used to estimate the reliability of received data. In an SD process, the messages are represented in conditional probabilities of their occurrence, such that the received bit is marked either 1 or 0. The LLR is defined as in the following equation [29]:

Here, is the LLR or soft value of a random variable X and represents the probability of X for its two legitimate values.

One of the key solutions for future ultrafast wireless communication is the MIMO technology. MIMO concept mainly asks for utilizing multiple antennas at base stations to serve a vast range of users connected via single receiving antenna or multiple receiving antennas installed in their devices. There arise some issues like hardware complexity, channel estimation, and correlation of the multiple antennas [30], when considering MIMO schemes. DSTS technique offers a good solution to the issues in MIMO systems and advocates the employment of MIMO schemes for attaining rich diversity gains. Spreading codes are utilized in the systems having MIMO. An STS technique with two transmitters is considered in the proposed system of this treatise. STS algorithm basically splits the data stream into multiple substreams (equal to the number of transmitters) and each substream is transmitted by an assigned antenna. Walsh codes are widely exploited as spreading codes in wireless MIMO schemes, specifically in synchronized multiuser systems. Walsh codes are based on the mutually orthogonal Hadamard codes, providing frequency diversity and very good BER performance. Hadamard matrix is necessarily a square matrix generating Walsh code by the continuous conversion between ON and OFF (1 and −1) states at a specified regular transition interval [31]. Details about the operation and generation of different length Walsh codes using Hadamard matrix are discussed in [32].

SP modulation is scaling up in the construction of error-correction codes. Su et al. introduced the merger of transmit diversity techniques with SP modulation, evincing that the SP aided STBC surpassed the conventional STBCs in performance [18]. The viable profit of orthogonal transmit diversity is mostly evaluated using the minimum Euclidean distance [18]. SP modulation results in the best possible minimum Euclidean distance for the symbols, boosting the system’s error resilience property. Regarding Euclidean distance, it is simply the length of straight-line segment connecting two coordinates. Hamming distance terminology, proposed by Hamming [33], is frequently used in the realm of error-correcting codes. Hamming distance quantifies and corrects the emerging error due to noisy channel when data bits are transmitted to the destination. More specifically, Hamming distance shows the difference in two strings with same dimension, that is, the number of varying places in two strings (bitwise exclusive OR, commonly known as XOR) [34]. For example, 10110100 and 10000111 differ in 4 bit places and hence a Hamming distance of 4 exists between them. Similarly, “ROSE” and “NOSE” are single character apart, so a Hamming distance of 1 applies here.

EXIT chart analysis, proposed by Stephen ten Brink [35], promptly predicts the convergence behaviour of iteratively decoded systems. EXIT curves are based on mutual information exchange between the constituent decoders of iterative systems [36]. EXIT charts serve as a handy replacement to the cumbersome Monte Carlo based simulation, offering accurate results for attaining infinitesimal BER [37]. Some common terminologies associated with an EXIT analysis are a priori Information (), a posteriori LLR, and Extrinsic Information (). As given in [37], is the already known intrinsic knowledge of any bit and its value is independent of the iterative decoding operation. When any decoder accepts input in the form of LLRs from channel and , the decoder’s output is termed as a posteriori LLR. Performing a mathematical operation of subtracting from the first output of concerned decoder yields . It is worth mentioning that can be manipulated with the aid of interleavers and deinterleavers, converting it to for other input instances to decoder. EXIT chart analysis asks for two factors in order to generate accurate results [37, 38]. Firstly, the a priori LLRs should be greatly uncorrelated by utilizing higher interleaving bits and, secondly, the probability density function (PDF) of such LLRs must be Gaussian distribution. The following relations as given in [37] present a clear view of the concepts and terminologies linked with an EXIT analysis.

According to equations (2) and (3), the values of and range from 0 to 1. to conversion requires a transfer function T operating at specific , as given in equation (4). The inverse of T exists on the range specified in equation (5).

3. Proposed System Overview

Figure 1 depicts the block diagram of our proposed system based on the CSM scheme with DSTS-SP transmission mechanism and utilizing iterative JSCD approach at the receiver. The information bits are first allowed to the H.264 source encoder, transforming the data into squeezed form apt for transmission with minimal essential bandwidth. The compressed stream is DeMUltipleXed (DeMUX) to generate three different partitions. The technique of Data Partitioning (DP) is incorporated to assign specific important parameters and coding elements to partitions A, B, and C. The partitions are concatenated into single stream . There exists a certain level of unreliability in , mainly due to the concomitants of compression. To vouch for the desired reliability, CSM encoding is introduced as channel coding technique. CSM encoder transforms the video stream into . The bit-interleaver П assists in boosting the achievable performance of iterative scheme. There is a direct linkage between the degree of statistical independence associated with an interleaver and the number of interleaving bits [39]. The only hitch in longer interleaver is the delay it entails. To cope up with the delay, we concatenated the generated bits such that 99 Macroblocks (MBs) were within each frame. The interleaved stream is then RSC encoded with a specific rate to yield , which is SP modulated for propagation over Rayleigh fading channel. Finally, DSTS is applied to revamp the overall gain of system by embracing transmit diversity via two transmitting antennas (Txs).

The decoding operation initiates with single receiving antenna (Rx1). Multiple receiving antennas can be utilized but we proceed with single antenna, merely for easier understanding of the concept. The received signal is allowed to a suboptimum DSTS decoder which continuously accepts interdependent signals from the Txs. The DSTS decoding presents a simple and handy approach for systems benefitting from the MIMO technique. SP demapper brings back the signal to its original frequency range and yields the L-values. The next stage constitutes an ISCD between the RSC and SBSD component decoders. The output from RSC decoder is interleaved to serve as a priori input to SBSD. The extrinsic output from SBSD is again deinterleaved and fed to RSC decoder. The iterative process refines and gleans the best possible information exchange between component decoders. The iterative process yields , which is passed through CSM decoder to generate the reconstructed stream . The stream is then deconcatenated for forwarding to MUltipleXer (MUX). Eventually, the reconstructed signal is processed by the source decoder to produce the decoded bits.

4. Description of the Proposed System

For the parametric setting stated in Table 1, the proposed system was simulated using the IT++ signal processing and communication library. The Quarter Common Intermediate Format (QCIF) Akiyo is considered as a test video sequence with 15 frames per second (fps). To even out the effect of error propagation, the encoded video clip complies with the intracoded “I” and predicted “P” frame sequence, placing 44 “P” frames after each “I” frame. A lag of 3 seconds exists between two consecutive “I” frames. Selection of channel model becomes crucial considering the reflection, diffraction, and scattering effects in wireless systems. Such effects result in the multipath phenomenon [40]. Multipath effect splits the transmitted signal into multiple versions based on the power and fading distributions. Rayleigh fading channel is deemed to be the best fitting in such scenario [41] and hence has been considered for our system. The Rayleigh probability density function is given by the following equation as in [41]:

Here, is variance of the circularly symmetric complex random variable , having real and imaginary parts. The term denotes statistical averaging.

DSTS is incorporated in the proposed system to achieve diversity gain. The schematic diagram of DSTS encoder is depicted in Figure 2. The SP modulated signal is differentially encoded till the refined uncorrelated output q is fed to STS encoder. Differentially encoded data is repeatedly looped back until reaching the set value of interleaving delay. The STS encoder governs the spreading of data with the aid of Walsh codes. The overall differentially spread data is divided into two substreams, each transmitted via a specific transmit power antenna.

The CSM process obeys the convergence condition of iterative systems, requiring a minimum Hamming distance () of at least 2 for valid codewords [42]. The artificial redundancy of CSM assists ISCD in attaining the highest source entropy, given by at the a priori input of . This leads to the design of CSM schemes known as Mapping-I and Mapping-II, acceding to the condition of .

4.1. Mapping-I

Mapping-I adds single parity in the K-bit information word, yielding -bit CSM-coded symbol and conveniently represented as scheme. The redundant bit can be placed anywhere out of the available slots, creating different combinations with fixed of 2 in all. To determine the binary state of generated redundant bit, Mapping-I proceeds by calculating the exclusive OR (XOR) function of the source coded K-bit word as in [6].

In equation (7), represents the redundant bit generated for the t-th M-ary source word and denotes the XOR operation between the corresponding input bits.

4.2. Mapping-II

For the sake of increasing , further N bits are appended in a reverse form to the output of Mapping-I. This reduces the code rate of Mapping-II scheme relative to Mapping-I as discussed in [6]. Hence, the output bits in Mapping-II approach will be bits, where bits as shown in Table 2. This can be conveniently written as scheme.

To further elaborate the two schemes, the following symbols as specified by Table 3 make use of a constant overall code rate. It is clearly seen that the developed schemes follow the necessary condition of all the time except in the case of Rate-1 mapping. For simple understanding, we designate Rate-1 as the case when no CSM is actually invoked. All the symbols generated using the Mapping-I scheme record of 2, whereas those from Mapping-II have a value of 4. It can be seen that only out of symbols of Mapping-I and possible symbols of Mapping-II satisfy the condition. The concatenated RSC encoder provides with the flexibility to cater for the persistent overall code rate. The considered RSC encoder operates by adjusting the puncturing rate to meet the constant bit-rate budget. Puncturing refers to interrupting some bits from transmission to enhance the bandwidth efficiency. The octal format of generator polynomial for the RSC encoder deployed with code memory of 4 is given by .

5. Results and Discussion

Our simulations are based on the QCIF Akiyo video test sequence [5] encoded with the H.264/AVC Joint test Model (JM) 19.0 standard. Additionally, the partitioning of each QCIF frame into 9 slices with each slice containing 11 MBs of 16 × 16-pixel resolution is considered. Generally, MBs are used in the iteratively decoded systems for enhancing efficiency and assisting the use of larger interleaver lengths [10]. The intraframe MB updates along with DP technique in each QCIF frame, enhancing the error resiliency of our proposed setup. Flexible MB Ordering (FMO) and interframe prediction using the bidirectional “B” frame were not invoked for the purpose of reduced video coding complexity. However, to better cope with the fluctuations, only the precursor frames were considered. Moreover, for Rate-1 mapping (no artificial redundancy), we limited the number of iterations between the constituent RSC and SBSD decoders to 5 with the intention to limit the computational complexity. However, the number of iterations is set to 10 for other schemes employing CSM with rate less than unity. For enhancing accuracy and confidence in our simulations, we use the average results after repeating the experiment 160 times. It is worthy to state that all the considered error protection schemes are simulated for a persistent overall code rate and video rate.

We first investigate the convergence pattern of different candidates and their possible constituent RSC and CSM rates for a persistent overall code rate of 1/4. Figure 3 illustrates the EXIT characteristics of three types of protection schemes simulated for our proposed iterative DSTS-SP system. It is observable that the incorporation of DSTS-SP approach to CSM technique is favourable for rate less than unity CSM schemes. Explicitly, for such schemes, the iterative SBSD becomes favourable for the EXIT curves to attain (1, 1) point of convergence. This allows reliable transmissions for these schemes as the convergence point is responsible for an infinitesimal BER. The system is unable to attain convergence for Rate-1 mapping (i.e., no CSM) as no artificial redundancy is introduced in the source coded stream. More explicitly, the outer EXIT curves associated with Mapping-I and Mapping-II schemes ultimately reach the top-right corner point following different paths as shown in Figure 3.

Further analysis on the corresponding convergent schemes employing Mapping-I and Mapping-II CSM schemes along with the associated RSCs as summarised in Table 3 is done at −3 and −4 and −6 and −6.5 dB, respectively. It can be clearly observed that the EXIT chart curves for Mapping-II CSM require lower for convergence compared to the Mapping-I CSM. The corresponding EXIT curves along with the decoding trajectories for different CSM rates are portrayed in Figures 411. The inner and outer decoders assist in an iterative fashion, sharing mutual information with each other to generate decoding trajectories based on the Monte Carlo simulation algorithm. It is clear from Figures 411 that the reduction of CSM rates positively affects the convergence pattern of both Mapping-I and Mapping-II schemes. Furthermore, the Mapping-II scheme portrays additional improvement compared to the Mapping-I scheme owing to the improved and greater number of redundant bits. The convergence of Mapping-I and Mapping-II schemes is in accordance with the Kliewer criterion as discussed in [42]. In Figures 411, the outer and inner EXIT curves intersect each other after the formation of an open tunnel between the two curves. This tunnel is referred to as the convergence tunnel [38], once the constituent curves meet at the (1, 1) point of perfect convergence. According to [43], the wider convergence tunnel results in better convergence performance of the schemes. Figures 411 confirm that the associated tunnel becomes more favourable in terms of convergence performance when lower CSM rates are invoked. We can see in Figures 8 and 10 that, for the Mapping-I scheme, the outer and inner EXIT curves intersect each other two times. More specifically, here the point of intersection prior to (1, 1) point of perfect convergence results in semiconvergent tunnel for Rate-4/5 CSM and Rate-5/6 CSM. This semiconvergent tunnel specifies a moderately lower BER achievement due to convergence rather than an infinitesimal BER at perfect convergence. The Monte Carlo based trajectories accordingly validate the pattern of convergence associated with each of the schemes in Figures 411. For capacity analysis, the higher number of iterations between the EXIT curves is associated with a more nearer performance to the corresponding channel’s capacity as reported in [38]. Comparing Mapping-I and Mapping-II schemes, it is deducible from Figures 411 that Mapping-II scheme requires a greater number of iterations. Hence, Mapping-II performance is more proximal to capacity than Mapping-I performance for the same channel. Similarly, for any of the Mapping-I or Mapping-II scheme, the number of iterations required for convergence is more when the outer CSM rate is least.

The BER performance of different CSM based error protection schemes of Table 3 is depicted in Figure 12. Furthermore, the comparison of these CSM based schemes with Rate-1 is presented in Figure 13. The Peak Signal-to-Noise Ratio (PSNR) performance metric of CSM based schemes is represented in Figure 14, while comparison with Rate-1 scheme is done in Figure 15. Both the BER and PSNR performance curves endorse our claim that the performance of CSM based schemes improves upon reducing the rate of CSM. Furthermore, it is clearly visible from Figures 1215 that the BER and PSNR performances improve with the increase in value for the advocated CSM schemes. More specifically, for Rate-1 scheme, the outer and inner EXIT curves do not converge and, therefore, an infinitesimally lower BER cannot be attained. For the PSNR curves shown in Figure 15, the best PSNR is provided by the Rate-2/6 CSM invoked with Rate-3/4 inner RSC. To be precise, up to 20 dB gain in is recorded with the CSM scheme in comparison to the benchmarker.

Lastly, the subjective video quality performance of Mapping-I and Mapping-II CSM schemes for the 45th Akiyo video sequence is given by Figure 16. The top frames from left to right are recorded at −2.5 dB for Rate-2/3, Rate-3/4, Rate-4/5, and Rate-5/6 Mapping-I CSM schemes, respectively. The bottom frames from left to right are recorded at −6 dB for Rate-2/6, Rate-3/8, Rate-4/10, and Rate-5/12 Mapping-II CSM schemes, respectively. The video test sequence was separately transmitted for each type of scheme presented and was accordingly decoded via the H.264 decoder. It is clear from Figure 16 that the subjective video quality also holds the claim of improved performance for lower CSM rates. Visibly, there are considerably more distortions in Mapping-I schemes even at higher in comparison to Mapping-II schemes. Additionally, employing Mapping-II CSM scheme offers even more superior video quality at considerably lower value of −6 dB relative to Mapping-I CSM.

6. Conclusion

The overall system developed offers significant performance improvements; hence, it will be beneficial in wireless communication technologies. The proposed system is an attempt to improve the PSNR and BER performances using the approach of DSTS-SP-assisted smart antennas which requires no CSI for the decoding process. The CSM technique was used to incorporate artificial redundancy in the SBSD-aided H.264/AVC standard. The novel CSM technique offers a beneficial way to flexibly change the value accordingly for an enhanced and reliable performance. Furthermore, CSM is divided into two schemes based on the generation and number of artificial redundant bits. The BER performance of DSTS-SP-assisted ISCD was revealed to significantly improve by incorporating rate <1 CSM and favourably dividing the bit-rate budget between source and channel codes. EXIT chart analysis also recommends our proposed system as it indicates fruitful convergence patterns. The simulation results clearly advocate for the proposed system. More specifically, the DP H.264 video codec with Rate-2/6 CSM and  = 4 evinces an gain of 5 dB relative to Rate-2/3 CSM and  = 2, at the PSNR degradation point of 2 dB. Additionally, the presented system with Rate-2/3 CSM scheme outperforms the benchmarker (no CSM) scheme having persistent overall rate by about 20 dB. In the future, we aspire to subsume the approach of DSTS in other modulation techniques and state-of-the-art multimedia applications.

Data Availability

The data used to support the findings of this study are embedded in the programming. They are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this study.

Acknowledgments

The financial support of National Centre in Big Data and Cloud Computing, University of Engineering and Technology Peshawar (NCBC-UETP), under the auspices of Higher Education Commission, Pakistan, is gratefully acknowledged. The authors would like to thank and acknowledge the VSB-Technical University of Ostrava and the Ministry of Education, Youth and Sports in the Czech Republic for funding the publication charges of this article (Project nos. SP2021/45 and SP2021/32).