Abstract

Compressive sensing (CS) offers compression of data below the Nyquist rate, making it an attractive solution in the field of medical imaging, and has been extensively used for ultrasound (US) compression and sparse recovery. In practice, CS offers a reduction in data sensing, transmission, and storage. Compressive sensing relies on the sparsity of data; i.e., data should be sparse in original or in some transformed domain. A look at the literature reveals that rich variety of algorithms have been suggested to recover data using compressive sensing from far fewer samples accurately, but with tradeoffs for efficiency. This paper reviews a number of significant CS algorithms used to recover US images from the undersampled data along with the discussion of CS in 3D US images. In this paper, sparse recovery algorithms applied to US are classified in five groups. Algorithms in each group are discussed and summarized based on their unique technique, compression ratio, sparsifying transform, 3D ultrasound, and deep learning. Research gaps and future directions are also discussed in the conclusion of this paper. This study is aimed to be beneficial for young researchers intending to work in the area of CS and its applications, specifically to US.

1. Introduction

US imaging uses sound waves to produce images of the inside of the body. US is the most commonly used medical imaging modality for clinical diagnostics due to its diverse applications including cardiology, ophthalmology, pulmonology, nephrology, gynecology, urology, angiology, and general abdominal imaging [14]. US imaging is used as a guide for surgeries, small pieces of tissue examination, pediatric review, blood pressure, and blood flow direction estimation [5]. US imaging is the most often used modality by the physicians after radiography [4], as it is noninvasive, inexpensive, highly portable, and easily manipulable in medical diagnostics [6, 7]. It has strong connectivity and the ability to diagnose on-site in real time. US uses a variety of techniques for diagnosing various body parts and organs like Doppler ultrasonography, contrast ultrasonography, elastography, interventional ultrasonography, and compression ultrasonography. Several modes of US imaging can be used in many ways for medical diagnostics as discussed in [811].

The term ultrasound is used for frequencies above 20 KHz. Commercial ultrasonography imager uses ultrasound waves in the frequency range of 2–20 MHz. The important parameters which portray a wave are wavelength, frequency, intensity, and velocity. The velocity of the ultrasound waves in a medium is directly proportional to the medium stiffness and inversely to its density. Therefore, ultrasounds travel with high velocity in bones compared to water, fat, and tissue. The velocity of the US in bone, water, and fat is 4080 m/s, 1489 m/s, and 1450 m/s, respectively. US scanner works on principle of echo imaging. To generate US waves, electric current is applied to piezoelectric crystals, and the frequency generated by the crystal is directly proportional to the amplitude of the current. US transducers have an array of these elements often varying in number from 68 to 256. Frequency selection for scanning depends on the region of interest and technique used for diagnosis [12].

For imaging, US transducer transmits the ultrasonic waves on region of interest to be scanned. Some of the US waves are absorbed by the subject body, while a large amount is reflected back towards the transducer. The reflected back waves are transformed to electrical signals by the transducer piezoelectric crystal and are further processed to form an image. Sampling of the reflected signals is done at Nyquist rate to avoid aliasing [13]. Current challenges in acquiring US images are high acquisition speed, high rate of sampling, analogue-to-digital converter, and faster processing units which are costly and consume more power. A basic working principle of US is shown in Figure 1.

2. Survey Methodology

To acquire scholarly work related to area of US image acquisition using CS, Google scholar, IEEE, Elsevier, Springer, and some other scientific search engines and research publishers were searched. The terms used for searches were compress sensing, compressive sensing, and compressed sensing along with the addition of ultrasound. More than ninety papers were acquired and studied for review, out of which twenty papers were selected for final draft. Papers obtained were from various conferences, letters, transactions, and journals from 2008 to 2018. Papers based on CS for Doppler US imaging, photoacoustic US imaging, or specific use of CS for US image noise removal, segmentation, classification, and dictionary learning are not in the scope of this work. Papers which used CS for US image acquisition and image reconstruction were selected for review. Five groups are made based on the objective of the paper; each group has four to five papers except group five which is based on deep neural networks and its application to US image reconstruction in our survey. Results of the papers are tabulated and discussed on the basis of evaluation metric of the algorithms. In the conclusion part of the paper, future directions on the basis of research gaps are also discussed.

3. Compressive Sensing

Compressive sensing (CS) is a method for recovering a compressed signal from far fewer samples than that needed in the Nyquist sampling model [14]. CS is dependent on three basic suppositions: the first one is the sparsity of original data or signal in a transform domain, the second is incoherence among sensing matrix (Ф) and transform matrix (Ψ), and the third is restricted isometric property (RIP). The number of samples needed to precisely reconstruct the compressed data is dependent on the specific reconstruction algorithm [14]. Almost all real world signals have the property of sparsity in one domain or another [15]. If the acquired data are not sparse in any transform domain, then the maximum number of data coefficients will be needed to reconstruct the data.

3.1. Sparsity

Most of the real world data/signals are sparse at least in some transform domain. For instance, image, seismic, and sound signals can be compressed and stored in terms of their projection using suitable model basis. When a model basis is selected suitably, the majority of the projection coefficients become very small or even zero and thus can be ignored. The definition of data sparsity can be mathematically interpreted as follows. Let a signal , represented in a set of orthogonal basis such that x is a member of and is shown in terms of N coefficients . The signal x can be specified as , where symbolizes the coefficients of N × 1 vector s and symbolizes the columns of matrix N × N. are the coefficient sequences of x and N are the total coefficients of x [16]. In common, is named transform domain, sparsifying dictionary, or sparse domain [17, 18]. Signal x is mathematically stated as follows:

The signal x in equation (1) is k-sparse in transform basis if there exist s ϵ  which has only K nonzero coefficients. Commonly known compressible model bases are 2D wavelets, discrete cosine transform (DCT), discrete Fourier transform (DFT) to make images sparse, localized sinusoid to make audio signals sparse, curvelets for wave propagation, and fractal-type waveforms for spiky reflective data [19]. The recovery process of original vector x can be completed using the compressed vector y that has a dimension of M × 1 such that M < N:

The term is an M × N matrix or group of vectors called measurement basis, measurement vectors, or sensing matrix with representing rows. The value of x in equation (2) can be replaced with , and we getwhere A in equation (4) is a matrix of M × N dimension and is helpful in transformation of vector x that is a k-sparse signal into y measurements of M × 1.

3.2. Incoherence

Coherence is the measure of correlation between the elements of two matrices. The matrices could be in dissimilar representation domain of different dimensions. For example, we have Ψ matrix of N × N dimension with Ψ1, Ψ2, …, ΨN as its column vectors and Φ as an M × N matrix having Φ1, Φ2, …, ΦM rows. In equation (5) μ represents coherence and is mathematically stated aswhere 1 ≤ o ≤ N and 1 ≤ p ≤ M. Thus, following the principle of Linear Algebra, we get

In framework of CS, our main objective is to get maximum incoherence between the matrix utilized for sensing, sampling, and compression of data (Φ) and the matrix used for representation of signal as sparse (Ψ) [19]. The more is near to 1, the more the incoherence between and basis is.

3.3. Restricted Isometry Property (RIP)

The idea of RIP was first presented in [20] and has been used to solve many theorems in CS [21]. RIP characterizes matrices which are almost orthonormal, at least when in use on sparse vectors. An important consideration of compressive sampling is that the information of signal must be preserved by measurement matrix which is guaranteed by checking the RIP of measurement matrix . RIP is defined on isometry constant of a matrix, which is the smallest number such that equation (7) holds for all k-sparse vectors x. Similarly, if is smaller than one, then all pairwise distances between k-sparse signals must be well kept in the measurement space, as shown in equation (7) for k-sparse vectors x1 and x2:

3.4. Compressive Sensing in Ultrasound

CS has the ability to reconstruct the images and signals from far fewer samples required in traditionally used Shannon Nyquist sampling theorem. The main concern of CS is that the data must be sparse in some model basis, e.g., Fourier basis, wavelet basis, and dictionary learned from data. US is a medical imaging modality which acquires image by the beam forming of the set of raw RF signals received at each transducer element. CS in ultrasound reduces the number of transducer elements and thus reduces acquisition time and power consumption.

4. Reconstruction Model

For CS reconstruction nonlinear algorithms are used, which requires information of sparsifying matrix, where data are sparse or compressible. In equation (4), is a k-sparse vector representing projection coefficients of x on , y is the measurement vector, and A is the MxN matrix , where s is the original ultrasound signal vector and x is the sparse representation of that vector. To make signal or vector s sparse is used, is the measurement matrix used to sample and compress the vector x, and y is the compressed signal called measurement vector obtained from the product of matrix A and ultrasound signal vector x. To solve and prove that solution of equation (8) is sparse reconstruction, L0, L1, and L2 norms are used over solution space [22]. Using least square solution which is the minimization of L2 norm, the solution can be presented as

Using L1 minimization matching pursuit (MP) or basis pursuit (BP) etc., the signal is accurately reconstructed from vector y having M number of compressed measurements by solving a simple convex optimization problem [23]. Reconstruction of signal using L1 minimization can be stated as

Using L2 minimization for nonsparse signals gives disappointing results [21]. As most of real world signals are not sparse, therefore, L2 minimization is an inappropriate choice for signal reconstruction. In most of the cases, L0 minimization guaranties perfect results although that involves more computation. Under certain conditions L1 norm gives same results as L0 [24, 25].

5. Performance Evaluation Parameters

For the evaluation of reconstructed images, various evaluation parameters have been used by researchers for the comparison of their results. The most common evaluation parameters used are signal-to-noise ratio (SNR), contrast-to-noise ratio (CNR), mean square error (MSE), root mean square error (RMSE), normalized root mean square error (NRMSE), peak signal-to-noise ratio (PSNR), structural similarity index (SSIM), and mean absolute error (MAE). Some of them are discussed below.

5.1. Mean Absolute Error (MAE)

The accuracy of signal or data after CS reconstruction is obtained by comparing original data or image with reconstructed data or image usingwhere N represents the total number coefficients of image data, is the intensity or amplitude of the ith original image pixel, and is the intensity or amplitude of the ith processed image pixel.

5.2. Signal-to-Noise Ratio (SNR)

SNR is used to evaluate the image quality with the definition aswhere μ denotes the mean and represents the standard deviation of noise.

5.3. Root Mean Square Error (RMSE)

RMSE is a quadratic scoring rule that gives the average of the error. RMSE is used to measure accuracy for continuous variables. In equation (14), is the intensity of original image pixel and is the intensity of processed image pixels. Mathematically, RMSE is written as follows:

5.4. Structure Similarity Index (SSIM)

SSIM is a measure of the degradation originated because of data compression or due to data losses during transmission. Two images are needed for SSIM calculation: original image and the reconstructed one. SSIM is mathematically expressed aswhere characterizes the standard deviation and expresses the average value of original image I. The terms and represent standard deviation and average value of the processed image. Here in equation (15), express the correlation coefficients of the original and processed images. and stabilize the division with a weak denominator [26]. If SSIM results in the value 0, then the images are totally dissimilar, and two images are completely matched if the value is 1.

5.5. Contrast-to-Noise Ratio (CNR)

CNR determines the quality of reconstructed image in comparison to original image. CNR is similar to the SNR with the difference that it subtracts some image intensity term and then calculates the ratio [27]. CNR is given by [28] aswhere represents the image intensity of producing structure O, denotes the image intensity for image producing structures P in the region of interest, and represents the standard deviation of the original image noise.

6. Classification of CS Algorithms in Ultrasound

There has been a lot of work done on different aspects of medical ultrasound. Researchers are working to improve the CS reconstruction techniques for achieving good quality images with high compression ratios and greater accuracy. In this section, groups are made on the basis of various algorithms and reconstruction techniques, keeping in view their different attributes. The classification of CS algorithms is shown in Figure 2.

6.1. General CS-Based Reconstruction Algorithms

A number of algorithms have been used for CS, where each gives unique results for the reconstruction accuracy and computation time. This group consists of algorithms which are used for US acquisition or reconstruction.

Donoho introduced the idea of approximate message passing (AMP) in CS [29]. AMP is used for image reconstruction in compressive sampling framework with various denoising algorithms [30, 31]. Kim in [32] used AMP as a CS reconstruction algorithm for US image reconstruction. To achieve better image quality and compression ratio, several combinations of denoiser and transform domains are used. Amplitude scale invariant Bayes estimator (ABE) and soft thresholding (ST) are used as denoiser in spatial domain, wavelet domain, and DCT. The steps of AMP iteration are defined inwhere is an estimate of at iteration, is the thresholding function derivative, is the transpose of measurement matrix, represent the error, and is the measurement rate.

Hill in [33] has used AMP with image denoiser, which is based on a heavy tailed distribution. AMP has been used specifically as a Cauchy prior based maximum a posteriori (MAP) approximation within a wavelet-based compressive sensing pattern. The MAP denoising algorithm results in very fast convergence, which is approximately twice fast in comparison to the AMP method for image compressive sensing. To benchmark the performance of the proposed system, the authors have proposed two other methods: amplitude scale invariant Bayes estimator and soft thresholding.

Achim [34] extended their previous proposed approach for CS reconstruction of RF US images. This approach of RF signal reconstruction has used symmetric alpha-stable-IRLS (SαS-IRLS) algorithm in the Fourier domain with prior information about US RF signals. The first information based on observation is that US RFs are best characterized by statistical use of alpha-stable distribution, and the second one is that RF echoes are easily assumed in Fourier domain. Using these two observations, IRLS has been used. Through simulation, it is proved that RF echoes give best results using the proposed LP-norm minimization with the two prior observations. Reconstruction is done using the proposed LP minimization method. SSIM and NRMSE results show that the proposed SαS-IRLS-DP gives better results in comparison to SαS-IRLS and SαS-IRLS-FD.

Liu [35] proposed unique CS-based synthetic transmit aperture (STA) technique for achieving a high-frame-rate, high-contrast, and high-resolution US imaging. In STA, one element of US transducer transmits plane wave sequentially at a time with random apodization for several times and all the elements receive the corresponding echoes. CS-STA uses a less number of plane wave firings than STA. A reconstruction algorithm is then applied on CS-STA to recover full dataset of STA from recorded echoes of CS-STA. A B-mode image is formed from the data by applying the SA beam-forming algorithm. The advantage of CS-STA is the use of less ultrasound firings than STA firing to achieve high frame rates. CS-STA also maintains high resolution, as full dataset of STA is recovered by CS and due to plane wave firing the contrast is improved. y is a sparse signal having most of its entries zero. Authors have solved the problem using convex optimization as shown in

Quinsac [36] has presented a Bayesian reconstruction framework for CS of ultrasound images using frequency domain as sparsifying basis. Bernoulli Gaussian prior is applied to the DCT of US images for achieving sparsity. To reconstruct US image Bayesian CS is used by the authors in [35]. The Bayesian method allows the image sparse level to be estimated in a spectral domain. Furthermore, it is a useful parameter in L1 constrained minimization problem. The advantage of the proposed method is the increase in frame rate acquisition. Moreover, it is completely automatic and no requirements are needed to adjust parameters. For comparison, US image is reconstructed using classical techniques and proposed Bayesian based algorithm. Achim [36] first time proposed iteratively reweighted least squares algorithm (IRLS) algorithm to US RF signal for LP-norm minimization. All the above discussed CS algorithms along with their results are summarized in Table 1.

6.2. CS Reconstruction Algorithms Based on Sparsifying Transforms

Every real world signal is sparse in some specific transform domain and if right sparsifying transform is selected for a signal, one can get data or image with improved results for a large compression ratio. In 2010 Friboulet [37], used directional wave atom as model basis for reconstruction, Fourier model basis and daubechies wavelet model basis were used for the comparison. The obtained results shows that directional wave atom basis gives better results in comparison to others two model basis at a sampling rate of 50% to 90%. Reconstruction was done by solving L1 minimization using L1-Magic-Packet. The accuracy was given by comparing the results of CS reconstruction with the original signal data using MAE. In [38] Chu has used DCT and discrete wavelet transform (DWT) as model basis. They have discussed CS reconstruction performance of three different types of US post-beam-formed data. The three post-beam-formed data types reconstructed are radio frequency (RF), envelope, and log converted data. Individually data type has a unique signal spreading and thus has unique sparse representation which affects sparsifying effectiveness in the process of CS reconstruction and quality of reconstructed images. DWT was used for experiment, and MSE results of US image data have shown that CS reconstruction of post-beam-formed data has the minimum value of error. Results of DCT as a sparsifying domain were the same as DWT; therefore, they were not discussed in their paper.

Kumar [40] discussed frequency and time domain beam-formed matrices and their implementation as CS matrix for US image reconstruction. Using beam-forming matrices, a direct US image reconstruction based on CS is presented. The authors in [34] developed an ultrasound time domain model for beam forming along with a frequency domain equivalent. In CS sparse recovery, they used this model based on matrices of time and frequency domain to recover images from undersampled ultrasound waves. Time domain ultrasound beam-formed matrices are given by

Frequency domain US beam-formed matrix is represented by

The beam-forming matrices use the Fourier transform as a sparsifying domain. The CS matrix is created by taking the product of beam-forming matrix with Fourier matrix. The image is recovered directly from the revived signals using orthogonal matching pursuit (OMP) for recovery. The experiments were done with half of the minimum needed sampling rate.

Kim [32] used CS reconstruction algorithm AMP for US image reconstruction and spatial or time domain, wavelet domain, and DCT as sparsifying domains. AMP is an iterative algorithm which reconstructs the image by compressive sampling and image denoising. ABE and ST are used as denoiser. PSNR and SSIM results show that DCT with ABE denoiser has better results. In 2017, Shin [39] used CS applied to ultrasound elastography. Their research has used three different model bases or sparsifying matrices, i.e., DCT, WA, and DFT, and two recovery frameworks, i.e., block sparse Bayesian learning (BSBL) and L1 minimization. Results of sparsifying model basis and reconstruction algorithm were compared. They have suggested that DFT for sparsifying and BSBL algorithm for reconstruction has the best results at 60% compression. The reconstruction of the RF ultrasound sampled data is done by solving the following L1 minimization problem:

Comparison of mentioned techniques and their results is given in Table 2.

6.3. CS Reconstruction Algorithms Based on Compression Ratio

CS is used to decrease the number of samples during acquisition or reconstruction of some data from a minimum number of samples. In this group, those research papers which have used CS and achieved good reconstruction results for large compression ratios up to 80% are discussed. Results are given in Table 3.

Zobly in [41] proposed a novel structure of CS sampling theory for Doppler US signal reconstruction. OMP and compressive sampling matching pursuit (CoSaMP) have been used in their study. OMP algorithm is less complex and fast for recovering high-dimensional sparse signals in comparison to CoSaMP. In their experiments, both algorithms recovered images in a short period of time with sufficient accuracy from a minimum number of samples. Results show that OMP performs faster than CoSaMP. However, CoSaMP has better reconstructed image quality in comparison to OMP.

In 2015, Chen [42] used novel framework of compressive deconvolution (CD) for reconstructing enhanced RF images from measurement vector. CS and deconvolution are combined and applied to US imaging, which resulted in compressive deconvolution model. This technique helps to reduce US data volume and improve image quality. Solution of proposed compressive deconvolution is centered on alternating direction method of multipliers (ADMM) that utilizes two constraints. The first constraint is image sparsity in some sparsifying domain like Fourier, wavelet, etc. The other constraint is the prior information for the tissue reflective function (TRF). For sparsifying transform generalized Gaussian distribution (GGD) model is used. The authors experimented the algorithm for the reconstruction on the modified Shepp-Logan phantom with 20–80% samples removed.

Chen [45] worked on compressive deconvolution for US imaging using supposition of GGD TRF. Their work focuses on application framework of CD on US imaging and suggested an updated Lp-norm algorithm based on ADMM. The solution is based on ADMM which utilizes two constraints, i.e., sparsity of the RF images in the 2D Fourier domain, and the LP norm, i.e., the norm used to recover the images [46]. The main purpose of their work was to approximate the TRF x from the fuzzy and compressed measurements y. Based on the supposition of GGD x, TRF’s LP-norm minimization is utilized. The results confirmed the dominance of proposed simultaneous CD method that has better performance than the sequential approach.

In 2017, Chen [43] applied CD framework to estimate point spread function (PSF). The model of compressive deconvolution uses 2D convolution operator, which has the information on the system PSF with a measurement matrix. The author has proposed a unique alternating minimization-based optimization method, which inverts the resulting linear model. This method results in enhanced US image and simultaneously gives the estimation of the PSF. Chen [44] used CD and achieved US data from a minimum number of samples for US image and improvement in image spatial resolution. In this research, previous work of the CD is extended and is aimed to jointly estimate system PSF and US image. For a known PSF (estimated in [42]), they proposed compressive semiblind deconvolution (CSBD) which has proved the capability to recover improved ultrasound images from compressed measurements by reversing forward linear model. To get quantitative results of the proposed algorithm, the authors have first used the restoration of TRF for RF images and then PSF for simulated data.

With proposed CSBD, two other CS deconvolution algorithms, CD and CD_true, were also implemented by the authors and the results were compared. All the mentioned algorithms and their results are summarized in Table 3.

6.4. CS Reconstruction of 3D Ultrasound

Not much research has been done in the field of 3D US compressive sensing. Research work in this field has mainly two directions, 3D US image CS acquisition and 3D US image reconstruction. In this section, we have included both mentioned categories. CS framework was first adapted to 3D ultrasound in 2010 by Basarab [47]. They used three different undersampling patterns and a nonlinear conjugate gradient algorithm for reconstruction. The US images are reconstructed in the Fourier domain as CS gives best results when original data samples are linear combinations of the data to be reconstructed. The results obtained from NRMSE between original image and reconstructed image from 50% samples of the original data show minimum data loss. To recover original K-space data, optimization routine is used as follows:

3D volume of consecutive RF images in K-space is represented by , sampling scheme is denoted by , represents the inverse Fourier transform and are the RF random sample locations, y denotes the RF ultrasound image compressed measurements, and Ψ denotes the sparsifying model basis and the coefficients weights for sparsity in model basis. Three unique sampling schemes Φ1, Φ2, and Φ3 are represented and evaluated in their research. Φ1 is a uniform random sampling pattern in three directions and Φ2 is proposed to reduce the number of US pulse emissions. To make US image K-space sparse, three sparsifier finite differences, discrete cosine, and daubechies wavelet transforms were used on the simulated RF images in frequency domain.

In [48], the authors have proposed a new approach for 3D ultrasound compressive sampling, based on learned overcomplete dictionaries. These dictionaries represent a signal sparse as these dictionaries are designed for a particular category of images. Two undersampling patterns for the 3D US imaging are experimented by them. The first one is a spatially uniform random acquisition and the second one is a line-wise random acquisition. K-SVD algorithm is used for dictionary learning on patches extracted from the training dataset. K-SVD learned dictionaries have shown a minimal information loss in comparison to fixed sparsifying transform indicating the efficiency of overcomplete dictionaries. In 2015, Lorintiu [49] used K-SVD with the improvements proposed by [51]. K-SVD algorithm is used for dictionary learning. On-log envelope data are utilized, by removing 20%–80% of the original acquired data as in [48].

The influence of the sampling strategy and training parameters are evaluated on simulated images. Comparison results of two sampling patterns, which were line-wise and point-wise, have shown that line-wise sampling pattern is more accurate, making CS acquisitions of 3D data possible. CS acquisition of 3D data offers increased frame rate by leaving out the acquisition of RF lines. The method is applied to several in vivo and ex vivo organs of US volumes. It was shown that overcomplete dictionaries have better performance in comparison to Fourier, Cosine, or other fixed transforms. Finally, the generality of learned dictionary approach is investigated with the possibility of building a general dictionary, which can be used to reconstruct different in vivo and ex vivo organs of different volumes reliably.

In 2017, Kruizinga [50] introduced an US transducer of one element with a plastic aperture mask to acquire a 3D US image. They used a plastic mask with a different coding in front of its aperture. This mask acts as a distorter (to RF reflected back signals) by inducing varying echo delays and guarantees that pixels in the image are independently identifiable in the measurement vector. The result is a compression of spatial US field on the transducer surface having enough information to make a 3D image. Using a single sensor transducer, measurements are obtained by rotating a plastic mask at 180° in front of transducer to maximize the condition of the reconstruction problem. An approximate signal model that captures the ultrasonic response of the mask and uses it to pose mask shape optimization as a sensor selection problem is defined. This is solved by converting it to a convex problem using a greedy selection method. Algorithms discussed in this section are summarized in Table 4.

6.5. CS-Based Deep Learning Novel Methods

The architecture of deep neural networks (DNNs) had been very useful by performing various artificial intelligence tasks. It is applied to US imaging in liver classification [52], locating standard plane in fetal US imaging [53], and classification of breast lesions [54]. In this group, a research work using DNN and CS for US image reconstruction is reviewed. Other papers added in the group are using CS and DNN for magnetic resonance image (MRI) recovery, as our review paper is not discussing MRI but the techniques/algorithms can be used and shall be helpful for US images recovery.

A novel framework using DNN for the compression and decompression of US signals based on stacked denoising autoencoders (SDAs) is proposed in [55]. The framework consists of four layers: the first layer is responsible for the compression of the signal and remaining hidden and output layers are applied for the reconstruction of the signal. They have investigated a linear measurement case (SDA-L), where the compression matrix is not learned, and also a nonlinear measurement case (SDA-NL), where the compression is indicated as a layer of the network. The publically available dataset PICUM is used to evaluate the performance of the network; the network is trained using simulated US signals. They have obtained and compared the PSNR results of the proposed SDA-NL, SDA-L, and CS algorithms, which indicated that the SDA-CL has better results than SDA-CNL and CS. The results of the proposed frameworks in the table are obtained using longitudinal common carotid images from the mentioned dataset.

In 2016, Sun [56] introduced a novel deep learning framework ADMM-net to get better reconstruction results. ADMM-net is defined on a dataflow graph, derived from the iterative process in ADMM used for the optimization of CS-based models. Limited memory Broyden–Fletcher–Goldfarb–Shanno (L-BFGS) algorithms are used to train all the parameters of the network, i.e., image transforms, shrinkage function, etc. The ADMM-net uses optimized parameters, which are learned from the training data for CS-based reconstruction task. The reconstruction results using different compression ratios are given in Table 5.

Wen et al. [58] proposed a robust CS method for sparse reconstruction, using a first order algorithm based on ADMM. The ADMM is not convergent for nonconvex problems, and L1 loss is used for convergence and as a smoothing operator. L1 norm is used as a loss function for the remaining error and uses generalized nonconvex penalty for sparsity. The L1 loss is more protective to the outliers in the measurements than L2 loss, and due to the nonconvex nature, more accurate recovery can be obtained. ADMM is proposed to solve the nonconvex and nonsmooth minimization problem. To make the algorithm convergent, a smoothing technique has been applied on L1 loss function and an adequate condition for the convergence of proposed algorithm has been made available for nonconvex regularization.

In 2016, Eksioglu [59] introduced a novel approach for the reconstruction of magnetic resonance (MR) images utilizing nonlocal block matching (BM3D) image model. The modified BM3D-MRI consists of fully decoupled observation fidelity and model fidelity steps. The decoupling allows for the adoption of a varying regularization parameter strategy, enhancing the performance. The final algorithm is very easy to use and understood, using only three parameters for tuning. The reconstruction results of BM3D are given in Table 6 for comparison.

In 2018, Li [60] used a novel composite robust ADMM (Co-Robust-ADMM) for reconstruction. SαS is a usual impulsive noise commonly formed during signal acquisition or transmission. Robust CS is a method of signal reconstruction from a minor number of undersampled data in the existence of impulsive noise. A smoothing strategy is utilized to counter the effect of L1 norm, as it has a nonsmooth optimization nature. They have trained DNN using a model of robust composite regularization to make use of more previous knowledge and image features. The DNN is employed to train the samples for the best possible parameters, shrinkage function and the transform domain, and can be kept as a network. The acquired network is utilized for refining recovery results in which DCT is used as an initial sparsifying transform. The comparison of the proposed framework is done with other existing robust CS methods using PSNR; it has proven that the proposed algorithm results are much better. The authors used 5 MR images with impulsive noise (α = 1.0, γ = 10–4) for network training and for the reconstruction of brain images with different noise levels. For comparison, we have shown their results of the offered algorithm in Table 5 with only one noise level, i.e., α = 1.0 and γ = 10–3.

7. Datasets

In this section, several US image datasets are discussed. Normally, the researchers test their proposed algorithms on simulated datasets followed by real US datasets. The most commonly used simulation software for US images generation is Field II [61]. This software can simulate US sample images, cyst images, or images mimicking human or animal organs. For simulation purpose, the size of the image plays important role; increasing or decreasing its size is somehow related to image recovery, quality, and algorithm execution time. Another important consideration for US image acquisition is the selection of scanning probe, sampling frequency, central frequency, sampling axis, and pitch. Field II can model the scanning probe, including probe voltage, number of US elements, scanning depth, sampling frequency, central frequency, sampling axis, pitch, scanning frequency, and frame rate. Datasets are mainly divided into two categories: (i) real acquired US images and (ii) simulated US images.

7.1. Real Images Datasets

Normally, US images are acquired on demand as per researcher requirement. However, there are some online databases which can be accessed for validating the results of the proposed algorithms. In this research, various real US images and datasets were observed including human thyroid, kidney, cardiac and liver images, mouse liver, kidney, and embryo images. In [36], real US images are acquired using US scanner for CS. The most commonly used US scanner for acquiring images was clinical scanner Sonoline Elegra, which was modified for research with linear probe (Siemens Medical Systems, Issaquah, WA, USA) [32, 34, 36]. The sampling frequency and central frequency are also changed in some research studies, in which RF US tissue images are commonly obtained with central frequency of 7.5 MHz and sampling frequency of 50 MHz and a real human right lobe thyroid gland image with sampling frequency adjusted to 50 MHz and 40 MHz [32, 34, 36]. In [34], an image of real human liver is acquired for the CS reconstruction using the same settings.

In [47], the authors used a SHERPA high-resolution single element probe with frame rate of 10 images per second. The probe frequency was 2 MHz; sampling frequency, 80 MHz; scanning frequency, 20 MHz; scanning width, 16 mm; and exploration depth, 2.8 mm. The acquired image was an in vivo anaesthetized mice embryo. In [42], real images of a mouse kidney and bladder were acquired using a single element probe. Also in [44] an image (size 250 × 180) of real mouse kidney was acquired using a single element US probe.

In [33], a dataset used for US recovery, which is available online at http://research.microsoft.com/enus/projects/objectclassrecognition. The dataset is known as “pixel-wise labelled image database v2” and consists of 591 US images. In [41], the experiments were completed using Doppler US imaging spectrum data. The data are available online at Torpp group websites having a total length of 2032.

In [50], 3D scanning of objects using only one element transducer having an aperture plastic mask with different coding embedded on it is used. For US scanning, a 3D object is made from plastic material consisting of alphabets “E” and “D.” Both of the letters are apart from each other at a distance of 10 mm. The size of the letters used in this work was 12 × 12 mm.

7.2. Simulated Datasets

Field II is a simulation software used to simulate US image and US probe along with its different attributes. In this review, we have observed that researchers have used Field II US RF simulator for experimentation and validation of their work.

In [35], a tissue mimicking phantom (model 040GSE, CIRS, Norfolk, VA, USA) B-mode image was generated using Field II. The center frequency of probe was 7.5 MHz and sampling frequency was 100 MHz. The generated image was a point phantom with a 50 dB dynamic range. Also simulated images of wire region and cyst region were generated utilizing the same settings. The point phantom consisted of six point targets with no attenuation. An RF US image of blood vessels was simulated using Field II in [36], and the size of the image was adjusted to 128 × 128 pixels. In [38], a speckle generating phantom with a hyperechoic cyst was simulated in Field II. The simulated transducer was concave probe with center frequency of 5 MHz and sampling frequency of 80 MHz.

An elastography phantom was generated using finite element analysis (FEA) and Field II in [39]. The authors have modeled a linear elastic phantom of size 40 × 50 × 10 mm3 having a cylindrical inclusion of D : 10 mm. The adjustment for US image acquisition was as follows: a linear probe of 152 elements, center frequency of 3.5 MHz, and sampling rate of 28 MHz. Using Field II, 128 RF lines were generated, where each line contains 2,589 samples through the depth. The phantom was designed to mimic a carcinoma in breast tissue.

A liner probe of 192 US elements is simulated using Field II simulator in [37]. The depth of transmitted and received US was focused at 70 mm. The simulated image was a phantom of 5 hypoechoic and 5 hyperechoic cysts of dimension 50 × 10 × 60 mm3, containing 1,000,000 scatterers. The data are calculated using delay and sum technique using continuous Hanning apodization counting all the 128 receive elements.

In [42], a modified Shepp-Logan phantom is used. The image is modified with introducing speckle noise which is presented in practical US images. This phantom is a built-in MATLAB image and is used in many biomedical image reconstructions and other image processing applications. The procedure to introduce speckle noise is as follows: the first scatterers are generated at uniformly random locations, and the amplitudes are distributed in proportion to zero-mean GGD with scale parameter set to 1 and shape parameter set to 1.3. The scatterers were further multiplied to original values of phantom pixels nearest to scatterers position. The resulting image mimics TRF. The probe settings were a 3.5 MHz linear probe, and the data were sampled in axial direction at 20 MHz. The image is blurred using a Gaussian noise of SNR 40 dBs. Field II is also used to simulate a mouse kidney and a cyst point object; both of the images were acquired by convolution of spatially invariant PSFs and TRFs. For point cyst, the same PSF settings were taken as in the previous case and TRF was a round hypoechoic inclusion with random variable, distributed according to a GGD having shape parameter set to 1. In the kidney image, the PSF was generated using Field II having central frequency of 4 MHz and axial sampling frequency of 40 MHz. A linear probe of 128 elements was used. The shape parameter of GGD was changed to 1.5 and scatterers were taken satisfactorily large enough in both cases.

In [4245], the authors have used Field II to simulate US cyst modified Shepp-Logan and simulated mouse kidney images. In these studies, Field II is used to generate the images with different settings of PSF, TRF, and various distributions of scatterers.

8. Results and Discussion

Techniques of compressive sensing are continuously evolving. There has been a lot of work done by the researchers in the field of CS application to ultrasound. In this work, CS US algorithms are broadly divided into five groups.

In the first group, NMSE, NRMSE, and SSIM results are compared for different algorithms. In [32, 33], the authors used AMP and had the best results of −21.23 dB using 30 iterations of AMP framework with ABE denoiser and DCT as a sparsifier. In [34], NRMSE results of three different algorithms were compared for 33% and 50% compression ratios. The results show that IRLS-DP gives best performance in both cases. The authors in [35] have used a framework of CS-STA for different numbers of RF firings to form an US image. The advantage of CS-STA is the reconstruction of US image from less number of firing or samples, also achieving a high frame rate, high contrast resolution, and high spatial resolution for the reconstructed image. Results are compared with the original image using NRMSE. In [36], Bayesian framework was used, where the error is very small between the original and reconstructed images. Better reconstruction results can be obtained by introducing image blocking, denoising filters, or use of GPUs.

In the second group, CS recovery algorithms are discussed on the basis of different sparsifying domains. Evaluation parameters MAE, MSE, SSIM, and PSNR are used for the comparison of reconstructed data and original data. In [39] and [37], L1 minimization and stochastic sparse recovery algorithms were used. From the obtained results, it can be concluded that BSBL minimization with the wavelet atom as sparsifying matrix has the best performance. In [38], CS is applied to three different data types with DCT and DWT as sparsifying basis. Their obtained MSE results establish that CS-FLOW (CS reconstruction of RF post-beam-formed data) with a DCT model basis has minimum error. In [40], the model basis is used to calculate TPSF. In the model basis, the column coherence of beam-forming matrices has minimum off-diagonal TPSF value. The results had shown that frequency domain model matrix with Fourier sparsifying basis has more accurate PSNR and SSIM values than others. In [32], DCT, wavelet, and time domain sparsifying bases and ABE as a denoiser with the CS recovery algorithm of AMP are used. Their results confirm that DCT as a sparsifying domain and ABE as a denoiser have the best results. From all the above-mentioned research papers, it can be concluded that DCT as a sparsifying basis is the most suitable sparse basis for CS data recovery.

The third group of the review papers deals with different compression ratios for the CS of ultrasound images. This group includes algorithms which were able to achieve more accurate results at higher compression ratio up to 80%. PSNR and SSIM were calculated using the above ratios for the last two papers. The maximum PSNR and SSIM achieved are 29.29 and 80.10, respectively, for CD_true at a compression ratio of 80%. From [41], it was concluded that OMP performs faster than CoSaMP but has less accuracy in comparison. In [42], CD is used to overcome the bandwidth limitation of RF US transducer. These limitations affect spatial resolution, contrast, and SNR. In [43], a 2D convolution operator carrying system PSF information is used with sensing matrix. The framework was called compressive blind deconvolution which aims to jointly approximate system PSF and US image. In [44], a framework compressive semiblind deconvolution is used for US image recovery with an unknown PSF. The results are promising, as they have better PSF and TRF. Future direction for research is the study of parametric model of US PSF and the use of nonconvex optimization techniques. In [45], the framework of CD is used based on ADMM to recover US images. Based on the estimation of generalized distributed TRF, LP-norm minimization is used for values of p between 1 and 2. Future research scope in this area is the automatic approximation of P for LP norm.

In group four, NRMSE was calculated for CS of 3D ultrasound. In [48], CS was introduced and applied to 3D US volumes using three different sparsifying patterns. The US volumes were reconstructed using L1 minimization with a very little information being lost at a sampling rate of 50%. In [47], to achieve sparser representation, learned overcomplete dictionaries are used. The authors used K-SVD for dictionary learning using nonlog envelop data. Results show that K-SVD with R2 sampling pattern has minimum NRMSE for all sampling ratios. In [49], K-SVD is used for sparse dictionary learning while Fourier and DCT transforms are used as sparse domain for in vivo and ex vivo data and also their results are compared. In [50], novel US image acquisition technique using sparse signal acquisition and recovery with a single element transducer and coded plastic mask is introduced. The benefits are as follows: very low cost of hardware, low power consumption, and low processing hardware for image acquisition. The future direction of research in 3D ultrasound CS is the study of different sparsifying basis and sensing matrices, optimal conditions, investigation of optimal routines or algorithms for 3D sparse recovery, sparse dictionary learning routine, and time optimization.

In the fifth group, sparse signal recovery algorithms are jointly used with DNN to improve sparse signal recovery. Some good DNN approaches to recover sparse data are reported in the literature, though a few have found their place in US image CS recovery. Also, some suggestions for improving its performance are the variation of the optimization algorithms, loss function, or new deep neural network for fast and accurate sparse signal estimation.

9. Conclusions

CS in medical ultrasound is a promising field of research and has opened a gateway for research on US scanners and its development. This work presents a concise review for CS in the field of US. Different modes of US image acquisition techniques for acquiring various types of preprocessed and processed data and its applications are presented. Moreover, fundamentals of ultrasound technology and its mathematical model for CS and theoretical foundations are discussed. Contributions made in various aspects of CS in US to improve image quality and to recover the US image from different types of data using various sparse bases for signal representation have also been discussed. This review also provides an overview of various algorithms and frameworks for US sparse signal and image recovery. Each acquisition technique and reconstruction framework has its own pros and cons and there is always a tradeoff between reconstructed image quality, computational complexity, sampling ratio, and noise in the image. Researchers suggested various model bases, sensing matrix, and noise removal frameworks for CS ultrasound reconstruction of sampled data at different rates, yet there is no perfect model for US image acquisition and reconstruction.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

The research done in this article was supported by the Higher Education Commission of Pakistan under the project National Research Program for Universities under Grant no 5805/Federal/NRPU.RD/HEC/2016.