Abstract

Automatic monitoring of group-housed pigs in real time through porcine acoustic signals has played a crucial role in automated farming. In the process of data collection and transmission, acoustic signals are generally interfered with noise. In this paper, an effective porcine acoustic signal denoising technique based on ensemble empirical mode decomposition (EEMD), independent component analysis (ICA), and wavelet threshold denoising (WTD) is proposed. Firstly, the porcine acoustic signal is decomposed into intrinsic mode functions (IMFs) by EEMD. In addition, permutation entropy (PE) is adopted to distinguish noise-dominant IMFs from the IMFs. Secondly, ICA is employed to extract the independent components (ICs) of the noise-dominant IMFs. The correlation coefficients of ICs and the first IMF are calculated to recognize noise ICs. The noise ICs will be removed. Then, WTD is applied to the other ICs. Finally, the porcine acoustic signal is reconstructed by the processed components. Experimental results show that the proposed method can effectively improve the denoising performance of porcine acoustic signal.

1. Introduction

With the development of precision livestock farming, it is hard for breeders to monitor porcine abnormal states. Sound recognition, as one of the noncontact detection methods, has been proven to be a valuable method to detect some diseases [1]. But during the process of acoustic signals collection and transmission, the recognition process is easy to be interfered by noise which will exert a negative impact on the recognition accuracy. Therefore, it is essential to eliminate the noise before analyzing the acoustic signals.

Empirical mode decomposition (EMD) is an effective automatic decomposition algorithm to analyze nonlinear, nonstationary, and non-Gaussian signals [2], and basis function is not required [3]. Because of this advantage, EMD is extensively applied in many different fields, such as blind source separation [4] and denoising [5]. The most common denoising method based on EMD is the threshold method of EMD, which determines the signal-dominated and noise-dominated intrinsic mode functions (IMFs) by the threshold. There are two types of denoising strategies for the threshold method of EMD. One is removing the noise-dominated IMFs directly [6], and the other is denoising IMFs by wavelet thresholding denoising (WTD) [7]. In order to distinguish the noise-dominated IMFs and signal-dominated IMFs, many methods have been proposed including the energy entropies of the IMFs [8], the correlation coefficients of the original signal and IMFs [9], and maximum variances of IMFs [10], among others. Ensemble empirical mode decomposition (EEMD) is proposed to overcome the problem of frequency aliasing of EMD [11]. Through adding Gaussian white noise, EEMD can avoid frequency aliasing to some extent. Some researchers have achieved good results in noise removal through replacing EMD with EEMD. Since the majority of useful information is lost when applying the first denoising strategy [12], some studies combine wavelet threshold denoising with the EMD threshold method [13]. However, the application of the wavelet threshold method in IMFs denoising may affect the continuity of reconstructed signals [14].

In order to effectively eliminate the noise produced in the process of sound collection and transmission, EEMD-ICA-WTD, which can be employed before porcine sound recognition, is proposed in this paper. EEMD is used to decompose the porcine acoustic signal into IMFs. Then noise-dominant IMFs are distinguished by permutation entropy (PE). The independent components (IC) of noise-dominant IMFs are extracted by independent component analysis (ICA). As the first IMF contains much of the high-frequency noise [15], the correlation coefficients of ICs and the first IMF are calculated to recognize noise ICs. After that, the noise ICs are removed and the other ICs are denoised by WTD. Finally, the processed ICs are used to reconstruct the porcine acoustic signal. Experimental results show that the proposed method can eliminate the noise in porcine acoustic signal efficiently.

This paper is organized as follows: Section 1 introduces the background significance of porcine acoustic signal denoising and the methods commonly used to eliminate the noises of different signals in recent years. Section 2 describes the porcine acoustic signals and the individual methods, including EEMD, PE, Fast-ICA, and WTD. The process of the proposed EEMD-ICA-WTD is presented in Section 3. The denoising performance evaluation indices, the simulation process, results of EEMD-ICA-WTD, and comparisons with other methods are presented in Section 4. Conclusions are drawn in Section 5.

2. Materials and Methods

The materials of this study are porcine acoustic signals. Below are the details of porcine acoustic signals. The methods are mainly comprised of EEMD, PE, Fast-ICA, and WTD, with a detailed explanation given below.

2.1. Materials

In this study, the original data are collected by an acoustic pickup device (ELITE model OS-100) from a large-scale pig farm in Shanxi Province, China. The schematic of the installation of the acoustic pickup in the pig farm is shown in Figure 1. The replacement gilts (PIC) at 5∼10 months old with weight ranging from 110 kg to 130 kg were studied in these experiments. Five replacement gilts were housed in a pigpen which is four meters wide and six meters long. The collection of sounds is controlled by the program developed in the numerical computing software (Python, ver. Python 3.5). The sampling frequency of the collected acoustic data is 1 kHz. The porcine acoustic signals we selected are scream [16] and cough [17], which are found most commonly in the pigpen. See the data in the Supplementary Material (available here) for the denoising experiments of porcine acoustic signals. In order to analyze the performance of the proposed method qualitatively, Gaussian white noise is added to the porcine acoustic signals to obtain signals with different signal-to-noise ratios (SNRs). In this paper, SNRin is defined as an input SNR of the added white Gaussian noise value and SNRout is defined as an output SNR of denoised signal.

2.2. Methods

The process of EEMD-ICA-WTD is mainly comprised of porcine acoustic signal decomposition based on EEMD, noise-dominant IMFs differentiation based on PE, independent components extraction by Fast-ICA, and denoising by WTD.

2.2.1. Porcine Acoustic Signal Decomposition Based on EEMD

In this paper, the porcine acoustic signal is firstly decomposed. EMD can decompose the signals into IMFs from high to low frequency self-adaptively [18, 19], which is based on the decomposition principle that any signal is composed of IMFs [20]. The IMF must satisfy two conditions: (1) The number of extreme points is equal to the number of zero-crossings. Or the difference of the number of extreme points and the number of zero-crossings is at most 1; (2) At any point, the mean value of the envelope defined by the local maxima and the envelope defined by the local minima is zero. The process of EMD is as follows:(1)The local extreme points which are detected from the original signal s(t) can be connected by cubic curve spline and formed the upper and lower envelops. And the list m1(t) is defined as the means of the upper and lower envelopes.(2)The list h1(t) is defined asIf h1(t) satisfies the above two conditions of IMF, it will be regarded as one of the IMF components:where c1(t) is the first IMF.(3)The residual list r1(t) is the difference between the original signal s(t) and c1(t), which is defined as(4)The residual list r1(t) is regarded as original signal. And then, repeat the steps 1–3. Finally, the other IMFs c1(t), c2(t), …, cn(t) and the final residual rn(t) (Res) can be obtained.Therefore, the original signal s(t) is decomposed as the sum of the IMF components and residual list rn(t):where n is the number of IMF components.

Classical EMD may cause frequency aliasing during signal decomposition. In order to overcome this shortcoming, EEMD was proposed by Wu and Huang [11]. The structure of EEMD is similar to EMD. The EEMD decomposition is done according to the following steps [21]:(1)Add a random Gaussian white noise signal to original signal s(t), which is defined aswhere si(t) is a synthetic signal after the ith iteration adding white noise.(2)Decompose si(t) by EMD into several IMFs and a residual aswhere cij(t) are the IMF components and rin(t) is the residual.(3)Repeat the process as described above N times and add the different Gaussian white noises. Therefore, the original signal adds the Gaussian white noise N times. The means of IMFs can be obtained aswhere cj(t) is the jth IMF.

2.2.2. Identification of Noise-Dominant IMFs by PE

Permutation entropy (PE), proposed by Bandt and Pompe [22], is an average entropy parameter based on the statistical attributes of time series elements [23]. It is sensitive to the variations of signals. PE can be described as follows:(1)For a given time series X = {x(i), i = 1, 2, …, n}, the matrix of phase space reconstruction A is obtained as [24]where λ is the time delay, m denotes the embedding dimension, and r is the number of vectors in phase space reconstruction components, r = n − (m − 1)λ.(2)Each row of A can be arranged in ascending order [25]:where j1, j2, …, jm are the column indexes of elements in reconstruction components.(3)Since the embedding dimension is m, there will be m! possible permutations. Each row of A can be represented by one of the permutations. Pj represents the probability of jth permutation. Then, the PE can be designated as follows [26]:where Pj is the probability of jth permutation and k is the number of possible permutations, k = m!.(4)The PE of order can be normalized as [26]The range of PE is 0 to 1. The PE can be used to recognize the IMF of noise [27]. Therefore, PE is adopted to distinguish noise-dominant IMFs in this paper.

2.2.3. Independent Components Extraction by Fast-ICA

The ICA, as one of the multivariate statistical methods, is widely used in statistical sources separation [28]. It can extract the independent components from the mixture with unknown mixing coefficients [3].

The independent sources can be denoted as s(t) = [s1(t), s2(t), …, sK(t)]T, where K is the number of independent sources. The observed signals can be defined as x(t) = [x1(t), x2(t), …, xM(t)]T, where M is the number of observed signals. represents the matrix of mixing coefficients. The mixing relationship is defined as follows:

This function can be expressed by the matrix:

The aim of ICA is to estimate the inverse of the mixing matrix W, which could be used to calculate the independent signals.where y(t) is the estimation of s(t), y(t) = [y1(t), y2(t), …, yN(t)]T.

Fast-ICA algorithm is one of the improved ICA algorithms, which is widely utilized to estimate the orthogonal matrix. Fast-ICA has higher convergence speed compared to the conventional method and the step-size parameters are not needed. In this paper, Fast-ICA is used to extract independent components from IMFs.

2.2.4. Denoising by WTD

Wavelet transform denoising (WTD) is one of the denoising algorithms based on wavelet transform (WT). WT can decompose signals at different scales. The discrete wavelet transform (DWT) is calculated as follows:where a is the decomposition level, b is the parameter of translation (for dyadic WT: b = 1), ψ is the wavelet basis function, and T is the number of sampling points.

The primary steps of WTD are described as follows:(1)Decompose the original signal by WT with proper wavelet basis function and decomposition level.(2)The threshold is performed by the selected proper threshold method for high-frequency coefficients at each decomposition scale. The low-frequency wavelet coefficient is kept unchanged.(3)The signal is reconstructed by the low-frequency coefficients and high-frequency coefficients after threshold processing.

It is crucial to select an appropriate threshold method for WTD. The common threshold selection methods fall into soft threshold method and hard threshold method [29]. The hard threshold method causes breakpoint at the threshold point. The reconstruction coefficient of the soft threshold method has good continuity [30]. Therefore, the soft threshold method is adopted in this paper.

3. Proposed Methodology

The process of the new efficient denoising technique proposed in this paper is shown in Figure 2. It consists of five main steps explained as follows:(1)The porcine acoustic signal is decomposed into IMFs by EEMD. Sorted in the increasing order of IMFs, the frequency distribution of the IMFs varies from high to low. The noise mainly concentrates in high frequency. Therefore, the first few IMFs contain both the information of porcine acoustic signal and noise [15]. PE of each IMF is calculated. If PE is more than 0.5, the corresponding IMF will be regarded as noise-dominant one.(2)Denoising the noise-dominant IMFs directly may destroy the continuity of reconstructed signals. It is harmful to the denoising effect [14]. Therefore, Fast-ICA is used to extract the ICs of noise-dominant IMFs to concentrate the noise and real information and improve SNR of components.(3)As the first IMF contains much of the high-frequency noise [15], the correlation coefficients of ICs and the first IMF are calculated. If the correlation coefficient is more than 0.8, the IC will be regarded as noise and will be removed.(4)Denoise the other ICs by WTD. The wavelet basis function and decomposition level we selected are db6 and 3.(5)The denoised ICs (DICs) are transformed to denoised IMFs (DIMFs) through the matrix of mixing coefficients. Then the porcine acoustic signal is reconstructed by these DIMFs and real IMFs.

4. Results and Discussion

This section introduces the simulation process and results of EEMD-ICA-WTD. In order to verify the performance of EEMD-ICA-WTD, the performance of the denoising is compared with the other six methods.

4.1. Simulation Process and Result of EEMD-ICA-WTD

In order to analysis the process and results of EEMD-ICA-WTD, porcine scream and porcine cough are selected for denoising, taking the noisy scream with 5 dB SNRin by adding Gaussian white noise as an example. The time-domain waveforms of porcine scream and noisy scream are shown in Figure 3. The sampling frequency of the collected acoustic data is 1 kHz. Therefore, the porcine scream of 1 s contains 1000 sampling points.

The noisy scream is decomposed as step 1. The time-domain waveforms of IMFs and Res are shown in Figure 4. The noisy scream is divided into 9 IMFs of different frequencies and 1 residual. Each IMF contains different local characteristics of the original noisy scream.

The PE of each IMF is calculated as step 1. The time delay λ is commonly used as 1 and the embedding dimension m is commonly used as 3 [31]. In this paper, λ = 1, m = 3. The PE of each IMF is shown in Table 1.

Table 1 shows that the PEs of IMF1, IMF2, IMF3, IMF4, and IMF5 are greater than 0.5. Therefore, these are noise-dominant IMFs.

The ICs of noise-dominant IMFs are extracted by Fast-ICA as step 2. The time-domain waveforms of ICs are shown in Figure 5. It can be observed that each IC concentrates more information.

The correlation coefficients are calculated as step 3. The correlation coefficients of ICs and the first IMF are shown in Table 2.

Table 2 shows that the correlation coefficient between IC4 and the first IMF is larger than 0.8. Therefore, it should be removed.

The other ICs are denoised by WTD as step 4. The denoising results are shown in Figure 6.

The end result of the reconstructed signal is shown in Figure 7.

In order to evaluate the denoising performance of the method quantitatively, the root mean square error (RMSE), SNRout, and correlation coefficient (R) are adopted in this article [32]:where s(t) is the porcine acoustic signal, s′(t) is the denoised porcine acoustic signal, T is the number of sampling points, cov(s′, s) is the covariance of s′ and s, σs is the standard deviation of s′, and σs is the standard deviation of s.

RMSE reflects the degree of error between the denoised porcine acoustic signal and the original porcine acoustic signal. The smaller the value, the better the denoising effect. SNRout reflects the ratio of the porcine acoustic signal to real noise. Therefore, the higher the value, the less noise mixes. R is used to evaluate the correlation between the denoised porcine acoustic signal and the original porcine acoustic signal. The higher value represents the better denoising effect.

The performance of the denoising is shown in Table 3.

4.2. Comparison with Other Methods

In order to verify the performance of EEMD-ICA-WTD, six different denoising methods are used as comparison methods. They are EMD-TD [33], EMD-WTD [34], EEMD-TD [35], EEMD-WTD [36], wavelet soft threshold denoising (WSTD) [37], and multiband spectral subtraction (MBSS) [38]. To verify the universality of the methods, two kinds of porcine acoustic signals with different SNRins are denoised. The denoising results of different methods are shown in Tables 4 and 5, where Table 4 is the denoising results of porcine scream and Table 5 is the denoising results of porcine cough.

The results, shown in Tables 4 and 5, are compared with different denoising methods evaluated using three parameters. According to the evaluations of RMSE, SNRout, and R, the EEMD-ICA-WTD has lower RMSE, higher SNRout, and R than the other six methods.

The results show that the EEMD-ICA-WTD proposed in this paper has the best denoising effects with different SNRins not only for porcine scream but also for porcine cough. The EEMD-WTD has the second-best denoising effects. Taking the denoising results for porcine cough as an example, when the SNRin of porcine cough is 10 dB, the values of RMSE, SNRout, and R after being denoised by EEMD-ICA-WTD are 0.0646, 14.6788, and 0.9859, respectively. These are close to the results of EEMD-WTD. The absolute differences of these three parameters between EEMD-ICA-WTD and EEMD-WTD are 0.0046, 0.4371, and 0.0005, respectively. With the increasing noise, the advantages of EEMD-ICA-WTD are more obvious. When the SNRin of porcine cough is −10 dB, the absolute differences of RMSE, SNRout, and R between EEMD-ICA-WTD and EEMD-WTD are 0.0138, 0.1996, and 0.0027, respectively. A large number of experiments for different kinds of porcine acoustic signals verify the universality of EEMD-ICA-WTD. In order to intuitively compare the performances of different denoising methods for porcine scream and porcine cough with different SNRins, the histograms are shown in Figures 8 and 9. Each histogram contains the denoising results of different methods with different SNRins. Different colors represent different SNRins. It can be observed that the RMSEs of EEMD-ICA-WTD with different SNRins are lower than the other six methods. And the SNRouts and Rs of EEMD-ICA-WTD are higher than the other six methods. In summary, the results show that the EEMD-ICA-WTD method is effective and suitable for porcine acoustic signal.

5. Conclusions

To improve the denoising performance of porcine acoustic signal, an efficient denoising technique based on EEMD-ICA-WTD is proposed in this paper. The approach has been developed with the purpose to reduce noise interference during the recognition of porcine abnormal sounds.

Firstly, the porcine acoustic signal is decomposed into different components in order of frequency. Because of the frequency aliasing of EMD, the EEMD is used to decompose the porcine acoustic signal into IMFs. As the noise mainly concentrates in high frequency, PE is used to distinguish the noise-dominant IMFs from the IMFs. Secondly, the continuity of the signal may be adversely affected if the noise-dominant IMFs are denoised directly. Therefore, the ICs of noise-dominant IMFs are extracted by Fast-ICA. The noise and real information are concentrated on the ICs. It has been shown that the first IMF contains much high-frequency noise. Therefore, the noise ICs are identified by correlation coefficients of ICs and the first IMF and are then removed. Finally, WTD is used for denoising the other ICs. The porcine acoustic signal is then reconstructed by processed ICs. The performance of this denoising method is shown to be superior to other methods.

In the future work, this approach will be optimized to reduce the run time on the premise of guaranteeing the performance of the denoising.

Data Availability

The porcine acoustic data used to support the findings of this study are included within the supplementary information file.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

This study was supported by the National High Technology Research and Development Program of China (863 Program) (2013AA102306).

Supplementary Materials

The supplementary materials are porcine acoustic signals of scream and cough. The data are saved as WAV format. The length of each signal is 1 s. (Supplementary Materials)