Abstract

Dispersion entropy (DE), as a newly proposed entropy, has achieved remarkable results in its application. In this paper, on the basis of DE, combined with coarse-grained processing, we introduce the fluctuation and distance information of signal and propose the refined composite multiscale fluctuation-based reverse dispersion entropy (RCMFRDE). As an emerging complexity analysis mode, RCMFRDE has been used for the first time for the feature extraction of ship-radiated noise signals to mitigate the loss caused by the misclassification of ships on the ocean. Meanwhile, a classification and recognition method combined with K-nearest neighbor (KNN) came into being, namely, RCMFRDE-KNN. The experimental results indicated that RCMFRDE has the highest recognition rate in the single feature case and up to 100% in the double feature case, far better than multiscale DE (MDE), multiscale fluctuation-based DE (MFDE), multiscale permutation entropy (MPE), and multiscale reverse dispersion entropy (MRDE), and all the experimental results show that the RCMFRDE proposed in this paper improves the separability of the commonly used entropy in the hydroacoustic domain.

1. Introduction

The ocean is one of the main battlefields of future high-tech warfare [1, 2]. With the increasing development of electronic countermeasure technology, people can use signal processing technology to extract features of ship-radiated noise signals to complete military tasks such as target detection and target recognition [3], and these signal processing methods include wavelet transform (WT) [4], empirical mode decomposition (EMD) [5], and variational mode decomposition (VMD) [6]. However, the key step in the ship-radiated noise classification is the extracted features [7, 8]. The stronger the separability of the extracted features, the better the classification effect of ship-radiated noise signals.

Most of the traditional feature extraction methods extract the characteristic parameters of the waveform structure, which can be directly extracted from the time domain, including the mean value, variance, and peak value [9]. The transformation from the time domain to the frequency domain also transforms complex waveforms in the time domain into relatively simple single frequency component distributions in the frequency domain [10, 11]. Therefore, the low line spectrum features and broadband spectrum features in the power spectrum can also become effective features for ship-radiated noise target detection and recognition and have achieved good results in signal classification. In addition, there are also many other kinds of features, such as chaos feature [12], time-frequency joint feature [13,14], correlation dimension [15, 16], and auditory feature [17].

Lempel–Ziv complexity (LZC) is a commonly used indicator to identify dynamics [1820] and is increasingly used as a feature for signal classification in various fields and has found numerous applications in characterizing the randomness of biological and ship-radiated noise signals [21]. In a nutshell, the physical meaning of LZC is that it reflects the rate at which new patterns appear in a time series as the length increases. The higher the LZC value of a sequence, the more new changes occur in the sequence and the more disordered and uncertain the sequence is, while the lower the LZC value, the more regular the sequence is [22, 23]. In addition, more and more improved algorithms based on LZC are constantly being proposed, such as DLZC [24] and PLZC [25].

Another commonly used feature is entropy, which was first proposed by Shannon in 1948 [26]. It has been widely concerned and applied since the beginning of the 21st century because of its excellent performance of quickly and effectively quantifying the complexity of time series. In 2002, Bandt and Pompe first proposed Permutation Entropy (PE) [27], which was quickly applied to characterize the randomness of information in various fields, and many scholars improved it on this basis, such as weighted PE (WPE) [28] and multiscale PE (MPE) [29]; compared with PE, these improved PE-based measures have more substantial stability. However, PE only considers the order of amplitude values, ignores some information of amplitude values, and does not mention the influence of medium amplitude of each embedded vector. In some time series with large differences, the same PE value will be obtained, which will affect the stability of the extracted features [30].

To overcome the deficiencies of PE, Rostaghi and Azami proposed dispersion entropy (DE) in 2016 [31]. As an entropy has been proposed in recent years, DE has been proved to be more effective in measuring signal complexity than PE and improved PE-based measures. DE not only considers the order of amplitude values but also quantizes them through different mapping methods. The medium amplitude in the vector will also affect the DE value according to its size, and small changes will not change its corresponding class label in the vector. Compared with PE and improved PE-based measures, DE also has more vital noise anti-interference ability [3234]. Because of the excellent ability of DE as a feature to characterizing the randomness of signals, many DE-based improved algorithms have appeared in just a few years, such as fluctuation-based DE (FDE) [35], reverse DE (RDE) [36], and the combination of the two, namely, fluctuation-based reverse DE (FRDE) [7]. However, the study in [7] did not consider various modes at different scales.

On the basis of FRDE, we first introduced the concept of multiscale, where the same sequence can obtain multiple subsequences by multiscale operation, and then improved the multiscale calculation by introducing the concept of refined composite, which can divide each subsequence into multiple sequences and average the final obtained entropy value, and finally obtained a new entropy called RCMFRDE. Combined with the clustering methods for classification recognition, the common clustering methods are K-nearest neighbor (KNN) [37], K-mean clustering [38], etc. In this paper, we choose the K-nearest neighbor as the classifier. In addition, we can further improve the final calculated entropy by introducing fractional order [34, 37], but this may lead to a lower separability of the final result out instead due to the effect of the backward scattering entropy. Experimental results show that the classification method based on RCMFRDE has stronger separability compared with several other commonly used entropy-based classification methods.

The train of thought of the paper is as follows: Section 2 reveals formulas of RCMFRDE; Section 3 proposes a feature extraction method that comes from RCMFRDE; Sections 4 and 5 extract four types of ship-radiated noise signals under single feature and double features; ultimately, Section 6 embraces main conclusions obtained.

2. RCMFRDE

Step 1: for a known time series , is the coarse-grained time series of as follows:where is the scale factor (SF), and is the downward rounding function. For , is the element in the time series.Step 2: to map to , normal cumulative distribution function is employed, which can be described aswhere and represent expectation and variance of , respectively, and the range of distributes from 0 to 1.Step 3: is mapped to by using linear transformation as follows:where is the number of categories, is the integer function, and is an integer range from 1 to . And, can form time series , which can be expressed as .Step 4: phase space reconstruction of according to embedding dimension and time delay , and embedding vectors is created as follows:Step 5: each embedding vector is mapped to fluctuation-based dispersion patterns . After this, the total number of vectors is , with a maximum of types of possible fluctuation-based dispersion patterns. There are values in each vector, and each value ranges from to . Based on , can be expressed asAll the vectors contained in the matrix can be described asStep 6: each corresponds to a fluctuation-based dispersion pattern ; then, the probability of each fluctuation-based dispersion pattern can be estimated bywhere is the number of fluctuation-based dispersion pattern of assigned to . Next, the final average dispersion pattern can be computed asStep 7: is expressed asStep 8: the normalized is defined as

3. Feature Extraction Method

In this section, single feature extraction and double feature extraction are carried out, and Figure 1 shows the flowcharts of extracting features.

Detailed steps of extracting a single feature based on the method of RCMFRDE are as follows:Step 1: four different types of S-NS are gathered for the comparison experiments of feature extraction.Step 2: for each type of S-NS, 350 samples were randomly selected from them, and single feature extraction (from SF1 to SF5) was performed using RCMFRDE. To compare the various feature extraction methods, we performed the same operation for MDE, MFDE, MPE, and MRDE.Step 3: KNN is used for classification. For each type of S-NS, the first 50 samples are considered as training samples, and the remaining 300 samples are considered as test samples.Step 4: finally, the recognition rates of RCMFRDE (from SF1 to SF5) were obtained and compared with the other four feature extraction methods.

As shown in Figure 1(b), detailed steps of the double feature extraction method based on RCMFRDE are different from that of the single feature, especially in the number of features. There are four cases in which SF1 is combined with other scales for double feature extraction, including SF1 and SF2, SF1 and SF3, SF1 and SF4, and SF1 and SF5.

4. Single Feature Extraction of the Four Types of S-NS

4.1. Ship-Radiated Noise Signals

In this section, four types of S-NS are taken as the research object, called Ship I, Ship II, Ship III, and Ship IV, respectively. Each of the above S-NS lengths is , and the frequency of sample extraction is 44.1 kHz. The normalized results of each type of S-NS are represented in Figure 2.

4.2. Single Feature Extraction

In the experiment of single feature extraction, for each type of S-NS, 350 samples are selected, and each sample consists of 3,000 sampling points. Five kinds of entropies are employed to extract the complexity feature of each type of S-NS (from SF1 to SF5), including RCMFRDE, MDE, MFDE, MPE, and MRDE. Table 1 is the parameter settings of five kinds of entropies. For RCMFRDE, MDE, MFDE, MPE, and MRDE, time delay τ and embedding dimension m are 1 and 4, respectively. Except that MPE has no category number and mapping method, the category number and mapping method of other entropies are the same. Figures 3 to 7 demonstrate the distribution of five kinds of entropies for four types of S-NS for different scale factors.

From Figures 3 to 7, it can be demonstrated that the different entropy distributions of the four types of S-NS for different scale factors are at different levels; for MDE and MFDE, with the increase of scale, the coincidence of MDE and MFDE distributions of Ship I, Ship II, and Ship III becomes more and more obvious, especially for Ship I and Ship II; when the SF is 4 and 5, the MPE distributions of S-NS with 4 different scale factors are more independent and have less overlap; for RCMFRDE and MRDE, when the SF is 1, the distribution of S-NS for each type is at different levels for each scale factor, and the distribution of features for four types of S-NS is obviously different.

As evidenced by Table 2, the number of misidentified samples and the average recognition rate of the four S-NSs under different entropies and different scale factors differ to different degrees; for RCMFRDE, three samples of ship IV are misidentified when the scale is 1; with the increase of scale, the number of misidentified samples of ship I, ship II, and ship III gradually increases, leading to a decrease in the average recognition rate of RCMFRDE; for MDE and MFDE, the three samples of ship IV are incorrectly recognized when SF is 1, while ship IV can be correctly recognized under other scale factors. We can conclude that the same feature extraction method has different recognition abilities for various types of S-NS under different scale factor conditions.

The experimental results proved that for all entropies, RCMFRDE has the highest recognition rate at scale 1, up to 99.75%, which is 10.19% higher than the other average recognition rates. However, while using only one feature, four types of S-NS cannot be fully recognized by RCMFRDE. For the sake of reducing the number of error samples and further improving the average recognition rate, the experiment of double feature extraction is carried out.

5. Double Feature Extraction of the Four Types of S-NS

5.1. Double Feature Extraction

The feature (SF1) with the highest average recognition rate of RCMFRDE is combined with other scale factors for double feature extraction. There are four cases, including SF1 and SF2, SF1 and SF3, SF1 and SF4, and SF1 and SF5. Figures 8 to 12, respectively, express the distribution of five kinds of entropies for four types of S-NS under double features.

According to Figures 8 to 12, for the five kinds of entropies, Ship I and Ship IV are obviously different and easy to distinguish, while the difference between Ship II and Ship III is not obvious and difficult to distinguish; for Ship II and Ship III, RCMFRDE, MDE, and MFDE can distinguish them easily in all four cases, while other entropies cannot distinguish them completely.

5.2. Double Feature Classification and Recognition

For further comparing the double feature extraction effects of RCMFRDE, MDE, MFDE, MPE, and MRDE, KNN is applied to classify four types of S-NS, similar to feature classification and recognition, in which K value is selected as 1. Table 3 is the classification and recognition results of four types of S-NS under double features.

As shown in Table 3, only RCMFRDE can fully identify Ship III; all entropies except MRDE can fully identify Ship IV; among all combinations of entropies, the RCMFRDE-based feature extraction method combined with two scales has the highest recognition rate of S-NS up to 100%.

In summary, compared with the single scale condition, the average recognition rate of RCMFRDE is improved, utilizing 13.05%. The double feature extraction method of RCMFRDE for S-NS can fully recognize four types of S-NS.

6. Conclusions

In this paper, RCMFRDE is proposed on the basis of FRDE and applied to S-NS feature extraction. The feasibility of the proposed method is verified through feature extraction and classification recognition experiments on four types of measured S-NS, and the main conclusions are drawn as follows:(1)Based on the theory of DE, this paper introduces the operation of refined composite by combining the advantages of RDE and MFDE and proposes RCMFRDE for S-NS feature extraction.(2)Compared with the other four single feature extraction methods, the recognition rate of the RCMFRDE-based single feature extraction method proposed in this paper is the highest and reaches 99.75%, which is 10.19% higher than the other average recognition rates.(3)The proposed double feature extraction method further improves the classification performance of the single feature extraction method. Compared with the other four double feature extraction methods, the RCMFRDE-based double feature extraction method has the best recognition of the four types of S-NS with a recognition rate of 100%.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.