Abstract

The short-time Fourier transform (STFT) is a classical tool, used for characterizing the time varying signals. The limitation of the STFT is its fixed time-frequency resolution. Thus, an enhanced version of the STFT, which is based on the cross-level sampling, is devised. It can adapt the sampling frequency and the window function length by following the input signal local characteristics. Therefore, it provides an adaptive resolution time-frequency representation of the input signal. The computational complexity of the proposed STFT is deduced and compared to the classical one. The results show a significant gain of the computational efficiency and hence of the processing power.

1. Introduction

Most of the real-life signals like speech, Doppler, seismic, and biomedical signals are time varying in nature. The spectral contents of these signals vary with time, which is a direct consequence of the signal generation process [1]. The STFT is a classical tool for characterizing such signals [2]. The limitation with the STFT is that it provides a fixed resolution time-frequency representation of the input signal. This fixed resolution is the reason for the creation of the multiresolution analysis (MRA) techniques [35], which provide a good frequency but a poor time resolution for the low-frequency events and a good time but apoor frequency resolution for the high-frequency events. This type of analysis is well suited for most of the real-life signals [3].

In this article, the fixed resolution dilemma is resolved to a certain extent by revising the STFT. The motivation behind the proposed STFT is to achieve a smart time-frequency representation of the time varying signals. The idea is to adapt the time-frequency resolution along with the computational load by following the input signal local characteristics. An efficient solution is proposed by smartly combining the features of both uniform and nonuniform signal processing tools.

2. Proposed Adaptive Resolution STFT

The block diagram of the proposed STFT is shown in Figure 1. The description of different blocks is given below.

2.1. Asynchronous Analog to Digital Converter (AADC)

According to [6], the sampling instants of a nonuniformly sampled signal obtained with the level crossing sampling scheme (LCSS) are defined by (1). Where ???? is the current sampling instant, ????-1 is the previous one, and ?????? is the time delay between the current and the previous sampling instants (cf. (2)).

The LCSS drastically reduces the activity of the post processing chain, because it only captures the relevant information [79]. In this context, analog to digital converters based on the LCSS have been developed [1012]. The AADC [10] is employed for digitizing ??(??). An ??-bit resolution AADC has 2??-1 quantization levels which are uniformly disposed according to ??(??) amplitude dynamics. The AADC has a finite bandwidth. Thus, to assure a proper signal capturing a band pass filter with pass band [??min;??max] is employedat the AADC input. Let ???in and ???(??) be the AADC and ??(??) amplitude dynamics, respectively. In order to avail the complete AADC resolution in the studied case, ???(??) is always adapted to match ???in. For an AADC, the maximum and the minimum sampling frequencies [7] are defined by (3) and (4), respectively. Where, ????max and ????min are the maximum and the minimum sampling frequencies of the AADC. ??max is the bandwidth and ??min is the fundamental frequency of ??(??):

????=????-1+??????,(1)??????=????-????-1,(2)????max=2·??max·?2???-1,(3)????min=2·??min·?2???-1.(4)
2.1.1. Enhanced Activity Selection Algorithm (EASA) and Window Selector

The relevant parts of the nonuniformly sampled signal obtained with the AADC are selected—corresponds to the variable length rectangular window—by the EASA. The EASA is defined as shown in Algorithm 1. ??0=1/??min is the fundamental period of ??(??). ??0 and ?????? detect parts of the nonuniformly sampled signal with activity. The condition on ?????? is chosen in order to satisfy the Nyquist criterion for ??min, when sampling ??(??) nonuniformly with the AADC [13]. ???? represents the number of nonuniform samples lie in the ??th selected window, which lie on the ??th active part of the nonuniformly sampled signal. Where, ?? and ?? both belong to the set of natural numbers ??*. ??ref represents the upper bound on ????. The choice of ??ref depends on ??(??) characteristics and on system parameters. The above described loop repeats for each selected window, which occurs during the observation length of ??(??). Every time before repeating the loop, ?? is incremented and ???? is initialized to zero.

The EASA displays interesting features with the LCSS, which are not available in the classical case. It selects only the active parts of the nonuniformly sampled signal, obtained with the AADC. Moreover, it correlates the length of the selected window with the signal local characteristics.

The window selector implements the condition given by expression (5). Jointly, the EASA and the window selector provide an efficient spectral leakage reduction in the case of transient signals [13]. Indeed, spectral leakage occurs due to the signal truncation problem. Usually an appropriate smoothening (cosine) window function is employed to reduce the signal truncation. For the proposed case, as long as the condition 5 is true, the leakage problem is resolved by avoiding the signal truncation. As no signal truncation occurs so no cosine window is required. In this case the window decision ????=1, which makes the switch state 1 (cf. (Figure 1)). Otherwise, an appropriate cosine window is employed to reduce the signal truncation problem. In this case ????=0, which makes the switch state 0. In expression 5, ????1 represents the 1st sampling instant of the ??th selected window and ????-1end represents the last sampling instant of the (??-1)th selected window.

For proper spectral representation, the condition given by expression (6) should be satisfied [13]. Where, ???? is the length in seconds of the ??th selected window. In order to satisfy this condition for the worst case, which occurs for ????max, ??ref is calculated for an appropriate reference window length ??ref. Where, ??ref satisfies the condition ??ref=??0. The process is given by (7) as follows: if(????=??ref?and??????=????1-????-1end?>??02??),(5)??=??0??,(6)ref=??ref·????max.(7)??ref??0 The lower and the upper bounds on ??ref are posed, respectively, by ??th and the system resources (the maximum sample frame which the system can process at once). For ??0 (cf. (7)), the condition 6 holds for all selected windows except for the case when the actual length of the ??(??) activity is less than ??????.

2.1.2. Adaptive Sampling Rate

The AADC sampling frequency is correlated to ??th local variations [7, 13]. Let ?????? represent the AADC sampling frequency for the ??max?? selected window. ??min?? can be calculated by using (8). Where, ??th and ?????? are the final and the initial times of the ????max selected window. The upper and the lower bounds on ????min are posed by ????=??max??-??min??.??????=????????.(8) and ??ref, respectively:??Nyq=2·??maxThe selected data obtained with the EASA can be used directly for further nonuniform digital processing [8, 14]. However in the studied case, the selected data is resampled uniformly. It enables to take advantage of both nonuniform and uniform signal processing tools [7, 13]. Due to this resampling there will be an additional error. Nevertheless, prior to this transformation, one can take advantage of the inherent oversampling of the relevant signal parts in the system [7]. Hence, it adds to the accuracy of the post resampling process [11]. The nearest neighbour resampling interpolation (NNRI) is employed for data resampling. The reasons of inclination towards NNRI are discussed in [13, 15].

A reference sampling frequency ??ref is chosen, such as itremains greater than and closest to the ??????. Depending upon the values of ???????? and ??th, the resampling frequency ??????>??ref (cf. (Figure 1)) can be adapted for the ???????? selected window. For the case, ????????=??ref, ??th is chosen as: ??????=??ref,????????. It is done in order to resample the selected data, lie in the ????????=?????? selected window closer to the Nyquist frequency. It avoids the unnecessary interpolations during the data resampling process and so reduces the computational load of the proposed technique.

For the case, ??th is chosen as: ??Nyq. In this case, it appears that the data lie in the ??(??) selected window may be resampled at a frequency which is less than ??(??) and so it can cause aliasing. Since, the sampling rate of the AADC varies according to the slope of ???? [10]. A high-frequency signal part has a high slope and the AADC samples it at a higher rate and vice versa. Hence, a signal part with only low-frequency components can be sampled by the AADC at a subNyquist frequency of ??????. But still this signal part is locally oversampled in time with respect to its local bandwidth [7]. Hence, there is no danger of aliasing. This statement is further illustrated with the results summarized in Table 1.

2.1.3. Adaptive Resolution Analysis

The STFT of a sampled signal ?? is determined by computing the discrete Fourier transform (DFT) of an ???? samples segment centred on ??, which describes the spectral contents of ?? around the instant ??=??·????. Where ?? is defined as: ????. Here, ???? is the effective length in seconds of the window function ?? and ???? is the sampling frequency. The STFT can be expressed mathematically by (9). In Equation (9), ?? is the frequency index, which is normalized with respect to ????.

?? controls the STFT time and frequency resolution [2]. In the classical case, the input signal is sampled at a fixed sampling frequency ??, regardless of its local variations. Thus, a fixed ??? results into a fixed ???. In this case, the time resolution ????,??? and the frequency resolution ?? of the STFT can be defined by (10) and (11), respectively. Equation (11) shows that for a fixed ?? can be increased by increasing ??. But increasing ??? requires to increase ?? which will reduce ??? (cf. (10)). Thus, a larger ??? provides a better ??? but a poor ???, and vice versa. This conflict between ????????,???? and ?????? is the reason for the creation of the MRA techniques [35].

The proposed STFT is a smart alternative of the MRA techniques. It performs adaptive time-frequency resolution analysis, which is not attainable with the classical STFT. It is achieved by adapting the ??(??), and ?????? according to the local variations of ??th. ????? is the number of resampled data points that lie in the ????? selected window. Thus, the time resolution ??th and the frequency resolution ????????,???? of the proposed STFT can be specific for the ?????? selected window and are defined by (12) and (13), respectively. Because of this adaptive resolution, the proposed STFT will be named as the adaptive resolution STFT, (ARSTFT) throughout the following parts of this article. The adaptation of ????, and ???? also adds to the computational gain of the ARSTFT, compared to the classical one. It is achieved firstly by avoiding the unnecessary samples to process and secondly by avoiding the use of the cosine window function as far as the condition 5 is true. The ARSTFT is defined by (14). In (14), ??th and ???? are the central time and the frequency index of the ???????? selected window, respectively. ?? is normalized with respect to ??th. ?????? is the index of the resampled data points lie in the ???? selected window. The notation ??th represents that the window function length ????=??,????+??/2???=??-??/2?????·????-???·??-??·2??·??·??.(9)???=??.(10)???=??????.(11)?????=????.(12)?????=???????????????????.(13)??,?????=????+??????/2???=????-??????/2?????Resample??,??????·??????-?????·??-??·2??·????·??.(14) and shape (rectangle or cosine) can be adapted for the ??(??) selected window:??(??)????max????min??ref???in??ref=4096

3. Illustrative Example

In order to illustrate the ARSTFT an input signal ??????, shown on the left part of Figure 2 is employed. Its total duration is 30 seconds and it consists of three active parts. Each activity is a sinusoid of 0.9?v amplitude and of 50, 200, and 500?Hz frequency, respectively. The time length of each activity is 5, 0.5, and 1.6 seconds, respectively. ??(??) is band limited between 50 to 500?Hz and it is sampled by employing a 3-bit resolution AADC. Thus, ???? and ???????? become 7?kHz and 0.7?kHz, respectively (3), (4). ?????? = 1.25?kHz and ???????? = 1.8?v are chosen. The selected data obtained with the EASA is shown on the right part of Figure 2. By following the criteria given in Section 2, ???? is chosen, which leads to 5 selected windows. First, two selected windows correspond to the first two activities and the remaining corresponds to the third activity. The last three selected windows are not distinguishable on the right part of Figure 2, because they lie consecutively on the third activity. The parameters of each selected window are summarized in Table 1.

Table 1 exhibits the interesting features of the ARSTFT. ??(??) represents the sampling frequency adaptation by following the local variations of ????. It is achieved due to the smart features of the AADC and the EASA. ???????? shows that the relevant signal parts are locally oversampled in time with respect to their local bandwidths [7]. ?????? shows the adaptation of the resampling frequency for each selected window. It further adds to the computational gain of the ARSTFT, by avoiding the unnecessary interpolations during the resampling process. ????? shows how the adaptation of ????? avoids the processing of unnecessary samples during the spectral computation. ????? exhibits the EASA dynamic feature, which is to correlate the window function length with the local variations of ?????. Adaptation of ??(??), ??(??) and ???? leads to the adaptive time-frequency resolution, which is clear from the values of ??th and ?????? in Table 2.

Table 2 demonstrates that ARSTFT adapts its time-frequency resolution by following the local variations of ??th. It provides a good time but a poor frequency resolution for the high frequency parts of ????=1, and vice versa. It is the type of analysis, well suited for most of the real-life signals [3]. The spectrum of each selected window is computed and plotted with respect to ??ref=4096 on Figure 3. Figure 3 shows the fundamental and the periodic spectrum peaks of each selected window. In this case, the spectrum periodic frequency for the ????=0 selected window ????=??ref is equal to ??(??). It shows the adaptation of ??, which can be visualized from Figure 3.

The ARSTFT also adapts the window shape (rectangle or cosine) for the ?? selected window. The condition 5 remains true for the first two selected windows, which sets ??. As no signal truncation occurs so no cosine window is required in this case. On the other hand, the number of samples for the fourth activity is 11200. Therefore, ??(??) leads to the three selected windows for the time span of the fourth activity. The condition 5 becomes false in this case, which sets ???=3.3. As signal truncation occurs so suitable length cosine (Hanning) windows are employed to reduce this effect.

In the classical case, if ???=0.31 is chosen, in order to satisfy the Nyquist sampling criterion for ????. Then the whole signal will be sampled at 1.25?kHz, regardless of its local variations. It will produce unnecessary samples than required. Moreover, the windowing process is not able to select only the active parts of the sampled signal. In addition, ?? remains static and is not able to adapt with the signal local variations. Thus, it causes the system to process needless samples and so causes an increased computational activity than the proposed case. For classical case, fixed ?? = 4096 will produce nine fixed ?? = 3.3 second windows, for the total ???? time span of 30 seconds. It will lead to fix ???? seconds and 2·(??)2Hz for all nine windows (cf. (10) and (11) .

4. Computational Complexity

This section compares the computational complexity of the ARSTFT with the classical STFT. The complexity evaluation is made by considering the number of operations executed to perform the algorithm.

In the classical case, 2·(??)2 is fixed. In this case, a time invariant, fixed 4·(??)2, cosine window function is employed to window the sampled data. If ?? is the number of samples lie in the window then the windowing operation will perform ??1 multiplications between ?? and ??(??) (cf. (9)). The spectrum of the windowed data is obtained by computing its DFT. A complex term is involved in the DFT computation. The DFT complexity is calculated by taking the real and the imaginary parts separately. The DFT performs ?????? additions and ???????? multiplications, thus operations count becomes ?????? for ??(??) output frequencies. The combined computational complexity 2·???? of the STFT is given by (15). Where, ???? is the total number of windows occurs for the observation length of ??th.

For the proposed ARSTFT, ????????, ??????, and ????=0 are not fixed and are adapted according to the local variations of ??????. The EASA performs 4·(??????)2 comparisons and ??th increments for the ??2 selected window (cf. (Section 2)). The choice of ??=1,2,,?? and window shape requires three comparisons. The selected signal is resampled before computing its DFT. The NNRI is employed for the resampling purpose. The NNRI only requires a comparison operation for each resampled observation. Therefore, the resampler performs ?? comparisons. If ????=0, then a cosine window function is applied on the resampled data, which performs ????=1 multiplications (cf. (Figure 1)). The DFT performs ??1?????=??·??+4·2???.(15)2=??·3+?????=13·????+??·??????+???????+4·???????2.(16) operations for the ??1/??2 selected window. The combine computational complexity ??(??) of the ARSTFT is given by (16). Where ?????? represents the index of the selected window. ???????? is a multiplying factor, its value is 1 for ?????? and 0 for ??(??). The computational gain of the ARSTFT over the classical one is calculated by employing the results of the illustrative example. The results are summarized in Table 3. ??????????

Table 3 shows the computational gain of the ARSTFT over the STFT for each ??ref activity. It shows that the ARSTFT leads to a significant reduction of the total number of operations as compared to the classical one. This reduction in operations is achieved by adapting ??ref, ????????, and ?????? according to the local variations of ??th.

5. Conclusions

A new tool for the adaptive resolution time-frequency analysis is proposed. The ARSTFT is especially well suited for the low activity sporadic signals like electrocardiogram, phonocardiogram, seismic signals, and so forth. It is shown that ?????? and ???????? adapt by following the ???? local variations. Criteria to choose the appropriate ?????? and ?????? are developed. A complete methodology of adapting ??(??) and ??????????(??????=??0/2and????=??ref) for the ????=????+1; selected window has been demonstrated.

The ARSTFT outperforms the STFT. The advantages of the ARSTFT over the STFT are the adaptive time-frequency resolution and the computational gain. These smart features of the ARSTFT are achieved due to the joint benefits of the AADC, the EASA, and the resampling as they enable to adapt , , , , and by exploiting the local variations of . The employment of fast algorithms in place of the DFT for the spectrum computation is in progress, it will further add up to the computational efficiency of the ARSTFT. Moreover, the performance comparison of the ARSTFT with other MRA techniques, in terms of computational complexity and quality, opens the way to new research prospective.

end