Research Letter | Open Access
Saeed Mian Qaisar, Laurent Fesquet, Marc Renaudin, "An Adaptive Resolution Computationally Efficient Short-Time Fourier Transform", Journal of Electrical and Computer Engineering, vol. 2008, Article ID 932068, 5 pages, 2008. https://doi.org/10.1155/2008/932068
An Adaptive Resolution Computationally Efficient Short-Time Fourier Transform
The short-time Fourier transform (STFT) is a classical tool, used for characterizing the time varying signals. The limitation of the STFT is its fixed time-frequency resolution. Thus, an enhanced version of the STFT, which is based on the cross-level sampling, is devised. It can adapt the sampling frequency and the window function length by following the input signal local characteristics. Therefore, it provides an adaptive resolution time-frequency representation of the input signal. The computational complexity of the proposed STFT is deduced and compared to the classical one. The results show a significant gain of the computational efficiency and hence of the processing power.
Most of the real-life signals like speech, Doppler, seismic, and biomedical signals are time varying in nature. The spectral contents of these signals vary with time, which is a direct consequence of the signal generation process . The STFT is a classical tool for characterizing such signals . The limitation with the STFT is that it provides a fixed resolution time-frequency representation of the input signal. This fixed resolution is the reason for the creation of the multiresolution analysis (MRA) techniques [3–5], which provide a good frequency but a poor time resolution for the low-frequency events and a good time but apoor frequency resolution for the high-frequency events. This type of analysis is well suited for most of the real-life signals .
In this article, the fixed resolution dilemma is resolved to a certain extent by revising the STFT. The motivation behind the proposed STFT is to achieve a smart time-frequency representation of the time varying signals. The idea is to adapt the time-frequency resolution along with the computational load by following the input signal local characteristics. An efficient solution is proposed by smartly combining the features of both uniform and nonuniform signal processing tools.
2. Proposed Adaptive Resolution STFT
The block diagram of the proposed STFT is shown in Figure 1. The description of different blocks is given below.
2.1. Asynchronous Analog to Digital Converter (AADC)
According to , the sampling instants of a nonuniformly sampled signal obtained with the level crossing sampling scheme (LCSS) are defined by (1). Where is the current sampling instant, is the previous one, and is the time delay between the current and the previous sampling instants (cf. (2)).
The LCSS drastically reduces the activity of the post processing chain, because it only captures the relevant information [7–9]. In this context, analog to digital converters based on the LCSS have been developed [10–12]. The AADC  is employed for digitizing . An -bit resolution AADC has quantization levels which are uniformly disposed according to amplitude dynamics. The AADC has a finite bandwidth. Thus, to assure a proper signal capturing a band pass filter with pass band is employedat the AADC input. Let and be the AADC and amplitude dynamics, respectively. In order to avail the complete AADC resolution in the studied case, is always adapted to match . For an AADC, the maximum and the minimum sampling frequencies  are defined by (3) and (4), respectively. Where, and are the maximum and the minimum sampling frequencies of the AADC. is the bandwidth and is the fundamental frequency of :
2.1.1. Enhanced Activity Selection Algorithm (EASA) and Window Selector
The relevant parts of the nonuniformly sampled signal obtained with the AADC are selected—corresponds to the variable length rectangular window—by the EASA. The EASA is defined as shown in Algorithm 1. is the fundamental period of . and detect parts of the nonuniformly sampled signal with activity. The condition on is chosen in order to satisfy the Nyquist criterion for , when sampling nonuniformly with the AADC . represents the number of nonuniform samples lie in the selected window, which lie on the active part of the nonuniformly sampled signal. Where, and both belong to the set of natural numbers . represents the upper bound on . The choice of depends on characteristics and on system parameters. The above described loop repeats for each selected window, which occurs during the observation length of . Every time before repeating the loop, is incremented and is initialized to zero.
The EASA displays interesting features with the LCSS, which are not available in the classical case. It selects only the active parts of the nonuniformly sampled signal, obtained with the AADC. Moreover, it correlates the length of the selected window with the signal local characteristics.
The window selector implements the condition given by expression (5). Jointly, the EASA and the window selector provide an efficient spectral leakage reduction in the case of transient signals . Indeed, spectral leakage occurs due to the signal truncation problem. Usually an appropriate smoothening (cosine) window function is employed to reduce the signal truncation. For the proposed case, as long as the condition 5 is true, the leakage problem is resolved by avoiding the signal truncation. As no signal truncation occurs so no cosine window is required. In this case the window decision , which makes the switch state 1 (cf. (Figure 1)). Otherwise, an appropriate cosine window is employed to reduce the signal truncation problem. In this case , which makes the switch state 0. In expression 5, represents the 1st sampling instant of the selected window and represents the last sampling instant of the selected window.
For proper spectral representation, the condition given by expression (6) should be satisfied . Where, is the length in seconds of the selected window. In order to satisfy this condition for the worst case, which occurs for , is calculated for an appropriate reference window length . Where, satisfies the condition . The process is given by (7) as follows: The lower and the upper bounds on are posed, respectively, by and the system resources (the maximum sample frame which the system can process at once). For (cf. (7)), the condition 6 holds for all selected windows except for the case when the actual length of the activity is less than .
2.1.2. Adaptive Sampling Rate
The AADC sampling frequency is correlated to local variations [7, 13]. Let represent the AADC sampling frequency for the selected window. can be calculated by using (8). Where, and are the final and the initial times of the selected window. The upper and the lower bounds on are posed by and , respectively:The selected data obtained with the EASA can be used directly for further nonuniform digital processing [8, 14]. However in the studied case, the selected data is resampled uniformly. It enables to take advantage of both nonuniform and uniform signal processing tools [7, 13]. Due to this resampling there will be an additional error. Nevertheless, prior to this transformation, one can take advantage of the inherent oversampling of the relevant signal parts in the system . Hence, it adds to the accuracy of the post resampling process . The nearest neighbour resampling interpolation (NNRI) is employed for data resampling. The reasons of inclination towards NNRI are discussed in [13, 15].
A reference sampling frequency is chosen, such as itremains greater than and closest to the . Depending upon the values of and , the resampling frequency (cf. (Figure 1)) can be adapted for the selected window. For the case, , is chosen as: . It is done in order to resample the selected data, lie in the selected window closer to the Nyquist frequency. It avoids the unnecessary interpolations during the data resampling process and so reduces the computational load of the proposed technique.
For the case, is chosen as: . In this case, it appears that the data lie in the selected window may be resampled at a frequency which is less than and so it can cause aliasing. Since, the sampling rate of the AADC varies according to the slope of . A high-frequency signal part has a high slope and the AADC samples it at a higher rate and vice versa. Hence, a signal part with only low-frequency components can be sampled by the AADC at a subNyquist frequency of . But still this signal part is locally oversampled in time with respect to its local bandwidth . Hence, there is no danger of aliasing. This statement is further illustrated with the results summarized in Table 1.
2.1.3. Adaptive Resolution Analysis
The STFT of a sampled signal is determined by computing the discrete Fourier transform (DFT) of an samples segment centred on , which describes the spectral contents of around the instant . Where is defined as: . Here, is the effective length in seconds of the window function and is the sampling frequency. The STFT can be expressed mathematically by (9). In Equation (9), is the frequency index, which is normalized with respect to .
controls the STFT time and frequency resolution . In the classical case, the input signal is sampled at a fixed sampling frequency , regardless of its local variations. Thus, a fixed results into a fixed . In this case, the time resolution and the frequency resolution of the STFT can be defined by (10) and (11), respectively. Equation (11) shows that for a fixed can be increased by increasing . But increasing requires to increase which will reduce (cf. (10)). Thus, a larger provides a better but a poor , and vice versa. This conflict between and is the reason for the creation of the MRA techniques [3–5].
The proposed STFT is a smart alternative of the MRA techniques. It performs adaptive time-frequency resolution analysis, which is not attainable with the classical STFT. It is achieved by adapting the , and according to the local variations of . is the number of resampled data points that lie in the selected window. Thus, the time resolution and the frequency resolution of the proposed STFT can be specific for the selected window and are defined by (12) and (13), respectively. Because of this adaptive resolution, the proposed STFT will be named as the adaptive resolution STFT, (ARSTFT) throughout the following parts of this article. The adaptation of , and also adds to the computational gain of the ARSTFT, compared to the classical one. It is achieved firstly by avoiding the unnecessary samples to process and secondly by avoiding the use of the cosine window function as far as the condition 5 is true. The ARSTFT is defined by (14). In (14), and are the central time and the frequency index of the selected window, respectively. is normalized with respect to . is the index of the resampled data points lie in the selected window. The notation represents that the window function length and shape (rectangle or cosine) can be adapted for the selected window:
3. Illustrative Example
In order to illustrate the ARSTFT an input signal , shown on the left part of Figure 2 is employed. Its total duration is 30 seconds and it consists of three active parts. Each activity is a sinusoid of 0.9?v amplitude and of 50, 200, and 500?Hz frequency, respectively. The time length of each activity is 5, 0.5, and 1.6 seconds, respectively. is band limited between 50 to 500?Hz and it is sampled by employing a 3-bit resolution AADC. Thus, and become 7?kHz and 0.7?kHz, respectively (3), (4). = 1.25?kHz and = 1.8?v are chosen. The selected data obtained with the EASA is shown on the right part of Figure 2. By following the criteria given in Section 2, is chosen, which leads to 5 selected windows. First, two selected windows correspond to the first two activities and the remaining corresponds to the third activity. The last three selected windows are not distinguishable on the right part of Figure 2, because they lie consecutively on the third activity. The parameters of each selected window are summarized in Table 1.
Table 1 exhibits the interesting features of the ARSTFT. represents the sampling frequency adaptation by following the local variations of . It is achieved due to the smart features of the AADC and the EASA. shows that the relevant signal parts are locally oversampled in time with respect to their local bandwidths . shows the adaptation of the resampling frequency for each selected window. It further adds to the computational gain of the ARSTFT, by avoiding the unnecessary interpolations during the resampling process. shows how the adaptation of avoids the processing of unnecessary samples during the spectral computation. exhibits the EASA dynamic feature, which is to correlate the window function length with the local variations of . Adaptation of , and leads to the adaptive time-frequency resolution, which is clear from the values of and in Table 2.
Table 2 demonstrates that ARSTFT adapts its time-frequency resolution by following the local variations of . It provides a good time but a poor frequency resolution for the high frequency parts of , and vice versa. It is the type of analysis, well suited for most of the real-life signals . The spectrum of each selected window is computed and plotted with respect to on Figure 3. Figure 3 shows the fundamental and the periodic spectrum peaks of each selected window. In this case, the spectrum periodic frequency for the selected window is equal to . It shows the adaptation of , which can be visualized from Figure 3.
The ARSTFT also adapts the window shape (rectangle or cosine) for the selected window. The condition 5 remains true for the first two selected windows, which sets . As no signal truncation occurs so no cosine window is required in this case. On the other hand, the number of samples for the fourth activity is 11200. Therefore, leads to the three selected windows for the time span of the fourth activity. The condition 5 becomes false in this case, which sets . As signal truncation occurs so suitable length cosine (Hanning) windows are employed to reduce this effect.
In the classical case, if is chosen, in order to satisfy the Nyquist sampling criterion for . Then the whole signal will be sampled at 1.25?kHz, regardless of its local variations. It will produce unnecessary samples than required. Moreover, the windowing process is not able to select only the active parts of the sampled signal. In addition, remains static and is not able to adapt with the signal local variations. Thus, it causes the system to process needless samples and so causes an increased computational activity than the proposed case. For classical case, fixed = 4096 will produce nine fixed = 3.3 second windows, for the total time span of 30 seconds. It will lead to fix seconds and Hz for all nine windows (cf. (10) and (11) .
4. Computational Complexity
This section compares the computational complexity of the ARSTFT with the classical STFT. The complexity evaluation is made by considering the number of operations executed to perform the algorithm.
In the classical case, is fixed. In this case, a time invariant, fixed , cosine window function is employed to window the sampled data. If is the number of samples lie in the window then the windowing operation will perform multiplications between and (cf. (9)). The spectrum of the windowed data is obtained by computing its DFT. A complex term is involved in the DFT computation. The DFT complexity is calculated by taking the real and the imaginary parts separately. The DFT performs additions and multiplications, thus operations count becomes for output frequencies. The combined computational complexity of the STFT is given by (15). Where, is the total number of windows occurs for the observation length of .
For the proposed ARSTFT, , , and are not fixed and are adapted according to the local variations of . The EASA performs comparisons and increments for the selected window (cf. (Section 2)). The choice of and window shape requires three comparisons. The selected signal is resampled before computing its DFT. The NNRI is employed for the resampling purpose. The NNRI only requires a comparison operation for each resampled observation. Therefore, the resampler performs comparisons. If , then a cosine window function is applied on the resampled data, which performs multiplications (cf. (Figure 1)). The DFT performs operations for the selected window. The combine computational complexity of the ARSTFT is given by (16). Where represents the index of the selected window. is a multiplying factor, its value is 1 for and 0 for . The computational gain of the ARSTFT over the classical one is calculated by employing the results of the illustrative example. The results are summarized in Table 3.
Table 3 shows the computational gain of the ARSTFT over the STFT for each activity. It shows that the ARSTFT leads to a significant reduction of the total number of operations as compared to the classical one. This reduction in operations is achieved by adapting , , and according to the local variations of .
A new tool for the adaptive resolution time-frequency analysis is proposed. The ARSTFT is especially well suited for the low activity sporadic signals like electrocardiogram, phonocardiogram, seismic signals, and so forth. It is shown that and adapt by following the local variations. Criteria to choose the appropriate and are developed. A complete methodology of adapting and for the selected window has been demonstrated.
The ARSTFT outperforms the STFT. The advantages of the ARSTFT over the STFT are the adaptive time-frequency resolution and the computational gain. These smart features of the ARSTFT are achieved due to the joint benefits of the AADC, the EASA, and the resampling as they enable to adapt , , , , and by exploiting the local variations of . The employment of fast algorithms in place of the DFT for the spectrum computation is in progress, it will further add up to the computational efficiency of the ARSTFT. Moreover, the performance comparison of the ARSTFT with other MRA techniques, in terms of computational complexity and quality, opens the way to new research prospective.
- S. C. Sekhar and T. V. Sreenivas, “Adaptive window zero-crossing-based instantaneous frequency estimation,” EURASIP Journal on Applied Signal Processing, vol. 2004, no. 12, pp. 1791–1806, 2004.
- D. Gabor, “Theory of communication,” Journal of the IEE, vol. 93, no. 3, pp. 429–457, 1946.
- R. Polikar, “The engineer's ultimate guide to wavelet analysis,” Rowan University, College of Engineering, retrieved in 2006.
- M. Vetterli and C. Herley, “Wavelets and filter banks: theory and design,” IEEE Transactions on Signal Processing, vol. 40, no. 9, pp. 2207–2232, 1992.
- H. K. Kwok and D. L. Jones, “Improved instantaneous frequency estimation using an adaptive short-time Fourier,” IEEE Transactions on Signal Processing, vol. 48, no. 10, pp. 2964–2972, 2000.
- J. Mark and T. Todd, “A nonuniform sampling approach to data compression,” IEEE Transactions on Communications, vol. 29, no. 1, pp. 24–32, 1981.
- S. M. Qaisar, L. Fesquet, and M. Renaudin, “Computationally efficient adaptive rate sampling and filtering,” in Proceedings of the 15th European Signal Processing Conference (EUSIPCO '07), pp. 2139–2143, Poznan, Poland, September 2007.
- M. Gretains, “Time-frequency representation based chirp like signal analysis using multiple level crossings,” in Proceedings of the 15th European Signal Processing Conference (EUSIPCO '07), pp. 2154–2158, Poznan, Poland, September 2007.
- K. M. Guan and A. C. Singer, “Opportunistic sampling by level-crossing,” in Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP '07), vol. 3, pp. 1513–1516, Honolulu, Hawaii, USA, April 2007.
- E. Allier, G. Sicard, L. Fesquet, and M. Renaudin, “A new class of asynchronous A/D converters based on time quantization,” in Proceedings of the 9th IEEE International Symposium on Asynchronous Circuits and Systems (ASYNC '03), pp. 196–205, Vancouver, BC, Canada, May 2003.
- N. Sayiner, H. V. Sorensen, and T. R. Viswanathan, “A level-crossing sampling scheme for A/D conversion,” IEEE Transactions on Circuits and Systems II, vol. 43, no. 4, pp. 335–339, 1996.
- F. Akopyan, R. Manohar, and A. B. Apsel, “A level-crossing flash asynchronous analog-to-digital converter,” in Proceedings of the 12th IEEE International Symposium on Asynchronous Circuits and Systems (ASYNC '06), pp. 12–22, Grenoble, France, March 2006.
- S. M. Qaisar, L. Fesquet, and M. Renaudin, “Spectral analysis of a signal driven sampling scheme,” in Proceedings of the 14th European Signal Processing Conference (EUSIPCO '06), Florence, Italy, September 2006.
- F. Aeschlimann, E. Allier, L. Fesquet, and M. Renaudin, “Asynchronous FIR filters: towards a new digital processing chain,” in Proceedings of the 10th IEEE International Symposium on Asynchronous Circuits and Systems (ASYNC '04), pp. 198–206, Crete, Greece, April 2004.
- S. de Waele and P. M. T. Broersen, “A time domain error measure for resampled irregular data,” in Proceedings of the 16th IEEE Instrumentation and Measurement Technology Conference (IMTC '99), vol. 2, pp. 1172–1177, Venice, Italy, May 1999.
Copyright © 2008 Saeed Mian Qaisar et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.