The short-time Fourier transform (STFT) is a classical tool, used for characterizing the time varying signals. The limitation of the STFT is its fixed time-frequency resolution. Thus, an enhanced version of the STFT, which is based on the cross-level sampling, is devised. It can adapt the sampling frequency and the window function length by following the input signal local characteristics. Therefore, it provides an adaptive resolution time-frequency representation of the input signal. The computational complexity of the proposed STFT is deduced and compared to the classical one. The results show a significant gain of the computational efficiency and hence of the processing power.
1. Introduction
Most of the real-life signals like
speech, Doppler, seismic, and biomedical signals are time
varying in nature. The spectral contents of these signals vary with time, which
is a direct consequence of the signal generation process [1]. The STFT is a
classical tool for characterizing such signals [2]. The limitation with the
STFT is that it provides a fixed resolution time-frequency representation of
the input signal. This fixed resolution is the reason for the creation of the multiresolution
analysis (MRA) techniques [3–5], which provide a good frequency but a poor
time resolution for the low-frequency events and a good time but apoor
frequency resolution for the high-frequency
events. This type of analysis is
well suited for most of the real-life signals [3].
In this article, the fixed
resolution dilemma is resolved to a certain extent by revising the STFT. The
motivation behind the proposed STFT is to achieve a smart time-frequency
representation of the time varying signals. The idea is to adapt the
time-frequency resolution along with the computational load by following the
input signal local characteristics. An efficient solution is proposed by
smartly combining the features of both uniform and nonuniform signal processing
tools.
2. Proposed Adaptive Resolution STFT
The block diagram of the proposed
STFT is shown in Figure 1. The description of different blocks is given below.
Figure 1: Block diagram of the
proposed STFT. “—” represents the signal flow, “… …”
represents the control flow, and “- - - - -” represents the parameters flow, at
system different stages.
2.1. Asynchronous Analog to Digital Converter (AADC)
According to [6], the sampling instants of a nonuniformly sampled
signal obtained with the level crossing sampling scheme (LCSS) are defined by (1).
Where is the current sampling instant, is the previous one, and is the time delay between
the current and the previous sampling instants (cf. (2)).
The LCSS drastically reduces the
activity of the post processing chain, because it only captures the relevant
information [7–9]. In this context, analog to digital converters based
on the LCSS have been developed [10–12]. The AADC [10] is employed for
digitizing . An -bit resolution AADC has quantization levels which are uniformly disposed according to amplitude
dynamics. The AADC has a finite
bandwidth. Thus, to assure a proper signal capturing a band pass filter with pass band
is employedat the AADC input. Let and be the AADC and amplitude dynamics, respectively.
In order to avail the complete AADC resolution in the studied case, is always adapted to match . For an AADC, the maximum and the minimum sampling frequencies [7] are defined by (3) and (4),
respectively. Where, and are the maximum and the minimum sampling frequencies of the AADC. is the bandwidth and is the fundamental frequency of :
2.1.1. Enhanced Activity Selection Algorithm (EASA) and Window Selector
The
relevant parts of the nonuniformly sampled signal obtained
with the AADC are selected—corresponds to the
variable length rectangular window—by the EASA. The EASA is defined as shown in Algorithm 1.
is the fundamental period of . and detect parts of the nonuniformly sampled signal with activity. The condition on is chosen in order to satisfy the Nyquist criterion for ,
when sampling nonuniformly with the AADC [13]. represents the number of nonuniform samples lie in the selected window, which lie on the active part of the nonuniformly
sampled signal. Where, and both belong to the set of natural
numbers . represents the
upper bound on . The choice of depends on characteristics and on system parameters. The above
described loop repeats for each selected window, which occurs during the
observation length of . Every time before repeating the loop, is incremented and is initialized to zero.
The
EASA displays interesting features with the LCSS, which
are not available in the classical case. It selects only the active parts of
the nonuniformly sampled signal, obtained with the AADC. Moreover, it
correlates the length of the selected window with the signal local
characteristics.
The
window selector implements the condition given by expression (5). Jointly, the EASA and the window selector provide an efficient
spectral leakage reduction in the case of transient signals [13]. Indeed,
spectral leakage occurs due to the signal truncation problem. Usually an appropriate smoothening (cosine)
window function is employed to reduce the signal truncation. For the proposed
case, as long as the condition 5 is true, the leakage problem is resolved by
avoiding the signal truncation. As no signal truncation occurs so no cosine
window is required. In this case the window decision ,
which makes the switch state 1 (cf. (Figure 1)). Otherwise, an appropriate
cosine window is employed to reduce the signal truncation problem. In this case , which makes the switch state 0. In
expression 5, represents the 1st sampling instant of the selected window and represents the last sampling instant of the selected window.
For proper spectral representation,
the condition given by expression (6) should be satisfied [13]. Where, is the length in seconds of the selected window. In order to satisfy
this condition for the worst case, which occurs for , is calculated for an appropriate reference window length .
Where, satisfies the condition . The process
is given by (7) as follows:
The lower and the upper bounds on are posed, respectively, by and the system resources (the
maximum sample frame which the system can process at once). For (cf. (7)), the
condition 6 holds for all selected windows except for the case when the actual
length of the activity is less than .
2.1.2. Adaptive Sampling Rate
The AADC sampling frequency is correlated to local
variations [7, 13]. Let represent the AADC sampling
frequency for the selected window. can be calculated by using (8). Where, and are the final and the initial times of the
selected window. The upper and the lower bounds on are
posed by and , respectively:The selected data obtained with the
EASA can be used directly for further nonuniform digital processing [8, 14].
However in the studied case, the selected data is resampled uniformly. It
enables to take advantage of both nonuniform and uniform signal processing
tools [7, 13]. Due to this resampling there will be an additional error.
Nevertheless, prior to this transformation, one can take advantage of the
inherent oversampling
of the relevant signal parts in the system [7]. Hence, it adds to the accuracy
of the post resampling process [11]. The nearest neighbour resampling
interpolation (NNRI) is employed for data resampling. The reasons of
inclination towards NNRI are discussed in [13, 15].
A reference sampling frequency is chosen, such as itremains greater than and closest to the . Depending upon the values of and , the resampling frequency (cf. (Figure 1)) can be adapted for the selected window. For the
case, , is chosen as: . It is done
in order to resample the selected data, lie in the selected
window closer to the Nyquist frequency. It avoids the unnecessary
interpolations during the data resampling process and so reduces the
computational load of the proposed technique.
For the case, is chosen as: . In this case, it appears that the data lie in the selected window may be resampled at a frequency which is less than and so it can cause aliasing. Since, the sampling rate of the AADC varies
according to the slope of [10]. A high-frequency signal part has a
high slope and the AADC samples it at a higher rate and vice versa. Hence, a
signal part with only low-frequency components can be sampled by the AADC at a subNyquist
frequency of . But still this signal part is locally oversampled in
time with respect to its local bandwidth [7]. Hence, there is no danger of
aliasing. This statement is further illustrated with the results summarized in
Table 1.
Table 1: Summary of parameters of
the selected windows.
2.1.3. Adaptive Resolution Analysis
The STFT of a sampled signal is determined by computing
the discrete Fourier transform (DFT) of an samples segment centred on , which
describes the spectral contents of around the instant . Where is defined as: . Here, is
the effective length in seconds of the window function and is
the sampling frequency. The STFT can be expressed mathematically by (9). In Equation (9), is the frequency index, which is normalized with respect to .
controls the STFT time and frequency resolution [2]. In the classical case, the
input signal is sampled at a fixed sampling frequency , regardless of
its local variations. Thus, a fixed results into a fixed . In
this case, the time resolution and the
frequency resolution of the STFT
can be defined by (10) and (11), respectively. Equation (11) shows that for a
fixed can be increased by increasing . But increasing requires to increase which will reduce (cf. (10)). Thus, a larger provides a better but a poor , and vice
versa. This conflict between and is the
reason for the creation of the MRA techniques [3–5].
The proposed STFT is a smart
alternative of the MRA techniques. It performs adaptive time-frequency
resolution analysis, which is not attainable with the classical STFT. It is
achieved by adapting the , and according to the local variations of . is the number of resampled data points that lie in the selected window. Thus, the time
resolution and the frequency resolution of
the proposed STFT can be specific for the selected window and are defined by (12) and (13), respectively. Because of this
adaptive resolution, the proposed STFT will be named as the adaptive resolution
STFT, (ARSTFT) throughout the following parts of this article. The adaptation
of ,
and also adds to the
computational gain of the ARSTFT, compared to the classical one. It is achieved
firstly by avoiding the unnecessary samples to process and secondly by avoiding
the use of the cosine window function as far as the condition 5 is true. The
ARSTFT is defined by (14). In (14), and are the central
time and the frequency index of the selected window, respectively. is normalized with respect to . is the index of the resampled data
points lie in the selected
window. The notation represents that the window function length and shape (rectangle or cosine) can be adapted for the selected window:
3. Illustrative Example
In order to illustrate the ARSTFT an
input signal , shown on the left part of Figure 2 is
employed. Its total duration is 30 seconds and it consists of three active
parts. Each activity is a sinusoid of 0.9 v amplitude and of 50, 200, and 500 Hz frequency, respectively. The time length of each activity is 5, 0.5, and 1.6
seconds, respectively. is band
limited between 50 to 500 Hz and it
is sampled by employing a 3-bit resolution AADC. Thus, and become 7 kHz and 0.7 kHz, respectively (3), (4). = 1.25 kHz and = 1.8 v are chosen. The selected data obtained with the EASA is shown on
the right part of Figure 2. By following the criteria given in Section 2, is chosen, which
leads to 5 selected windows. First, two selected windows correspond to the
first two activities and the remaining corresponds to the third activity. The
last three selected windows are not distinguishable on the right part of Figure 2, because they lie consecutively on the third activity. The parameters of each
selected window are summarized in Table 1.
Figure 2: Input signal (left) and
selected signal (right).
Table 1 exhibits the interesting
features of the ARSTFT. represents
the sampling frequency adaptation by following the local variations of . It is achieved due to the smart
features of the AADC and the EASA. shows that the relevant signal parts are locally oversampled in time with
respect to their local bandwidths [7]. shows the adaptation of the resampling frequency for each selected window. It
further adds to the computational gain of the ARSTFT, by avoiding the
unnecessary interpolations during the resampling process. shows how the adaptation of avoids the processing of unnecessary samples during
the spectral computation. exhibits the EASA dynamic feature, which is to correlate the window
function length with the local variations of . Adaptation of , and leads to the adaptive time-frequency resolution, which is clear from the values
of and in Table 2.
Table 2: Time and frequency
resolution of the selected windows.
Table 2 demonstrates that ARSTFT
adapts its time-frequency resolution by following the local variations of . It provides a good time but a poor
frequency resolution for the high frequency parts of , and vice
versa. It is the type of analysis, well suited for most of the real-life
signals [3]. The spectrum of each selected window is computed and plotted with
respect to on Figure 3. Figure 3 shows the fundamental and the
periodic spectrum peaks of each selected window. In this case, the spectrum
periodic frequency for the selected window is equal to . It shows the
adaptation of , which can
be visualized from Figure 3.
Figure 3: The ARSTFT of the selected
windows.
The ARSTFT also adapts the window
shape (rectangle or cosine) for the selected window. The condition 5 remains true for the first two selected
windows, which sets . As no signal truncation occurs so no
cosine window is required in this case. On the other hand, the number of
samples for the fourth activity is 11200. Therefore, leads to the three selected windows for the
time span of the fourth activity. The condition 5 becomes false in this case,
which sets . As signal truncation occurs so
suitable length cosine (Hanning) windows are employed to reduce this effect.
In the classical case, if is chosen, in order to satisfy the Nyquist sampling criterion for .
Then the whole signal will be sampled at 1.25 kHz, regardless of its
local variations. It will produce unnecessary samples than required. Moreover,
the windowing process is not able to select only the active parts of the
sampled signal. In addition, remains static and is not able to adapt
with the signal local variations. Thus, it causes the system to process
needless samples and so causes an increased computational activity than the
proposed case. For classical case, fixed = 4096 will produce nine fixed = 3.3
second windows, for the total time span of 30 seconds. It will lead
to fix seconds
and Hz for
all nine windows (cf. (10) and (11) .
4. Computational Complexity
This section compares the
computational complexity of the ARSTFT with the classical STFT. The complexity
evaluation is made by considering the number of operations executed to perform
the algorithm.
In the classical case, is fixed. In this case, a time
invariant, fixed , cosine window
function is employed to window the sampled data. If is the number of samples lie in the window then the windowing
operation will perform multiplications between and (cf. (9)). The
spectrum of the windowed data is obtained by computing its DFT. A complex term
is involved in the DFT computation. The DFT complexity is calculated by taking
the real and the imaginary parts separately. The DFT performs additions and multiplications, thus
operations count becomes for output frequencies. The combined computational
complexity of the STFT
is given by (15). Where, is the
total number of windows occurs for the observation length of .
For the
proposed ARSTFT, , ,
and are not fixed and are adapted according to the
local variations of . The EASA performs comparisons and increments for the selected
window (cf. (Section 2)). The choice of and window shape requires three comparisons. The selected signal is
resampled before computing its DFT. The
NNRI is employed for the resampling purpose. The NNRI only requires a
comparison operation for each resampled observation. Therefore, the resampler
performs comparisons.
If , then a cosine
window function is applied on the resampled data, which performs multiplications (cf. (Figure 1)). The DFT
performs operations for the selected window. The combine computational complexity of the ARSTFT is given by (16). Where represents the index of
the selected window. is a
multiplying factor, its value is 1 for and 0 for . The computational
gain of the ARSTFT over the classical one is calculated by employing the
results of the illustrative example. The results are summarized in Table 3.
Table 3: Summary of the
computational gain.
Table 3 shows the computational gain
of the ARSTFT over the STFT for each activity. It shows that the
ARSTFT leads to a significant reduction of the total number of operations as
compared to the classical one. This reduction in operations is achieved by
adapting , , and according to the local variations of .
5. Conclusions
A new tool for the adaptive
resolution time-frequency analysis is proposed. The ARSTFT is especially well
suited for the low activity sporadic signals like electrocardiogram, phonocardiogram,
seismic signals, and so forth. It is shown that and adapt by following the local
variations. Criteria to choose the appropriate and are developed. A complete methodology of adapting and for the selected window has been
demonstrated.
The ARSTFT outperforms the STFT. The
advantages of the ARSTFT over the STFT are the adaptive time-frequency
resolution and the computational gain. These smart features of the ARSTFT are
achieved due to the joint benefits of the AADC, the EASA, and the resampling as
they enable to adapt , , , , and by exploiting the local variations of .
The employment of fast algorithms in place of the DFT for the spectrum
computation is in progress, it will further add up to the computational efficiency
of the ARSTFT. Moreover, the performance comparison of the ARSTFT with other
MRA techniques, in terms of computational complexity and quality, opens the way
to new research prospective.
Algorithm 1: Enhanced Activity Selection Algorithm (EASA).