Research Article

An Anomaly Detection Algorithm Selection Service for IoT Stream Data Based on Tsfresh Tool and Genetic Algorithm

Table 2

The name, design principle, and computing method of some features in Tsfresh.

NameDesign principleComputing method [16, 30]

MeanThe baseline of time series
Standard deviationThe standard deviation of time series
Coefficient of variationThe reflection of the degree of data dispersion
Local fluctuation 1The difference of the smooth curve and original curve
Local fluctuation 2Local fluctuation with a dynamic step
Smooth factorThe ratio of the whole number to the number of turning points
Symmetrical valueThe symmetry of the curve
Fluctuation ratioWhole fluctuation power
SkewnessThe estimation of the degree of statistical data distribution and the direction of skew is the digital characteristics of the asymmetric degree of statistical data distribution
approximate_entropyApproximate entropy is used to measure the periodicity, unpredictability, and volatility of a time seriesRefer to [16]
Autoregressive coefficientMeasure the cyclical nature of data
KurtosisThe feature number indicating the peak value of the probability density distribution curve at the average value
absolute_sum_of_changesAbsolute sum of first-order difference
Linear_trendCalculation of a linear least squares regression for the values of the time series to the sequence from 0 to the length of the time series −1Refer to [16]
fft_aggregatedReturns the variance, mean, kurtosis, skewness, and absolute Fourier transform spectrumRefer to [16]