Complexity

Research Article

Effect Improved for High-Dimensional and Unbalanced Data Anomaly Detection Model Based on KNN-SMOTE-LSTM

The SMOTE algorithm steps.

	Input:T for training sample, N for oversampling rate, K for k-nearest neighbor parameter, and n for the number of the small-sized sample ()
	Output:S for new training sample
	Step 1: calculate the k-nearest neighbors of of minority samples () (Euclidean distance is adopted in this paper)
	Step 2: randomly select a sample from the k-nearest neighbors.
	Step 3: generate a random number ζ between 0 and 1 for synthesis of a new sample
	.
	Step 4: repeat the step 2 and step 3 according to the oversampling rate N
	Step 5: obtain new training sample S
	Step 6: the new training sample S was classified with a classifier
	Step 7: output classification results