Research Article

Cost-Sensitive Approach to Improve the HTTP Traffic Detection Performance on Imbalanced Data

Algorithm 1

Character-level feature (HTTP traffic path).
Input:HTTP traffic path
(1)Configure the resources occupied by the spark task
(2)Init spark session
(3)Initialize: Truncated fixed length: L, result: res
(4)feat-contract URL and post
(5)Filter non-ASCII characters of feat
(6)if(the length of feat ≥ L){
(7)feat = the first L character of the feat
(8)}
(9)else{
(10)  do{
(11)   feat = merge two feat strings
(12)  }
(13)  While(getLength( feat) > L)
(14)}
(15)if(the length of feat ≥ L){
(16)  feat = the first L character of the feat
(17)}
(18)Return feat; //return the string of fixed length