Research Article

An Improved Feature Extraction Approach for Web Anomaly Detection Based on Semantic Structure

Table 1

Four example URLs, each representing one type of HTTP request.

No. | URL                                    | Parameter sequence     | Semantic structure
1   | /question/search?q=docker              | Path 1, Path 2, q      | question/search?q=
2   | /question/top?q=windows server&page=10 | Path 1, Path 2, q, page | question/top?q=&page=
3   | /user/news?page=1                      | Path 1, Path 2, page   | /user/news?page=
4   | /teams/create/Linux                    | Path 1, Path 2, Path 3 | /teams/create/

Parameter sequence is the sequence of parameters extracted from the path and the query of the URL. Semantic structure is the semantic structure information of each type of HTTP request. The “” in the semantic structure denotes that the corresponding part is trivial; all other symbols mark salient segments.
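
As an illustration of the two derived columns in Table 1, the following minimal Python sketch builds a parameter sequence from a URL and masks the trivial parts to obtain a semantic-structure string. It is not the procedure of this article: the function names `parameter_sequence` and `mask_trivial`, the `trivial_paths` argument, and the `*` placeholder are hypothetical. The sketch also assumes that every query value is trivial (as in the four rows of the table) and that the trivial path positions are supplied by hand rather than learned.

```python
from urllib.parse import urlsplit, parse_qsl


def parameter_sequence(url):
    """Return labels such as ['Path 1', 'Path 2', 'q'] for a request URL."""
    parts = urlsplit(url)
    segments = [s for s in parts.path.split("/") if s]
    labels = [f"Path {i}" for i in range(1, len(segments) + 1)]
    # keep_blank_values=True so parameters with empty values are still listed
    labels += [name for name, _ in parse_qsl(parts.query, keep_blank_values=True)]
    return labels


def mask_trivial(url, trivial_paths=(), placeholder="*"):
    """Replace trivial parts of a URL with a placeholder (illustrative only).

    trivial_paths holds 1-based path positions assumed to be trivial.
    All query values are assumed to be trivial.
    """
    parts = urlsplit(url)
    segments = [s for s in parts.path.split("/") if s]
    masked = [
        placeholder if i in trivial_paths else s
        for i, s in enumerate(segments, start=1)
    ]
    names = [name for name, _ in parse_qsl(parts.query, keep_blank_values=True)]
    query = "&".join(f"{name}={placeholder}" for name in names)
    result = "/" + "/".join(masked)
    return f"{result}?{query}" if query else result


if __name__ == "__main__":
    for url, trivial in [
        ("/question/search?q=docker", ()),
        ("/question/top?q=windows server&page=10", ()),
        ("/user/news?page=1", ()),
        ("/teams/create/Linux", (3,)),
    ]:
        print(parameter_sequence(url), mask_trivial(url, trivial))
```

Passing the trivial positions explicitly keeps the sketch independent of how they are determined; in row 4, for example, marking Path 3 as trivial turns `/teams/create/Linux` into `/teams/create/*`.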