Research Article

On Internet Traffic Classification: A Two-Phased Machine Learning Approach

Table 1

Traffic classification approaches.

CategoryClassification methodologyAttribute(s) usedGranularityProcessing timeSample tools/ML techniques

Port basedProtocol portProtocol portsHighLowAny (custom), PRTG network monitor [55], Nagios [56], Wireshark [48]

Payload inspectionDeep packet inspectionPayload inspection of, for example, first packets, first packet per directionHighHighOpenDPI [1], nDPI [45], L7 (TIE) [35]
Stochastic packet inferenceStatistical properties inherent in packet header and payloadHighHighNetzob [57], Polyglot [58], KISS [8]

Behavioural techniquesEnd-point behaviour monitoringIdentifying host (communication) behaviour patternLowModerateBLINC [46], SVM [59], naïve Bayes [60]
Traffic accounting Heuristic analysis of inspected packets, flows HighHighANTCs [61], naïve Bayes [60], Bayesian network [62]

Statistical approachesPacket basedPacket and payload size, interpacket arrival timeHighModerateNN [63], Hidden Markov/Gaussian Mixture Models
Flow basedDuration, transmission rate, multiple flow featuresLowLow-means/hierarchical clustering [27], J48 [30], C5.0 [31], BFTree [64], SVM [59]