Research Article

Learning to Discriminate Adversarial Examples by Sensitivity Inconsistency in IoHT Systems

Table 3

Detection performance of three detection methods. The model, dataset, and attack method are consistent for the training and testing phases. As Deepwordbug is a character-level attack and FGWS detection is just designed for word-level attacks, the experimental results of Deepwordbug detection with FGWS are not meaningful, and “—” in the table indicates that the experiment is not conducted.

ModelDatasetAttackRecall (%)F1-score (%)
FGWSWDRSIFDFGWSWDRSIFD

BERTAG’s newsTextFooler81.583.091.787.586.190.7
PWWS85.187.991.989.790.592.2
BAE49.780.086.757.281.284.5
Deepwordbug75.485.078.385.6
IMDBTextFooler79.995.597.286.695.896.4
PWWS82.592.795.585.894.296.0
BAE56.790.396.267.893.196.3
Deepwordbug92.094.292.794.8

CNNAG’s newsTextFooler82.992.095.586.289.791.5
PWWS86.891.094.091.286.090.6
BAE56.788.292.462.185.588.5
Deepwordbug91.092.486.384.9
IMDBTextFooler75.989.999.785.391.597.8
PWWS80.287.299.086.087.296.5
BAE59.888.998.270.187.196.5
Deepwordbug91.297.989.695.7

LSTMAG’s newsTextFooler86.291.396.290.187.891.2
PWWS84.784.694.590.486.888.5
BAE62.288.291.767.988.890.3
Deepwordbug83.488.683.384.1
IMDBTextFooler77.494.897.883.895.095.4
PWWS70.592.592.080.092.492.7
BAE48.895.596.957.495.597.7
Deepwordbug92.092.293.691.5

Bold values indicate the optimal results among three defense methods.