Journal of Healthcare Engineering

Research Article

Learning to Discriminate Adversarial Examples by Sensitivity Inconsistency in IoHT Systems

Table 4

Generalization evaluation of different attacks. Rec is recall and F1 is the F1-score. ΔR is the change in the current adversarial recall relative to the default result, and ΔF is the change in the current weighted-average F1-score relative to the default result. In each block, the row without ΔR and ΔF values is the baseline, that is, the experimental setup used to train the detector; the detector is then tested against the other attack methods with the same dataset and model.


Attack	CNN				LSTM				BERT
Attack	Rec (%)	ΔR (%)	F1 (%)	ΔF (%)	Rec (%)	ΔR (%)	F1 (%)	ΔF (%)	Rec (%)	ΔR (%)	F1 (%)	ΔF (%)

TextFooler	99.2	-	96.3	-	97.8	-	95.4	-	97	-	97	-
PWWS	98.2	+0.1	96.0	+0.8	91.7	−0.7	91.8	−0.4	95.4	−2.0	96.8	+0.3
BAE	99.2	+1	96.3	+0.5	96.5	−0.1	95.0	−0.8	93.2	−2.8	94.6	−0.9
Deepwordbug	97.4	−0.8	94.8	−0.4	92.0	0	91.8	+0.4	93.2	−3.0	94.9	−1.0

TextFooler	99.0	−0.2	95.5	−0.8	97.4	−0.4	95.8	+0.4	97.5	+0.5	96.2	−0.8
PWWS	98.1	-	95.2	-	92.4	-	92.2	-	97.4	-	96.5	-
BAE	98.8	+0.6	94.7	−1.1	97.2	+0.6	94.6	−1.2	95	−1.0	95.3	−0.2
Deepwordbug	97.8	−0.4	94.8	−0.4	92.2	+0.2	91.2	−0.2	94.6	−1.6	94.3	−1.6

TextFooler	98.8	−0.4	95.9	−0.4	96.9	−0.9	95.5	+0.1	97.4	+0.4	96.3	−0.7
PWWS	97.6	−0.6	96.4	+1.2	94.6	+2.4	93.9	+1.7	95.6	−1.8	95.7	−0.8
BAE	98.2	-	95.8	-	96.6	-	95.8	-	96	-	95.5	-
Deepwordbug	97.0	−1.2	94.7	−1.1	91.0	−1.0	91.9	+0.5	95.6	−0.6	95.3	−0.6

TextFooler	99.4	+0.2	96.3	0	95.8	−2.0	95.1	−0.3	97.8	+0.8	96.6	−0.4
PWWS	97.6	−0.6	95.6	+0.4	93.6	+1.2	91.7	−0.5	95.0	−2.4	95.7	−0.8
BAE	98.6	+0.4	95.3	−0.5	95.1	−1.5	94.3	−1.5	93.8	−2.2	93.9	−1.6
Deepwordbug	98.2	-	95.2	-	92.0	-	91.4	-	96.2	-	95.9	-
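
The variation columns can be reproduced with a short calculation. The sketch below is one reading of the table, not the authors' code. It assumes that each block corresponds to a detector trained on the attack whose row has no variation values, and that the default result for an attack is the score obtained when the detector is trained and tested on that same attack. The names `cnn_recall` and `delta_vs_default` are hypothetical; the numbers are the CNN recall values copied from the table.

```python
# CNN recall (%) from Table 4, indexed as cnn_recall[train_attack][test_attack].
# Assumption: the row without variation values in each block identifies the
# attack used to train the detector for that block.
cnn_recall = {
    "TextFooler": {"TextFooler": 99.2, "PWWS": 98.2, "BAE": 99.2, "Deepwordbug": 97.4},
    "PWWS": {"TextFooler": 99.0, "PWWS": 98.1, "BAE": 98.8, "Deepwordbug": 97.8},
    "BAE": {"TextFooler": 98.8, "PWWS": 97.6, "BAE": 98.2, "Deepwordbug": 97.0},
    "Deepwordbug": {"TextFooler": 99.4, "PWWS": 97.6, "BAE": 98.6, "Deepwordbug": 98.2},
}


def delta_vs_default(scores, train_attack, test_attack):
    """Change in score on test_attack when the detector was trained on
    train_attack, relative to training and testing on test_attack itself."""
    default = scores[test_attack][test_attack]
    return round(scores[train_attack][test_attack] - default, 1)


# Detector trained on TextFooler, tested on the other attacks.
print(delta_vs_default(cnn_recall, "TextFooler", "PWWS"))         # 0.1
print(delta_vs_default(cnn_recall, "TextFooler", "BAE"))          # 1.0
print(delta_vs_default(cnn_recall, "TextFooler", "Deepwordbug"))  # -0.8
```

Under this reading most entries in the table are reproduced exactly. A few printed entries differ by 0.1 to 0.2 (for example, the CNN recall variation for PWWS in the third block), which may reflect rounding of unrounded scores.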