Research Article

Vision Transformer-Based Video Hashing Retrieval for Tracing the Source of Fake Videos

Table 4

Comparison experiment of fine-grained accuracy (ACC) with recent works on FaceForensics++ high-quality (HQ) and low-quality (LQ) datasets.

MethodsFF++ (HQ)FF++ (LQ)Celeb-DF
DFF2FFSNTDFF2FFSNT

Xception98.998.999.695.096.891.194.687.199.4
I3D92.992.996.490.491.186.491.478.699.2
LSTM99.699.398.293.996.488.294.388.295.7
TEI97.997.197.594.395.091.194.690.499.1
ADDNet-3d92.183.992.578.290.478.280.069.395.2
S-MIL98.699.399.395.796.891.494.688.699.2
S-MIL-T99.699.6100.094.397.191.196.186.898.8
STIL99.699.3100.095.498.292.197.191.899.8
VTN99.699.399.695.497.992.195.790.499.3
ISTVT99.699.6100.096.898.996.197.592.199.8
Ours99.999.999.90.99999.9100.099.999.999.4