Research Article

Vision Transformer-Based Video Hashing Retrieval for Tracing the Source of Fake Videos

Table 1

Robustness experiments with various video preprocessing and different hash bits on FaceForensics++.

Video processingFF++ raw (bits)FF++ C23 (bits)FF++ C40 (bits)
641282565121024641282565121024641282565121024

None0.8520.9320.9480.9980.9910.8470.9440.9440.9980.9900.8460.9410.9460.9970.991
Sharpening0.8500.9300.9490.9990.9910.8450.9430.9450.9990.9890.8470.9420.9450.9960.991
Noise0.8440.9370.9440.9990.9900.8440.9330.9400.9990.9910.8530.9440.9420.9990.991
Blur0.8460.9340.9470.9980.9910.8440.9440.9390.9990.9910.8480.9420.9410.9990.991
Median filter0.8500.9350.9480.9980.9910.8440.9450.9410.9980.9910.8510.9420.9460.9970.992
Video crop0.6330.8010.8620.9830.9630.6360.8620.8140.9860.9620.6290.8160.8590.9880.964

Bold values represent the best results in the correlation domain.