Research Article

A Deep Multimodal Model for Predicting Affective Responses Evoked by Movies Based on Shot Segmentation

Table 4

Comparison of state-of-the-art results for experienced emotion prediction.

Features                  Arousal (loss1)       Valence (loss2)
                          MSE       PCC         MSE       PCC

All features              0.0275    0.6187      0.0632    0.3443
 −Action features         0.0291    0.6038      0.0673    0.3259
 −Face features           0.0277    0.6136      0.0637    0.3667
 −Person features         0.0280    0.6181      0.0653    0.3726
 −Place features          0.0280    0.5981      0.0663    0.3315
 −VGGish features         0.0290    0.5952      0.0669    0.3444
 −OpenSMILE features      0.0295    0.6003      0.0666    0.3345
All_visual_features       0.0316    0.4931      0.0751    0.2694
All_audio_features        0.0297    0.6141      0.0726    0.3356

“−” indicates that the named feature was removed from the full feature set.
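
For readers who want to reproduce the two metrics reported in Table 4, the following is a minimal sketch. It assumes that MSE denotes the mean squared error and PCC the Pearson correlation coefficient between predicted and ground-truth scores. The function names and the toy values are hypothetical and are not taken from the article.

```python
import math


def mse(y_true, y_pred):
    """Mean squared error between two equal-length sequences."""
    n = len(y_true)
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / n


def pcc(y_true, y_pred):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(y_true)
    mean_t = sum(y_true) / n
    mean_p = sum(y_pred) / n
    # Unnormalized covariance and variances; the 1/n factors cancel in the ratio.
    cov = sum((t - mean_t) * (p - mean_p) for t, p in zip(y_true, y_pred))
    var_t = sum((t - mean_t) ** 2 for t in y_true)
    var_p = sum((p - mean_p) ** 2 for p in y_pred)
    return cov / math.sqrt(var_t * var_p)


# Hypothetical toy scores for illustration only (not data from the article).
arousal_true = [0.1, 0.4, 0.35, 0.8]
arousal_pred = [0.2, 0.3, 0.45, 0.7]

print("MSE:", mse(arousal_true, arousal_pred))
print("PCC:", pcc(arousal_true, arousal_pred))
```

A lower MSE and a higher PCC both indicate better agreement, which is how the rows of the table can be compared against the "All features" baseline.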