Research Article

A Deep Multimodal Model for Predicting Affective Responses Evoked by Movies Based on Shot Segmentation

Figure 3

Extraction of vision and audio features of a shot.