A Deep Multimodal Model for Predicting Affective Responses Evoked by Movies Based on Shot Segmentation

Figure 3: Extraction of vision and audio features of a shot.

Security and Communication Networks