Emotional Video to Audio Transformation Using Deep Recurrent Neural Networks and a Neuro-Fuzzy System

<div>Overall architecture of the proposed method, with (a) ANFIS being trained for emotion classification from visual features, (b) deep LSTM-RNN being trained for domain transformation from visual to audio features, and (c) experimental stage for music generation from visual features.</div>

Mathematical Problems in Engineering

fig2

Figure 2

Figure 2: Emotional Video to Audio Transformation Using Deep Recurrent Neural Networks and a Neuro-Fuzzy System