Research Article

Emotional Video to Audio Transformation Using Deep Recurrent Neural Networks and a Neuro-Fuzzy System

Figure 2

Overall architecture of the proposed method, with (a) ANFIS being trained for emotion classification from visual features, (b) deep LSTM-RNN being trained for domain transformation from visual to audio features, and (c) experimental stage for music generation from visual features.