Research Article
Emotional Video to Audio Transformation Using Deep Recurrent Neural Networks and a Neuro-Fuzzy System
Figure 2
Overall architecture of the proposed method: (a) training of the ANFIS for emotion classification from visual features, (b) training of the deep LSTM-RNN for domain transformation from visual to audio features, and (c) the experimental stage, in which music is generated from visual features.