Intelligence Is beyond Learning: A Context-Aware Artificial Intelligent System for Video Understanding

<div>Semantic gap between human and computer perception of the physical world. (1) Human perception is represented by high-level features (concepts): watch the penalty (visual signal), scream (audio signal), and talk with the crowd (natural language processing). (2) Machine perception is represented by low-level features (texture, color, resolution, and encoding).</div>

Computational Intelligence and Neuroscience

Figure 1: Intelligence Is beyond Learning: A Context-Aware Artificial Intelligent System for Video Understanding