Research Article

Intelligence Is beyond Learning: A Context-Aware Artificial Intelligent System for Video Understanding

Figure 1

Semantic gap between human and computer perception of the physical world. (1) Human perception is represented by high-level features (concepts): watch the penalty (visual signal, scream (audio signal), and talk with the crowd (natural language processing). (2) Machine perception is represented by low-level features (texture, color, resolution, and encoding).