Research Article

Visual Experience-Based Question Answering with Complex Multimodal Environments

Figure 7: Three-dimensional Localization Network (TLN).