Research Article

Visual Experience-Based Question Answering with Complex Multimodal Environments

Table 4

Performance analysis of scene graph generation depending on different state recognition models.

ModelsObject mAP (%)SGGen (%)
AttributeRelationTotal

68.7956.8746.7953.73
85.1269.6960.0967.35
85.1284.6160.0979.62
100.0100.095.8998.80