Research Article

Scaling Human-Object Interaction Recognition in the Video through Zero-Shot Learning

Table 6

The impact of the RNNs (LSTM/GRU) on the proposed system’s performance. Training and testing are performed on all data.

MethodmAP (%)
LSTMGRU

2Stream + WE - VLAD22.4522.20
3Stream + WE - VLAD23.7623.52
2Stream + WE + VLAD(rnn)23.6423.43
3Stream + WE + VLAD(rnn)24.7324.58