[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)
nlp video vision captioning-videos vision-and-language grounding pytorch-implementation visual-grounding video-grounding video-object-grounding object-grounding
-
Updated
Jun 10, 2020 - Python