[NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale
video-understanding weakly-supervised-learning video-captioning multimodal-learning vision-and-language dense-video-captioning pre-training temporal-language-grounding video-chapter-generation vid2seq
-
Updated
Nov 13, 2023 - Jupyter Notebook