You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I had a question about the training dataset for the end-to-end VNLI model. In the paper you mention:
Specifically, we finetune BLIP2 and PaLI-17B using a dataset comprising 110K text-image pairs labeled with alignment annotations. This includes 44K examples from COCO-Con, 3.5K from PickaPic-Con, 20K from COCO t2i and 40K from the training split of the SNLI-VE dataset.
However, I was unable to find the training split on AWS/Huggingface. Are there plans to release it, or if it has already been released, could you please point me to where I can find it?
The text was updated successfully, but these errors were encountered:
Hey, thanks for the great work!
I had a question about the training dataset for the end-to-end VNLI model. In the paper you mention:
Specifically, we finetune BLIP2 and PaLI-17B using a dataset comprising 110K text-image pairs labeled with alignment annotations. This includes 44K examples from COCO-Con, 3.5K from PickaPic-Con, 20K from COCO t2i and 40K from the training split of the SNLI-VE dataset.
However, I was unable to find the training split on AWS/Huggingface. Are there plans to release it, or if it has already been released, could you please point me to where I can find it?
The text was updated successfully, but these errors were encountered: