Official code for the paper "Mantis: Multi-Image Instruction Tuning" [TMLR 2024]
Updated Feb 17, 2025 - Python
Caption images across your datasets with state-of-the-art models from Hugging Face and Replicate!
Fuyu multi-modal language model for use with Autodistill.
The Fuyu programming language
Hands-on experiments with some multimodal models
Testing NVIDIA machine learning API models