Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"
Research code from the Multimodal-Cognition Team at Ant Group
[NeurIPSw'24] This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control"
[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"
The code repository for "Wings: Learning Multimodal LLMs without Text-only Forgetting" [NeurIPS 2024]
Streamlit app to chat with images using multimodal LLMs (a minimal sketch follows this list).
Kani extension for supporting vision-language models (VLMs). Comes with model-agnostic support for GPT-Vision and LLaVA.
LLaVA base model for use with Autodistill.
Medical report generation and VQA (adapting XrayGPT to any modality)
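
To illustrate what an image-chat app like the Streamlit project above involves, here is a minimal sketch. It assumes the `streamlit` and `openai` packages, an `OPENAI_API_KEY` environment variable, and the model name `gpt-4o`; none of these details are taken from that repository.

```python
# Minimal sketch of a Streamlit "chat with an image" app.
# Assumptions: `streamlit` and `openai` are installed, and
# OPENAI_API_KEY is set in the environment; "gpt-4o" is an
# illustrative model choice, not one mandated by any repo listed here.
import base64

import streamlit as st
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

st.title("Chat with an image")
uploaded = st.file_uploader("Upload an image", type=["png", "jpg", "jpeg"])
question = st.chat_input("Ask something about the image")

if uploaded and question:
    st.image(uploaded)
    # Encode the upload as a base64 data URL so it can be sent inline.
    b64 = base64.b64encode(uploaded.getvalue()).decode("utf-8")
    data_url = f"data:{uploaded.type};base64,{b64}"

    with st.chat_message("user"):
        st.write(question)

    # Send the question and image together to a vision-capable model.
    response = client.chat.completions.create(
        model="gpt-4o",
        messages=[{
            "role": "user",
            "content": [
                {"type": "text", "text": question},
                {"type": "image_url", "image_url": {"url": data_url}},
            ],
        }],
    )

    with st.chat_message("assistant"):
        st.write(response.choices[0].message.content)
```

Run it with `streamlit run app.py`; Streamlit reruns the script on each interaction, so the upload and the chat input gate the model call until both are present.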