Vision-Augmented Retrieval and Generation (VARAG) - Vision-first RAG Engine
Updated Sep 28, 2024 - Python
OCR and Document Search Web Application
A novel multimodal (vision) RAG architecture
LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.
A minimalist yet highly performant, lightweight, lightning-fast, multisource, multimodal, local embedding solution, built in Rust.
Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.
The code used to train and run inference with the ColPali architecture.