Currently we are able to select the new Mistral AI Pixtral 12B model.
This model is a multimodal model specialized in image analysis. In the latest AnythingLLM 1.2.2 we can upload an image, but we need to send the image to Pixtral as base64 in the request.
# Define the messages for the chat
messages = [
    {
        "role": "user",
        "content": [
            {
                "type": "text",
                "text": "What's in this image?"
            },
            {
                "type": "image_url",
                "image_url": f"data:image/jpeg;base64,{base64_image}"
            }
        ]
    }
]
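To build the `base64_image` value used above, the image file can be read and encoded with the standard library. This is a minimal sketch; `encode_image` and the file path are hypothetical names, not part of the Mistral API.

```python
import base64

def encode_image(image_path):
    # Read the image file as bytes and return a base64 string,
    # ready to embed in a "data:image/jpeg;base64,..." URL.
    with open(image_path, "rb") as f:
        return base64.b64encode(f.read()).decode("utf-8")

# Example usage (path is illustrative):
# base64_image = encode_image("photo.jpg")
```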
It can be used to transcribe images (OCR); see the example in the image below. You can even compare two images.
It could be very cool to add this feature to AnythingLLM :)
What would you like to see?
Doc about the feature:
https://docs.mistral.ai/capabilities/vision/#passing-a-base64-encoded-image