
[Model] Initialize deepseek-vl support #5817

Open — wants to merge 66 commits into base: main

Conversation

@liuyancong-enflame-tech commented Jun 25, 2024

Tested on NVIDIA L40S

  • Test Example
from PIL import Image

from vllm import LLM, SamplingParams

sample_params = SamplingParams(temperature=0, max_tokens=1024)

model_path = "/pretrained_models/deepseek-vl-7b-chat"

# Load the model with a reduced context length.
llm = LLM(
    model=model_path,
    max_model_len=3072,
)
print("model load finished")

prompt = "You are a helpful language and vision assistant. You are able to understand the visual content that the user provides, and assist the user with a variety of tasks using natural language.\n User: <image_placeholder> Describe each stage of this image in detail.\nAssistant:"

# Pass the PIL image alongside the prompt as multi-modal data.
image = Image.open("/opt/TV/VLM01/tests/images/cherry_blossom.jpg")
image = image.convert("RGB")
outputs = llm.generate(
    {
        "prompt": prompt,
        "multi_modal_data": {"image": image},
    },
    sample_params,
)
for o in outputs:
    generated_text = o.outputs[0].text
    print(generated_text)

  • Output
The image captures a scene where a tall tower, which appears to be a communication or observation tower, is partially obscured by cherry blossom trees in full bloom. The tower is situated in the background, and the sky behind it is a clear blue.

The cherry blossom trees, which are in the foreground, are in various stages of bloom. Some branches are densely packed with pink blossoms, while others have fewer or no flowers at all. The blossoms are in various shades of pink, ranging from light to deep, indicating the progression of the bloom cycle.

The perspective of the image is from below, looking upwards towards the tower, which gives a sense of scale and grandeur to the tower. The branches of the cherry blossom trees frame the tower, creating a natural border that draws the viewer's eye towards the tower.

There are no discernible texts or other objects in the image. The focus is solely on the tower and the cherry blossom trees, with the blue sky providing a contrasting backdrop. The image does not contain any people or moving elements, suggesting a still, serene moment captured in time.

FIX #3356
FIX #4982

@liuyancong-enflame-tech (Author)

Contributed by enflame-tech

@DarkLight1337 (Member) left a comment


Thanks for the contribution! I have a few initial comments.

Apart from that, can you add a test case (similar to test_llava.py) to test the correctness of the model in CI?
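
As a starting point, here is a minimal sketch of such a test, loosely modeled on test_llava.py. It assumes vLLM's hf_runner and vllm_runner pytest fixtures can be used as context managers and accept vision inputs via an images argument; the prompt, image path, and exact output comparison are illustrative, not the final test:

import pytest
from PIL import Image

MODELS = ["deepseek-ai/deepseek-vl-7b-chat"]

# Same prompt format as the example above.
PROMPT = ("You are a helpful language and vision assistant.\n"
          " User: <image_placeholder> Describe the image.\nAssistant:")


@pytest.mark.parametrize("model", MODELS)
@pytest.mark.parametrize("dtype", ["half"])
@pytest.mark.parametrize("max_tokens", [128])
def test_models(hf_runner, vllm_runner, model, dtype, max_tokens) -> None:
    # Hypothetical test image path, mirroring the example above.
    image = Image.open("tests/images/cherry_blossom.jpg").convert("RGB")

    # Greedy generation with the reference HuggingFace implementation.
    with hf_runner(model, dtype=dtype) as hf_model:
        hf_outputs = hf_model.generate_greedy([PROMPT], max_tokens,
                                              images=[image])

    # Greedy generation with the vLLM implementation under test.
    with vllm_runner(model, dtype=dtype) as vllm_model:
        vllm_outputs = vllm_model.generate_greedy([PROMPT], max_tokens,
                                                  images=[image])

    # Greedy outputs from the two implementations should match.
    assert hf_outputs == vllm_outputs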

Two review threads on vllm/model_executor/models/deepseek_vl.py (outdated, resolved).
@liuyancong-enflame-tech changed the title from "[Model] Initialize deepseek-vl-7b-chat support" to "[Model] Initialize deepseek-vl support" on Jun 26, 2024
@liuyancong-enflame-tech (Author)

Now supports deepseek-ai/deepseek-vl-7b-chat and deepseek-ai/deepseek-vl-1.3b-chat.

@liuyancong-enflame-tech (Author)

This model depends on timm>=0.9.16, which in turn depends on torch; this conflicts with the dependencies of other third-party components and causes the pipeline to fail. Running this model therefore requires an extra installation step, and I am not sure whether that is appropriate. In addition, the model depends on many timm modules, which are difficult to remove.

@DarkLight1337 (Member)

> This model depends on timm>=0.9.16, which in turn depends on torch; this conflicts with the dependencies of other third-party components and causes the pipeline to fail. Running this model therefore requires an extra installation step, and I am not sure whether that is appropriate. In addition, the model depends on many timm modules, which are difficult to remove.

Can you implement the individual timm modules inside vLLM? (where possible, you should use vLLM-specific layers to improve the performance anyway)
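
For illustration, porting one of the simpler timm blocks might look roughly like this. This is a minimal sketch, assuming vLLM's tensor-parallel linear layers from vllm.model_executor.layers.linear; the VisionMlp module and its dimensions are made up for the example, not taken from DeepSeek-VL:

import torch
import torch.nn as nn

from vllm.model_executor.layers.linear import (ColumnParallelLinear,
                                               RowParallelLinear)


class VisionMlp(nn.Module):
    """A timm-style MLP block rewritten with vLLM's parallel linear layers."""

    def __init__(self, dim: int, hidden_dim: int) -> None:
        super().__init__()
        # Column-parallel for the up-projection, row-parallel for the
        # down-projection, so the block shards cleanly across GPUs.
        self.fc1 = ColumnParallelLinear(dim, hidden_dim)
        self.act = nn.GELU()
        self.fc2 = RowParallelLinear(hidden_dim, dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # vLLM's linear layers return (output, output_bias) tuples.
        x, _ = self.fc1(x)
        x = self.act(x)
        x, _ = self.fc2(x)
        return x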

@liuyancong-enflame-tech (Author)

> > This model depends on timm>=0.9.16, which in turn depends on torch; this conflicts with the dependencies of other third-party components and causes the pipeline to fail. Running this model therefore requires an extra installation step, and I am not sure whether that is appropriate. In addition, the model depends on many timm modules, which are difficult to remove.
>
> Can you implement the individual timm modules inside vLLM? (where possible, you should use vLLM-specific layers to improve the performance anyway)

OK, I will try to do this; I think it will take some time.

@DarkLight1337 (Member)

> > > This model depends on timm>=0.9.16, which in turn depends on torch; this conflicts with the dependencies of other third-party components and causes the pipeline to fail. Running this model therefore requires an extra installation step, and I am not sure whether that is appropriate. In addition, the model depends on many timm modules, which are difficult to remove.
> >
> > Can you implement the individual timm modules inside vLLM? (where possible, you should use vLLM-specific layers to improve the performance anyway)
>
> OK, I will try to do this; I think it will take some time.

You can make use of our implementation of CLIPVisionModel to save some effort.
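
A rough sketch of what reusing it could look like, assuming the DeepSeek-VL vision tower can be described by a transformers CLIPVisionConfig and that vLLM's implementation lives at vllm.model_executor.models.clip (the config values below are illustrative, not DeepSeek-VL's real ones):

from transformers import CLIPVisionConfig

from vllm.model_executor.models.clip import CLIPVisionModel

# Illustrative config values; the actual DeepSeek-VL vision tower differs.
config = CLIPVisionConfig(
    hidden_size=1024,
    intermediate_size=4096,
    num_hidden_layers=24,
    num_attention_heads=16,
    image_size=384,
    patch_size=16,
)
vision_tower = CLIPVisionModel(config)

# pixel_values: float tensor of shape (batch, 3, image_size, image_size);
# the tower produces patch embeddings to feed the multi-modal projector.
# image_features = vision_tower(pixel_values)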

@DarkLight1337 (Member)

To speed up the CI queue for #5905, I've cancelled the distributed tests for the latest CI run in this PR since they won't pass anyway until #5905 has been merged. Please merge main into your branch after that happens so that the CI can pass once again.

@liuyancong-enflame-tech (Author) commented Jun 28, 2024

This test case [tests/models/test_deepseek_vl.py] depends on the project https://github.com/deepseek-ai/DeepSeek-VL, and it seems that pip installation fails when building the Docker image. I think it may be acceptable not to add this test case.

The example [examples/deepseek_vl_example.py] runs successfully.

@DarkLight1337 (Member) commented Jul 1, 2024

> This test case [tests/models/test_deepseek_vl.py] depends on the project https://github.com/deepseek-ai/DeepSeek-VL, and it seems that pip installation fails when building the Docker image. I think it may be acceptable not to add this test case.

In this case it won't function for users of vLLM either since they can't install it (so you should still keep the tests). Can you figure out which dependency is causing the issue?

@liuyancong-enflame-tech (Author)

> > This test case [tests/models/test_deepseek_vl.py] depends on the project https://github.com/deepseek-ai/DeepSeek-VL, and it seems that pip installation fails when building the Docker image. I think it may be acceptable not to add this test case.
>
> In this case it won't function for users of vLLM either since they can't install it (so you should still keep the tests). Can you figure out which dependency is causing the issue?
In the tests/models/test_deepseek_vl.py test, when the hf runner loads the model without deepseek_vl being imported, an error occurs:

ValueError: The checkpoint you are trying to load has model type multi_modality but Transformers does not recognize this architecture. This could be because of an issue with the checkpoint, or because your version of Transformers is out of date.

This is because the model code is not in the HF repository but in the GitHub repository. In https://github.com/deepseek-ai/DeepSeek-VL/blob/main/deepseek_vl/models/modeling_vlm.py we can see the registration code:

AutoConfig.register("multi_modality", MultiModalityConfig)
AutoModelForCausalLM.register(MultiModalityConfig, MultiModalityCausalLM)

If I add the dependency to vllm/requirements-test.txt:

deepseek_vl@git+https://github.com/deepseek-ai/DeepSeek-VL.git@main

then Docker reports an error when building the image: the package metadata is None and cannot be recognized. The package can be built into a wheel, but the wheel is not found when installed.

@DarkLight1337 (Member) commented Jul 1, 2024

You can manually register the model to HuggingFace inside the test case.
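
Based on the registration code quoted above, a minimal sketch of such a test-side registration might be (this assumes the deepseek_vl package is importable in the test environment, with the import path following the repo's modeling_vlm.py layout):

from transformers import AutoConfig, AutoModelForCausalLM

# These classes live in the DeepSeek-VL GitHub repo, not on the HF Hub.
from deepseek_vl.models.modeling_vlm import (MultiModalityCausalLM,
                                             MultiModalityConfig)

# Register the custom "multi_modality" architecture so that the HF runner
# can resolve the checkpoint's model type.
AutoConfig.register("multi_modality", MultiModalityConfig)
AutoModelForCausalLM.register(MultiModalityConfig, MultiModalityCausalLM)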

@liuyancong-enflame-tech (Author)

> You can manually register the model to HuggingFace inside the test case.

OK, I'll try it.

@liuyancong-enflame-tech (Author)

The test models/test_deepseek_vl.py failed, but no exception stack trace was shown, so I don't know what happened; the program seems to have been terminated. Have you encountered similar problems?

@DarkLight1337 (Member)

The stack trace is shown near the end of the CI logs:

https://buildkite.com/vllm/ci-aws/builds/4404#0190956c-f526-40a4-b2af-232d40ffbd0c

@liuyancong-enflame-tech (Author)

buildkite/fastcheck/pr/tensorizer-metrics-tracing-test — Failed (exit status 1)
What functionality does this test case cover?

@DarkLight1337 (Member)

It's unrelated to this PR.

Successfully merging this pull request may close these issues.

  • [New Model]: DeepSeek VL
  • DeepSeek VL support