[Model] New interface and automatic detection for PP support #9000

DarkLight1337 · 2024-10-01T14:11:51Z

In this PR, I have updated the model registry to import model modules in a separate process to check whether they support PP. This avoids initializing CUDA for the main program. I have also applied this to the check for multimodal models.

With this improvement, I won't have to add almost every model to the hardcoded list for #7168.

While working on this, I have also moved the iteration over architectures from ModelConfig into ModelRegistry. This supersedes #8924.

github-actions · 2024-10-01T14:12:06Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can do one of these:

Add ready label to the PR
Enable auto-merge.

🚀

youkaichao · 2024-10-01T18:19:04Z

ideally we should support pp for all models. I'm waiting for @andoorve to support this.

for multi-modality support, we can use this approach.

DarkLight1337 · 2024-10-03T19:18:56Z

Closing as #7168 will include these changes.

DarkLight1337 added 2 commits October 1, 2024 14:09

Add SupportsPP interface and stateless protocol check

eea3fc5

Subclass SupportsPP in relevant models

b4ce5f7

DarkLight1337 requested review from youkaichao and ywang96 October 1, 2024 14:11

Remove hardcoded list

30e454a

DarkLight1337 force-pushed the supports-pp branch from 7b649de to 30e454a Compare October 1, 2024 14:18

DarkLight1337 added 7 commits October 1, 2024 14:19

Remove unused import

e9ea5b7

Check using function

8b40176

Update docstring

ec4c6b3

Simplify

cdc4dbe

Add tests

dcc2a49

Test CUDA initialization

7280766

Add platform guard

37cc51b

DarkLight1337 changed the title ~~[Core] New interface and automatic detection for PP support~~ [Model] New interface and automatic detection for PP support Oct 1, 2024

DarkLight1337 added the ready ONLY add when PR is ready to merge/full CI is needed label Oct 1, 2024

DarkLight1337 force-pushed the supports-pp branch from 11e95ff to 37cc51b Compare October 1, 2024 15:19

DarkLight1337 added 4 commits October 1, 2024 15:20

Trigger CI

3814246

Fix OOT registration

cf91f7b

Update docstring

38b090a

Remove unnecessary global

d394985

DarkLight1337 added 4 commits October 3, 2024 04:25

Update interfaces

6a4287a

format

1e010c7

Fix error check

1e0baba

Make prefix required

9ef69de

DarkLight1337 mentioned this pull request Oct 3, 2024

[Models] Add remaining model PP support #7168

Merged

DarkLight1337 added 3 commits October 3, 2024 16:52

Fix environment variables not being copied over

a36f7ed

Merge branch 'main' into supports-pp

5b960bc

Fix the real problem, which is that modelscope is not installed

ed669a5

DarkLight1337 added 2 commits October 3, 2024 13:33

Move modelscope installation into regression test

b8958a9

Fix LLMWrapper

e9f0601

DarkLight1337 closed this Oct 3, 2024

DarkLight1337 deleted the supports-pp branch October 4, 2024 02:57

DarkLight1337 mentioned this pull request Oct 4, 2024

[Core][VLM] Test registration for OOT multimodal models #8717

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Model] New interface and automatic detection for PP support #9000

[Model] New interface and automatic detection for PP support #9000

DarkLight1337 commented Oct 1, 2024 •

edited

Loading

github-actions bot commented Oct 1, 2024

youkaichao commented Oct 1, 2024

DarkLight1337 commented Oct 3, 2024

[Model] New interface and automatic detection for PP support #9000

[Model] New interface and automatic detection for PP support #9000

Conversation

DarkLight1337 commented Oct 1, 2024 • edited Loading

github-actions bot commented Oct 1, 2024

youkaichao commented Oct 1, 2024

DarkLight1337 commented Oct 3, 2024

DarkLight1337 commented Oct 1, 2024 •

edited

Loading