
Make initialization of tokenizer and detokenizer optional #3748

Merged: 47 commits, Apr 21, 2024

Conversation

GeauxEric
Contributor

Add a flag to the LLM engine to disable initialization of the tokenizer.
If the tokenizer is disabled, the detokenizer is disabled as well.

FIX #3635
FIX #3647
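A minimal sketch of the pattern this PR introduces. The class and attribute names below are illustrative stand-ins, not vLLM's actual engine classes: the engine only builds a tokenizer when the skip flag is off, and the detokenizer follows the tokenizer.

```python
class DummyTokenizer:
    """Stand-in for a real HuggingFace tokenizer."""
    eos_token_id = 2

    def decode(self, token_ids):
        return " ".join(str(t) for t in token_ids)


class MiniEngine:
    """Illustrative engine: tokenizer init is conditional on the flag."""

    def __init__(self, skip_tokenizer_init: bool = False):
        if skip_tokenizer_init:
            # Both tokenizer and detokenizer stay None; callers must then
            # supply prompt token IDs instead of text prompts.
            self.tokenizer = None
            self.detokenizer = None
        else:
            self.tokenizer = DummyTokenizer()
            self.detokenizer = self.tokenizer.decode


engine = MiniEngine(skip_tokenizer_init=True)
print(engine.tokenizer is None)  # True: tokenizer was never initialized
```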

@ywang96
Member

ywang96 commented Mar 31, 2024

Thanks for making this PR! It appears to me that unlike #3749 to allow skipping detokenization at sampling param level, this PR simply disables the use of tokenizer as a whole, so I guess the two PRs can still co-exist.

We might want to be careful with all these API changes though - cc @simon-mo

@ywang96 ywang96 self-assigned this Apr 1, 2024
@GeauxEric GeauxEric marked this pull request as draft April 2, 2024 05:24
@GeauxEric GeauxEric marked this pull request as ready for review April 2, 2024 16:47
@GeauxEric
Contributor Author

GeauxEric commented Apr 2, 2024

Ready for review.
@simon-mo @ywang96, could you please take a look?

@ywang96
Member

ywang96 commented Apr 4, 2024

@GeauxEric Thank you for the contribution! Could you update this branch with the changes merged in the main branch? Happy to review after the conflicts are resolved!

@GeauxEric
Contributor Author

GeauxEric commented Apr 6, 2024

> @GeauxEric Thank you for the contribution! Could you update this branch with the changes merged in the main branch? Happy to review after the conflicts are resolved!

@ywang96
Conflicts resolved.

@GeauxEric GeauxEric marked this pull request as ready for review April 17, 2024 19:57
@GeauxEric
Contributor Author

Maybe others can help take a look? @simon-mo?

Member

@ywang96 ywang96 left a comment


Sorry for the late review - I can see the purpose of having this. Since it doesn't affect other downstream components too much, we should be fine with having this feature!

@@ -95,6 +96,10 @@ def add_cli_args(
type=str,
default=EngineArgs.tokenizer,
help='name or path of the huggingface tokenizer to use')
parser.add_argument(
'--skip_tokenizer_init',
Member


format nit

Suggested change
'--skip_tokenizer_init',
'--skip-tokenizer-init',
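The suggested rename follows argparse convention: option names use dashes on the command line, and argparse maps them to an underscored attribute automatically. A self-contained sketch of the flag after the suggestion (the `help` text is paraphrased from the PR title, not copied from the diff):

```python
import argparse

parser = argparse.ArgumentParser()
# Dashed option name on the CLI; argparse exposes it as
# args.skip_tokenizer_init (dashes become underscores in dest).
parser.add_argument(
    '--skip-tokenizer-init',
    action='store_true',
    help='skip initialization of tokenizer and detokenizer')

args = parser.parse_args(['--skip-tokenizer-init'])
print(args.skip_tokenizer_init)  # True
```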

Comment on lines +343 to +346
eos_token_id = None
if self.tokenizer:
eos_token_id = self.tokenizer.get_lora_tokenizer(
lora_request).eos_token_id
Member


I still think it's worth having a warning here about eos_token_id being None.
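A hedged sketch of what the reviewer is asking for: when the tokenizer was skipped, `eos_token_id` stays `None`, and emitting a warning makes that failure mode visible. The helper name and `FakeTokenizerGroup` below are illustrative; only the guarded lookup mirrors the snippet above.

```python
import warnings


class FakeTokenizerGroup:
    """Stand-in for vLLM's tokenizer group, for illustration only."""

    def get_lora_tokenizer(self, lora_request):
        class Tok:
            eos_token_id = 2
        return Tok()


def resolve_eos_token_id(tokenizer, lora_request=None):
    eos_token_id = None
    if tokenizer:
        eos_token_id = tokenizer.get_lora_tokenizer(
            lora_request).eos_token_id
    else:
        # The warning the reviewer suggests: without a tokenizer there is
        # no EOS token, so generation will not stop on EOS.
        warnings.warn(
            "Tokenizer was skipped (skip_tokenizer_init); eos_token_id "
            "is None and generation will not stop on an EOS token.")
    return eos_token_id
```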

@GeauxEric GeauxEric marked this pull request as draft April 18, 2024 17:48
@GeauxEric GeauxEric marked this pull request as ready for review April 18, 2024 22:18
@GeauxEric GeauxEric requested a review from ywang96 April 18, 2024 22:18
@GeauxEric
Contributor Author

Could you please take another look and consider merging/approving it when you have a moment?

Member

@ywang96 ywang96 left a comment


@GeauxEric Sorry for the delayed review and thank you for addressing the comments and contributing this PR!

@ywang96 ywang96 enabled auto-merge (squash) April 21, 2024 21:33
@ywang96 ywang96 merged commit a37d815 into vllm-project:main Apr 21, 2024
46 of 47 checks passed
robertgshaw2-neuralmagic pushed a commit to neuralmagic/nm-vllm that referenced this pull request Apr 21, 2024
…ct#3748)

Co-authored-by: Yun Ding <yunding@nvidia.com>
Co-authored-by: Roger Wang <ywang@roblox.com>
ZackBradshaw pushed a commit to ZackBradshaw/vllm that referenced this pull request Apr 22, 2024
…ct#3748)

Co-authored-by: Yun Ding <yunding@nvidia.com>
Co-authored-by: Roger Wang <ywang@roblox.com>
xjpang pushed a commit to xjpang/vllm that referenced this pull request Apr 25, 2024
…ct#3748)

Co-authored-by: Yun Ding <yunding@nvidia.com>
Co-authored-by: Roger Wang <ywang@roblox.com>
robertgshaw2-neuralmagic pushed a commit to neuralmagic/nm-vllm that referenced this pull request Apr 26, 2024
…ct#3748)

Co-authored-by: Yun Ding <yunding@nvidia.com>
Co-authored-by: Roger Wang <ywang@roblox.com>
alexeykondrat pushed a commit to alexeykondrat/ci-vllm that referenced this pull request May 1, 2024
…ct#3748)

Co-authored-by: Yun Ding <yunding@nvidia.com>
Co-authored-by: Roger Wang <ywang@roblox.com>
z103cb pushed a commit to z103cb/opendatahub_vllm that referenced this pull request May 7, 2024
…ct#3748)

Co-authored-by: Yun Ding <yunding@nvidia.com>
Co-authored-by: Roger Wang <ywang@roblox.com>
Temirulan pushed a commit to Temirulan/vllm-whisper that referenced this pull request Sep 6, 2024
…ct#3748)

Co-authored-by: Yun Ding <yunding@nvidia.com>
Co-authored-by: Roger Wang <ywang@roblox.com>
4 participants