
Rename to vLLM #150

Merged
WoosukKwon merged 27 commits into main from rename-vllm on Jun 17, 2023
Conversation

WoosukKwon (Collaborator) commented on Jun 17, 2023:

The current plan is:

  1. Merge this PR
  2. Change the repo name to vllm
  3. Move the repo to an organization
  4. Fix the readthedocs URL

zhuohan123 (Member) left a comment:

LGTM! vLLM rocks!

Review comment on benchmarks/benchmark_latency.py (resolved).
WoosukKwon (Collaborator, Author) commented:

@zhuohan123 FYI, I've just renamed the attention classes: GPTPagedAttention to PagedAttention, and GPTNeoXPagedAttention to PagedAttentionWithRoPE.
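For readers following along, here is a minimal, hypothetical sketch of how the two renamed classes relate: PagedAttentionWithRoPE applies rotary position embeddings before delegating to the base PagedAttention. The constructor signature, the apply_rope helper, and the plain scaled-dot-product fallback are illustrative assumptions, not the actual vLLM kernels.

```python
import torch
import torch.nn as nn


def apply_rope(q: torch.Tensor, k: torch.Tensor, positions: torch.Tensor,
               base: float = 10000.0) -> tuple[torch.Tensor, torch.Tensor]:
    """Apply rotary position embeddings to q and k (illustrative helper, not vLLM's)."""
    half = q.shape[-1] // 2
    inv_freq = 1.0 / (base ** (torch.arange(half, dtype=torch.float32) / half))
    angles = positions.float()[:, None] * inv_freq[None, :]  # (seq_len, half)
    cos, sin = angles.cos(), angles.sin()

    def rotate(x: torch.Tensor) -> torch.Tensor:
        x1, x2 = x[..., :half], x[..., half:]
        return torch.cat([x1 * cos - x2 * sin, x1 * sin + x2 * cos], dim=-1)

    return rotate(q), rotate(k)


class PagedAttention(nn.Module):
    """Generic attention layer; plain softmax(QK^T)V stands in for the paged kernel."""

    def __init__(self, num_heads: int, head_size: int, scale: float) -> None:
        super().__init__()
        self.num_heads = num_heads
        self.head_size = head_size
        self.scale = scale

    def forward(self, q: torch.Tensor, k: torch.Tensor, v: torch.Tensor) -> torch.Tensor:
        attn = torch.softmax(q @ k.transpose(-2, -1) * self.scale, dim=-1)
        return attn @ v


class PagedAttentionWithRoPE(PagedAttention):
    """Same attention, but rotates q/k with RoPE first (GPT-NeoX-style models)."""

    def forward(self, positions: torch.Tensor, q: torch.Tensor,
                k: torch.Tensor, v: torch.Tensor) -> torch.Tensor:
        q, k = apply_rope(q, k, positions)
        return super().forward(q, k, v)


# Usage sketch: one head, head size 8, sequence length 4.
layer = PagedAttentionWithRoPE(num_heads=1, head_size=8, scale=8 ** -0.5)
q = k = v = torch.randn(1, 4, 8)
out = layer(torch.arange(4), q, k, v)  # shape (1, 4, 8)
```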

WoosukKwon merged commit 0b98ba1 into main on Jun 17, 2023.
WoosukKwon deleted the rename-vllm branch on June 17, 2023 at 10:08.
hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024
sjchoi1 pushed a commit to casys-kaist-internal/vllm that referenced this pull request May 7, 2024
yukavio pushed a commit to yukavio/vllm that referenced this pull request Jul 3, 2024
Summary:
The 2024-03-25 nightly benchmarks failed due to performance regressions.
We find that this is due to either:
  - the inherent flakiness of the benchmark experiments themselves (experiments with small workloads), or
  - the inherent flakiness of the metrics.
See https://docs.google.com/document/d/1478BMToQIcpSCloiEWqmHoZVrVOZVV-1u4gCyqtjkKE/edit?usp=sharing for more details.

Updates in this PR:
  - Serving case: remove the 3000-num-prompts at 10 QPS experiments.
  - Serving case: mark the p90 and p99 statistics as "Observation" metrics so they don't trigger failures (a rough sketch of this gating/observation split follows below).
  - Engine case (benchmark_throughput.py): remove the 16 and 32 prefill cases.
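
As a rough illustration of the observation-versus-gating split mentioned above, here is a hypothetical Python sketch; the metric names, the 10% tolerance, and the check_regressions helper are assumptions for illustration and are not the actual benchmark harness used for these nightly runs.

```python
# Hypothetical sketch: only "gating" metrics can fail a nightly run, while
# "observation" metrics (e.g. p90/p99 latency) are reported but never gate.
GATING_METRICS = {"mean_latency_ms", "p50_latency_ms"}
OBSERVATION_METRICS = {"p90_latency_ms", "p99_latency_ms"}


def check_regressions(current: dict, baseline: dict, tolerance: float = 0.10) -> list[str]:
    """Return the gating metrics that regressed beyond the tolerance."""
    failures = []
    for name in GATING_METRICS:
        if name in current and name in baseline:
            if current[name] > baseline[name] * (1.0 + tolerance):
                failures.append(name)
    # Observation metrics are logged only, so flakiness here cannot fail the run.
    for name in OBSERVATION_METRICS:
        if name in current:
            print(f"[observe] {name}: {current[name]:.1f} ms")
    return failures


# Usage sketch: a flaky p99 does not fail the run; a regressed mean does.
print(check_regressions(
    current={"mean_latency_ms": 120.0, "p99_latency_ms": 900.0},
    baseline={"mean_latency_ms": 100.0, "p99_latency_ms": 300.0},
))  # -> ['mean_latency_ms']
```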

Test: 
Some local testing

---------

Co-authored-by: Varun Sundar Rabindranath <varun@neuralmagic.com>