Write README and front page of doc #147
Conversation
@zhuohan123 PTAL.
LGTM! Left some comments on sentence phrasing and formatting.
```bash
python test_cli_client.py
```
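For context, a minimal CLI test client along these lines might build a JSON request and POST it to a locally running server. This is a hypothetical sketch, not the repository's actual `test_cli_client.py`; the endpoint path, port, and field names are assumptions.

```python
import json
from urllib import request

# Assumed endpoint of a locally running generation server (placeholder,
# not a guaranteed default).
SERVER_URL = "http://localhost:8001/generate"

def build_request(prompt: str, max_tokens: int = 32) -> bytes:
    """Encode a generation request as JSON bytes for an HTTP POST."""
    payload = {"prompt": prompt, "max_tokens": max_tokens}
    return json.dumps(payload).encode("utf-8")

def send_request(prompt: str) -> str:
    """POST the prompt to the server and return the generated text."""
    req = request.Request(
        SERVER_URL,
        data=build_request(prompt),
        headers={"Content-Type": "application/json"},
    )
    with request.urlopen(req) as resp:
        return json.loads(resp.read())["text"]

# Usage (requires a running server):
#   send_request("Hello, my name is")
```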
- State-of-the-art performance in serving throughput
Suggested change: "State-of-the-art performance in serving throughput" → "State-of-the-art serving throughput"
README.md
Outdated
```bash
python test_cli_client.py
```
- State-of-the-art performance in serving throughput
- Efficient management of cached attention keys and values with **PagedAttention** |
Suggested change: "Efficient management of cached attention keys and values with **PagedAttention**" → "Efficient management of cached attention keys and values memory with **PagedAttention**"
I think "memory" here is redundant and a bit confusing as we already said they are "cached".
Fixed to "Efficient management of attention key and value memory with **PagedAttention**".
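For readers following the wording discussion, the idea behind PagedAttention is that KV-cache memory is managed in fixed-size blocks through a per-sequence indirection table, so cache storage need not be contiguous. A toy sketch of that bookkeeping (all names and sizes are illustrative, not vLLM's implementation):

```python
BLOCK_SIZE = 4  # tokens per physical cache block (illustrative)

class BlockTable:
    """Maps a sequence's logical token positions to physical cache blocks."""

    def __init__(self, free_blocks):
        self.free_blocks = list(free_blocks)  # pool of physical block ids
        self.blocks = []                      # blocks owned by this sequence
        self.num_tokens = 0

    def append_token(self):
        """Reserve cache space for one more token, allocating a new
        physical block only when the current one is full."""
        if self.num_tokens % BLOCK_SIZE == 0:
            self.blocks.append(self.free_blocks.pop())
        self.num_tokens += 1

    def physical_slot(self, pos):
        """Translate a logical token position to (physical block, offset)."""
        return self.blocks[pos // BLOCK_SIZE], pos % BLOCK_SIZE

table = BlockTable(free_blocks=range(8))
for _ in range(6):
    table.append_token()
# 6 tokens with block size 4 occupy exactly 2 physical blocks
```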
docs/source/index.rst
Outdated
- Efficient support for various decoding algorithms such as parallel sampling and beam search
- Tensor parallelism support for multi-GPU inference
- Streaming outputs
- OpenAI-compatible API
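Since the list advertises an OpenAI-compatible API, a hedged sketch of what a request body for such an endpoint could look like follows. The `/v1/completions` path mirrors the OpenAI API shape; the port and model name are placeholders, not guaranteed defaults.

```python
import json

# Assumed local endpoint following the OpenAI completions API shape.
API_URL = "http://localhost:8000/v1/completions"

def completion_payload(model: str, prompt: str, *, max_tokens: int = 16,
                       temperature: float = 0.0, stream: bool = False) -> str:
    """Serialize an OpenAI-style completion request to a JSON string."""
    return json.dumps({
        "model": model,
        "prompt": prompt,
        "max_tokens": max_tokens,
        "temperature": temperature,
        "stream": stream,  # set True for the streaming outputs listed above
    })

payload = completion_payload("facebook/opt-125m", "The capital of France is")
```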
Ditto on comments in README.
Closes #124