* Clean up GGUF loading; move model loading to the meta device.
* Remove CPU
* Fix CI and validation scripts (pytorch#154)
* Missing device (pytorch#232)
* Use generator args to group all arguments to generator (pytorch#231)
* prompt
* chat_mode, num_samples
* Move more generator args to use dataclass (pytorch#233)
* prompt
* chat_mode, num_samples
* move more args
* more gen args
* update
* args
* undo some changes
* typos
* Minor lint fixes (pytorch#236)
* Remove redundancy and the int4 linear test from ET tests (pytorch#237)
* remove redundancy
* no int4 linear on ET
* small changes
---------
Co-authored-by: Guang Yang <42389959+guangy10@users.noreply.github.com>
Co-authored-by: Michael Gschwind <61328285+mikekgfb@users.noreply.github.com>
Co-authored-by: Mergen Nachin <mnachin@meta.com>
Hi folks, thanks for the great work.
With #135 merged, vLLM could benefit from the torch.compile backend, given its compiler-native integration with PagedAttention kernels.
Is there an easy way to see the latest/nightly MBU for torch.compile on, say, an H100 with Llama 3 70B?
I'm also interested in cold-start compile time.
cc @msaroufim