This repository was archived by the owner on Oct 11, 2024. It is now read-only.

Add NM benchmarking scripts & utils #14

Merged
merged 75 commits into from
Feb 22, 2024

Conversation

varun-sundar-rabindranath

@varun-sundar-rabindranath varun-sundar-rabindranath commented Feb 15, 2024

Summary:
Add benchmarking scripts and utils.
Things to note:

  • All files are stored in neuralmagic folder.
  • neuralmagic/benchmarks/scripts/* : Actual benchmarking scripts that interact with vllm engine.
  • neuralmagic/benchmarks/configs/* : JSON config files that define what benchmark commands to run.
  • neuralmagic/benchmarks/run_*.py : Scripts that consume some config file and run the benchmark scripts.
  • neuralmagic/tools : Add tools
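
To illustrate the split between configs and runners described above, here is a minimal sketch of how a `run_*.py` script might turn a JSON config into benchmark invocations. The config keys (`benchmarks`, `script`, `args`) and the `load_commands` helper are assumptions for illustration, not the actual schema used in this PR:

```python
import json
import sys
from pathlib import Path

def load_commands(config_path: str) -> list:
    """Parse a JSON benchmark config into argv lists, one per benchmark.

    Assumed config shape (hypothetical, not the PR's real schema):
    {"benchmarks": [{"script": "...", "args": ["--flag", "val"]}, ...]}
    """
    config = json.loads(Path(config_path).read_text())
    return [
        [sys.executable, bench["script"], *bench.get("args", [])]
        for bench in config.get("benchmarks", [])
    ]
```

A runner would then loop over `load_commands(path)` and execute each argv list with `subprocess.run(cmd, check=True)`.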

Testing:
Local testing

@varun-sundar-rabindranath varun-sundar-rabindranath marked this pull request as ready for review February 16, 2024 21:57
@robertgshaw2-redhat
Collaborator

robertgshaw2-redhat commented Feb 19, 2024

Thanks Varun. In general this is looking good. I'm going to leave some more comments in a bit.

The one thing that is a bit tricky here is that the dataset processing is a bit too tied into the server benchmarking, which makes it hard to support swapping the benchmarked datasets in and out (which we will do over time). For example, we will want a different request pattern for prefix-caching performance benchmarking than for general benchmarking.

I'm going to put up a PR with an idea for how to make this a bit more pluggable.

@varun-sundar-rabindranath
Author

Thanks @robertgshaw2-neuralmagic

The one thing that is a bit tricky here is that the dataset processing is a bit too tied into the server benchmarking, which makes it hard to support swapping the benchmarked datasets in and out (which we will do over time).

I made a recent refactor that moves the dataset-related code into neuralmagic-vllm/neuralmagic/benchmarks/scripts/common.py, so it should be a little more organized now. But I agree with the sentiment that there are some kinks with the dataset handling. Happy to talk about it.

@robertgshaw2-redhat
Collaborator

@varun-sundar-rabindranath

I completely hacked the relative import system to do a simple proof of concept of how the dataset_registry could work. https://github.com/neuralmagic/neuralmagic-vllm/pull/30/files#diff-dca4e725ece41a665c0423924d56c905ce4c00188d79bc5dbeacb222c0ae6c6a

The idea was to have a programmatic way to add new datasets. Thus, the get_sharegpt and get_ultrachat functions are responsible for both downloading and preprocessing the data.

  • This structure removes the need for a dataset_download_cmds functionality, as downloads can now be handled programmatically within the dataset registry.
  • This structure separates the preprocessing functionality from the benchmarking scripts. Since each dataset will have a different format, this is crucial.

This should make it easy to add new datasets over time.
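
As a rough sketch of the registry idea, a dict mapping dataset names to loader functions plus a registration decorator would be enough. The names `DATASET_REGISTRY`, `register_dataset`, and `load_dataset` are hypothetical here; only `get_sharegpt`/`get_ultrachat` come from the discussion above, and the record shape is a placeholder:

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical registry: dataset name -> loader that downloads and
# preprocesses the data into benchmark-ready records.
DATASET_REGISTRY: Dict[str, Callable[[], List[Tuple[str, int]]]] = {}

def register_dataset(name: str):
    """Decorator that registers a loader function under `name`."""
    def wrap(fn: Callable[[], List[Tuple[str, int]]]):
        DATASET_REGISTRY[name] = fn
        return fn
    return wrap

@register_dataset("sharegpt")
def get_sharegpt() -> List[Tuple[str, int]]:
    # A real loader would download and preprocess ShareGPT here;
    # this stub just returns one (prompt, output_len) record.
    return [("example prompt", 128)]

def load_dataset(name: str) -> List[Tuple[str, int]]:
    """Look up and invoke the registered loader for `name`."""
    return DATASET_REGISTRY[name]()
```

With this shape, adding a new dataset is just defining one decorated function; the benchmarking scripts never need to know about per-dataset formats.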

@@ -0,0 +1,178 @@
"""
Common functions used in all benchmarking scripts
Member


i think we want to point out these are for our benchmarking scripts.

Member


looks like we are going down this rabbit hole again.

Member

@andy-neuma andy-neuma left a comment


thanks for the "meetup". looks good.

@varun-sundar-rabindranath varun-sundar-rabindranath changed the title Varun/nm benchmarks Add NM Benchmarking Feb 22, 2024
@varun-sundar-rabindranath varun-sundar-rabindranath changed the title Add NM Benchmarking Add NM benchmarking scripts & utils Feb 22, 2024
@varun-sundar-rabindranath varun-sundar-rabindranath merged commit 77928e0 into main Feb 22, 2024
2 checks passed
@varun-sundar-rabindranath varun-sundar-rabindranath deleted the varun/nm-benchmarks branch February 22, 2024 22:21