Update nightly llama benchmarking tests #754

aviator19941 · 2025-01-04T11:13:57Z

Updates nightly llama benchmarking tests to benchmark input token lengths of 128 and 2048 for llama 8b, 70b, and 405b.
Switch IREE compile flag from --iree-hal-target-backends to --iree-hal-target-device

TODO: Add 405b decode benchmark calls to 405b fp16 tests when decode is fixed

Signed-off-by: aviator19941 <avinash.sharma@amd.com>

.github/workflows/ci-llama-large-tests.yaml

…hark-ai into fix_sharded_llama_tests

- Updates nightly llama benchmarking tests to benchmark input token lengths of 128 and 2048 for llama 8b, 70b, and 405b. - Switch IREE compile flag from `--iree-hal-target-backends` to `--iree-hal-target-device` TODO: Add 405b decode benchmark calls to 405b fp16 tests when decode is fixed --------- Signed-off-by: aviator19941 <avinash.sharma@amd.com> Co-authored-by: Archana Ramalingam <98564406+archana-ramalingam@users.noreply.github.com> Co-authored-by: archana-ramalingam <archana.ramalingam@amd.com>

aviator19941 added 4 commits January 3, 2025 20:31

Fix 8b nightly tests

50eb2e6

Signed-off-by: aviator19941 <avinash.sharma@amd.com>

Update 70b test

a3a1a39

Signed-off-by: aviator19941 <avinash.sharma@amd.com>

Update 70b tests

db7d79e

Signed-off-by: aviator19941 <avinash.sharma@amd.com>

Update 405b 128 and 2048 tests

2abe6ad

Signed-off-by: aviator19941 <avinash.sharma@amd.com>

aviator19941 requested a review from archana-ramalingam January 4, 2025 11:13

archana-ramalingam and others added 7 commits January 6, 2025 09:47

Merge branch 'main' into fix_sharded_llama_tests

e68ee2f

Test benchmark nightly

afc8cff

Add missing iree_hal_target_device flags

69997e6

Use iree_hal_target_device flag to compile

4849c76

Add --iree-hal-target-device fixture

1a3c4cb

Use --iree-hal-target-device flag in perplexity tests

d747d89

Merge branch 'main' into fix_sharded_llama_tests

33815da

archana-ramalingam requested a review from IanNod January 6, 2025 20:04

ScottTodd reviewed Jan 6, 2025

View reviewed changes

.github/workflows/ci-llama-large-tests.yaml Outdated Show resolved Hide resolved

archana-ramalingam and others added 6 commits January 7, 2025 03:56

Fix benchmark nightly tests

5c7e6ff

Merge branch 'fix_sharded_llama_tests' of https://github.com/nod-ai/s…

30cf1c6

…hark-ai into fix_sharded_llama_tests

Merge branch 'main' into fix_sharded_llama_tests

31badf9

Revert debug changes

3842b62

Merge branch 'fix_sharded_llama_tests' of https://github.com/nod-ai/s…

0670af4

…hark-ai into fix_sharded_llama_tests

Merge branch 'main' into fix_sharded_llama_tests

34a0674

IanNod approved these changes Jan 7, 2025

View reviewed changes

archana-ramalingam merged commit ab29d88 into main Jan 7, 2025
22 of 24 checks passed

archana-ramalingam deleted the fix_sharded_llama_tests branch January 7, 2025 21:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update nightly llama benchmarking tests #754

Update nightly llama benchmarking tests #754

aviator19941 commented Jan 4, 2025 •

edited by archana-ramalingam

Loading

Update nightly llama benchmarking tests #754

Update nightly llama benchmarking tests #754

Conversation

aviator19941 commented Jan 4, 2025 • edited by archana-ramalingam Loading

aviator19941 commented Jan 4, 2025 •

edited by archana-ramalingam

Loading