Skip to content

Commit

Permalink
Update summary.md (#125)
Browse files Browse the repository at this point in the history
  • Loading branch information
qihqi authored Jun 17, 2024
1 parent 8bffb5d commit 7526a90
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions benchmarks/summary.md
Original file line number Diff line number Diff line change
Expand Up @@ -22,6 +22,8 @@ Date | Device | dtype | batch size | cache length |max input length |max output
----| ------- | ------ |---------- | -------------|-----------------|------------------|----------------------
2024-05-14 | TPU v5e-8 | bfloat16 | 512 | 2048 | 1024 | 1024 | 8700
2024-05-14 | TPU v5e-8 | int8 | 1024 | 2048 | 1024 | 1024 | 8746
2024-06-13 | TPU v5e-1 | bfloat16 | 1024 | 2048 | 1024 | 1024 | 4249


** NOTE: ** Gemma 2B uses `--shard_on_batch` flag so it's data parallel instead
of model parallel.
Expand Down

0 comments on commit 7526a90

Please sign in to comment.