Add Pallas GPU decode attention in Maxtext inference #1376

Triggered via pull request February 25, 2025 23:06
Status Success
Total duration 24m 25s

RunTests.yml

on: pull_request
gpu_image / Build and upload image (a100-40gb-4): 43s
tpu_image / Build and upload image (v4-8): 3m 14s
Clean up: 5s
Notify failed build: 2s