Skip to content

Add Pallas GPU decode attention in Maxtext inference #1369

Add Pallas GPU decode attention in Maxtext inference

Add Pallas GPU decode attention in Maxtext inference #1369

Triggered via pull request February 25, 2025 21:22
Status Success
Total duration 28m 53s
Artifacts

RunTests.yml

on: pull_request
gpu_image  /  Build and upload image (a100-40gb-4)
40s
gpu_image / Build and upload image (a100-40gb-4)
tpu_image  /  Build and upload image (v4-8)
3m 14s
tpu_image / Build and upload image (v4-8)
Clean up
4s
Clean up
Notify failed build
2s
Notify failed build
Fit to window
Zoom out
Zoom in