Skip to content

Add Pallas GPU decode attention in Maxtext inference #1376

Add Pallas GPU decode attention in Maxtext inference

Add Pallas GPU decode attention in Maxtext inference #1376

tpu_unit_tests  /  run

succeeded Feb 25, 2025 in 20m 35s