CUDA: add FP32 FlashAttention vector kernel #12040

Annotations

1 warning

Push Docker image to Docker Hub (light-rocm, .devops/main-rocm.Dockerfile, linux/amd64,linux/arm64)

succeeded May 11, 2024 in 10m 49s