Skip to content

Actions: huggingface/text-generation-inference

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
14,874 workflow runs
14,874 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

feat: add triton kernels to decrease latency of large batches
CI build #1624: Pull request #2687 synchronize by OlivierDehaene
October 24, 2024 17:17 1h 8m 11s feat/triton_prepare
October 24, 2024 17:17 1h 8m 11s
feat: add triton kernels to decrease latency of large batches
Server Tests #3282: Pull request #2687 synchronize by OlivierDehaene
October 24, 2024 17:17 6m 37s feat/triton_prepare
October 24, 2024 17:17 6m 37s
feat: add triton kernels to decrease latency of large batches
Automatic Documentation for Launcher #1587: Pull request #2687 synchronize by OlivierDehaene
October 24, 2024 17:17 7m 24s feat/triton_prepare
October 24, 2024 17:17 7m 24s
feat: add triton kernels to decrease latency of large batches
Nix Tests #410: Pull request #2687 synchronize by OlivierDehaene
October 24, 2024 17:17 5m 45s feat/triton_prepare
October 24, 2024 17:17 5m 45s
fix kernel
Secret Leaks #1995: Commit 383a6ba pushed by OlivierDehaene
October 24, 2024 17:17 25s feat/triton_prepare
October 24, 2024 17:17 25s
feat: add triton kernels to decrease latency of large batches
CI build #1623: Pull request #2687 synchronize by OlivierDehaene
October 24, 2024 17:01 16m 9s feat/triton_prepare
October 24, 2024 17:01 16m 9s
feat: add triton kernels to decrease latency of large batches
Nix Tests #409: Pull request #2687 synchronize by OlivierDehaene
October 24, 2024 17:01 8m 1s feat/triton_prepare
October 24, 2024 17:01 8m 1s
feat: add triton kernels to decrease latency of large batches
Automatic Documentation for Launcher #1586: Pull request #2687 synchronize by OlivierDehaene
October 24, 2024 17:01 7m 16s feat/triton_prepare
October 24, 2024 17:01 7m 16s
feat: add triton kernels to decrease latency of large batches
Server Tests #3281: Pull request #2687 synchronize by OlivierDehaene
October 24, 2024 17:01 8m 53s feat/triton_prepare
October 24, 2024 17:01 8m 53s
cast to int32
Secret Leaks #1994: Commit fb91860 pushed by OlivierDehaene
October 24, 2024 17:01 16s feat/triton_prepare
October 24, 2024 17:01 16s
feat: add support for qwen2 vl model
Secret Leaks #1993: Commit b735e79 pushed by drbh
October 24, 2024 15:40 22s support-qwen2-vl
October 24, 2024 15:40 22s
feat: add support for qwen2 vl model
Secret Leaks #1992: Commit edbf0f7 pushed by drbh
October 24, 2024 15:37 17s support-qwen2-vl
October 24, 2024 15:37 17s
Switch from fbgemm-gpu w8a8 scaled matmul to vLLM/marlin-kernels
CI build #1622: Pull request #2688 synchronize by danieldk
October 24, 2024 15:32 1h 8m 0s feature/cc89-cutlass-w8a8
October 24, 2024 15:32 1h 8m 0s
Switch from fbgemm-gpu w8a8 scaled matmul to vLLM/marlin-kernels
Server Tests #3280: Pull request #2688 synchronize by danieldk
October 24, 2024 15:32 7m 31s feature/cc89-cutlass-w8a8
October 24, 2024 15:32 7m 31s
Switch from fbgemm-gpu w8a8 scaled matmul to vLLM/marlin-kernels
Automatic Documentation for Launcher #1585: Pull request #2688 synchronize by danieldk
October 24, 2024 15:32 7m 4s feature/cc89-cutlass-w8a8
October 24, 2024 15:32 7m 4s
Switch from fbgemm-gpu w8a8 scaled matmul to vLLM/marlin-kernels
Automatic Documentation for Launcher #1584: Pull request #2688 opened by danieldk
October 24, 2024 15:31 7m 20s feature/cc89-cutlass-w8a8
October 24, 2024 15:31 7m 20s
feat: add triton kernels to decrease latency of large batches
CI build #1620: Pull request #2687 opened by OlivierDehaene
October 24, 2024 14:49 50m 56s feat/triton_prepare
October 24, 2024 14:49 50m 56s