Workflow runs · huggingface/text-generation-inference

Actions

All workflows

Actions

Loading...
Loading

Showing runs from all workflows

14,874 workflow runs

feat: add triton kernels to decrease latency of large batches CI build #1624: Pull request #2687 synchronize by OlivierDehaene

October 24, 2024 17:17

1h 8m 11s feat/triton_prepare

feat/triton_prepare

October 24, 2024 17:17

1h 8m 11s

feat: add triton kernels to decrease latency of large batches Server Tests #3282: Pull request #2687 synchronize by OlivierDehaene

October 24, 2024 17:17

6m 37s feat/triton_prepare

feat/triton_prepare

October 24, 2024 17:17

6m 37s

feat: add triton kernels to decrease latency of large batches Automatic Documentation for Launcher #1587: Pull request #2687 synchronize by OlivierDehaene

October 24, 2024 17:17

7m 24s feat/triton_prepare

feat/triton_prepare

October 24, 2024 17:17

7m 24s

feat: add triton kernels to decrease latency of large batches Nix Tests #410: Pull request #2687 synchronize by OlivierDehaene

October 24, 2024 17:17

5m 45s feat/triton_prepare

feat/triton_prepare

October 24, 2024 17:17

5m 45s

fix kernel Secret Leaks #1995: Commit 383a6ba pushed by OlivierDehaene

October 24, 2024 17:17

25s feat/triton_prepare

feat/triton_prepare

October 24, 2024 17:17

25s

feat: add triton kernels to decrease latency of large batches CI build #1623: Pull request #2687 synchronize by OlivierDehaene

October 24, 2024 17:01

16m 9s feat/triton_prepare

feat/triton_prepare

October 24, 2024 17:01

16m 9s

feat: add triton kernels to decrease latency of large batches Nix Tests #409: Pull request #2687 synchronize by OlivierDehaene

October 24, 2024 17:01

8m 1s feat/triton_prepare

feat/triton_prepare

October 24, 2024 17:01

8m 1s

feat: add triton kernels to decrease latency of large batches Automatic Documentation for Launcher #1586: Pull request #2687 synchronize by OlivierDehaene

October 24, 2024 17:01

7m 16s feat/triton_prepare

feat/triton_prepare

October 24, 2024 17:01

7m 16s

feat: add triton kernels to decrease latency of large batches Server Tests #3281: Pull request #2687 synchronize by OlivierDehaene

October 24, 2024 17:01

8m 53s feat/triton_prepare

feat/triton_prepare

October 24, 2024 17:01

8m 53s

cast to int32 Secret Leaks #1994: Commit fb91860 pushed by OlivierDehaene

October 24, 2024 17:01

16s feat/triton_prepare

feat/triton_prepare

October 24, 2024 17:01

16s

feat: add support for qwen2 vl model Secret Leaks #1993: Commit b735e79 pushed by drbh

October 24, 2024 15:40

22s support-qwen2-vl

support-qwen2-vl

October 24, 2024 15:40

22s

feat: add support for qwen2 vl model Secret Leaks #1992: Commit edbf0f7 pushed by drbh

October 24, 2024 15:37

17s support-qwen2-vl

support-qwen2-vl

October 24, 2024 15:37

17s

Switch from fbgemm-gpu w8a8 scaled matmul to vLLM/marlin-kernels CI build #1622: Pull request #2688 synchronize by danieldk

October 24, 2024 15:32

1h 8m 0s feature/cc89-cutlass-w8a8

feature/cc89-cutlass-w8a8

October 24, 2024 15:32

1h 8m 0s

Switch from fbgemm-gpu w8a8 scaled matmul to vLLM/marlin-kernels Server Tests #3280: Pull request #2688 synchronize by danieldk

October 24, 2024 15:32

7m 31s feature/cc89-cutlass-w8a8

feature/cc89-cutlass-w8a8

October 24, 2024 15:32

7m 31s

Switch from fbgemm-gpu w8a8 scaled matmul to vLLM/marlin-kernels Automatic Documentation for Launcher #1585: Pull request #2688 synchronize by danieldk

October 24, 2024 15:32

7m 4s feature/cc89-cutlass-w8a8

feature/cc89-cutlass-w8a8

October 24, 2024 15:32

7m 4s

Switch from fbgemm-gpu w8a8 scaled matmul to vLLM/marlin-kernels Nix Tests #408: Pull request #2688 synchronize by danieldk

October 24, 2024 15:32

6m 35s feature/cc89-cutlass-w8a8

feature/cc89-cutlass-w8a8

October 24, 2024 15:32

6m 35s

Switch from fbgemm-gpu w8a8 scaled matmul to vLLM/marlin-kernels Secret Leaks #1991: Commit c6281a4 pushed by danieldk

October 24, 2024 15:32

17s feature/cc89-cutlass-w8a8

feature/cc89-cutlass-w8a8

October 24, 2024 15:32

17s

Switch from fbgemm-gpu w8a8 scaled matmul to vLLM/marlin-kernels Automatic Documentation for Launcher #1584: Pull request #2688 opened by danieldk

October 24, 2024 15:31

7m 20s feature/cc89-cutlass-w8a8

feature/cc89-cutlass-w8a8

October 24, 2024 15:31

7m 20s

Switch from fbgemm-gpu w8a8 scaled matmul to vLLM/marlin-kernels Server Tests #3279: Pull request #2688 opened by danieldk

October 24, 2024 15:31

1m 40s feature/cc89-cutlass-w8a8

feature/cc89-cutlass-w8a8

October 24, 2024 15:31

1m 40s

Switch from fbgemm-gpu w8a8 scaled matmul to vLLM/marlin-kernels Nix Tests #407: Pull request #2688 opened by danieldk

October 24, 2024 15:31

1m 41s feature/cc89-cutlass-w8a8

feature/cc89-cutlass-w8a8

October 24, 2024 15:31

1m 41s

Switch from fbgemm-gpu w8a8 scaled matmul to vLLM/marlin-kernels CI build #1621: Pull request #2688 opened by danieldk

October 24, 2024 15:31

1m 43s feature/cc89-cutlass-w8a8

feature/cc89-cutlass-w8a8

October 24, 2024 15:31

1m 43s

Switch from fbgemm-gpu w8a8 scaled matmul to vLLM/marlin-kernels Secret Leaks #1990: Commit 197d45e pushed by danieldk

October 24, 2024 15:30

18s feature/cc89-cutlass-w8a8

feature/cc89-cutlass-w8a8

October 24, 2024 15:30

18s

Switch from fbgemm-gpu w8a8 scaled matmul to vLLM/marlin-kernels Secret Leaks #1989: Commit bee95a3 pushed by danieldk

October 24, 2024 15:29

21s feature/cc89-cutlass-w8a8

feature/cc89-cutlass-w8a8

October 24, 2024 15:29

21s

feat: add triton kernels to decrease latency of large batches CI build #1620: Pull request #2687 opened by OlivierDehaene

October 24, 2024 14:49

50m 56s feat/triton_prepare

feat/triton_prepare

October 24, 2024 14:49

50m 56s

feat: add triton kernels to decrease latency of large batches Nix Tests #406: Pull request #2687 opened by OlivierDehaene

October 24, 2024 14:49

6m 7s feat/triton_prepare

feat/triton_prepare

October 24, 2024 14:49

6m 7s

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Actions

Workflows

Management

All workflows

Actions

Loading...
Loading

All workflows

Filter by Event

Sorry, something went wrong.

Sorry, something went wrong.

No matching events.

Filter by Status

Sorry, something went wrong.

Sorry, something went wrong.

No matching statuses.

Filter by Branch

Sorry, something went wrong.

Sorry, something went wrong.

No matching branches.

Filter by Actor

Sorry, something went wrong.

Sorry, something went wrong.

No matching users.

Actions: huggingface/text-generation-inference

Actions

All workflows All workflows Actions Loading... Loading Sorry, something went wrong.

All workflows

All workflows

Actions

Loading...
Loading