Skip to content

Actions: huggingface/text-generation-inference

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
14,877 workflow runs
14,877 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Add support for FP8 KV cache scales
Nix Tests #403: Pull request #2628 synchronize by danieldk
October 24, 2024 12:37 8m 21s feature/fp8-kv-cache-scale
October 24, 2024 12:37 8m 21s
Add support for FP8 KV cache scales
Automatic Documentation for Launcher #1580: Pull request #2628 synchronize by danieldk
October 24, 2024 12:37 7m 6s feature/fp8-kv-cache-scale
October 24, 2024 12:37 7m 6s
Add support for FP8 KV cache scales
CI build #1614: Pull request #2628 synchronize by danieldk
October 24, 2024 12:37 54m 53s feature/fp8-kv-cache-scale
October 24, 2024 12:37 54m 53s
Add support for FP8 KV cache scales
Server Tests #3275: Pull request #2628 synchronize by danieldk
October 24, 2024 12:37 9m 11s feature/fp8-kv-cache-scale
October 24, 2024 12:37 9m 11s
Fix Phi 3.5 MoE tests
CI build #1613: Pull request #2684 opened by danieldk
October 24, 2024 12:08 52m 0s maintenance/phi35-test-fix
October 24, 2024 12:08 52m 0s
Fix Phi 3.5 MoE tests
Automatic Documentation for Launcher #1579: Pull request #2684 opened by danieldk
October 24, 2024 12:08 7m 1s maintenance/phi35-test-fix
October 24, 2024 12:08 7m 1s
Fix Phi 3.5 MoE tests
Secret Leaks #1980: Commit 9bbbe47 pushed by danieldk
October 24, 2024 12:07 23s maintenance/phi35-test-fix
October 24, 2024 12:07 23s
Upload PR Documentation
Upload PR Documentation #109: completed by Narsil
October 24, 2024 09:55 35s
October 24, 2024 09:55 35s
Choosing input/total tokens automatically based on available VRAM?
Build PR Documentation #220: Pull request #2673 synchronize by Narsil
October 24, 2024 09:54 47s auto_length
October 24, 2024 09:54 47s
Choosing input/total tokens automatically based on available VRAM?
CI build #1612: Pull request #2673 synchronize by Narsil
October 24, 2024 09:54 54m 21s auto_length
October 24, 2024 09:54 54m 21s
Choosing input/total tokens automatically based on available VRAM?
Automatic Documentation for Launcher #1578: Pull request #2673 synchronize by Narsil
October 24, 2024 09:54 7m 6s auto_length
October 24, 2024 09:54 7m 6s
Choosing input/total tokens automatically based on available VRAM?
Nix Tests #402: Pull request #2673 synchronize by Narsil
October 24, 2024 09:54 5m 49s auto_length
October 24, 2024 09:54 5m 49s
Choosing input/total tokens automatically based on available VRAM?
Server Tests #3274: Pull request #2673 synchronize by Narsil
October 24, 2024 09:54 6m 40s auto_length
October 24, 2024 09:54 6m 40s
Fix integration mt0 (transformers update).
Secret Leaks #1979: Commit e3db525 pushed by Narsil
October 24, 2024 09:54 17s auto_length
October 24, 2024 09:54 17s
Upload PR Documentation
Upload PR Documentation #108: completed by Narsil
October 24, 2024 09:40 31s
October 24, 2024 09:40 31s
Choosing input/total tokens automatically based on available VRAM?
Build PR Documentation #219: Pull request #2673 synchronize by Narsil
October 24, 2024 09:39 42s auto_length
October 24, 2024 09:39 42s
Choosing input/total tokens automatically based on available VRAM?
CI build #1611: Pull request #2673 synchronize by Narsil
October 24, 2024 09:39 24m 16s auto_length
October 24, 2024 09:39 24m 16s
Choosing input/total tokens automatically based on available VRAM?
Automatic Documentation for Launcher #1577: Pull request #2673 synchronize by Narsil
October 24, 2024 09:39 6m 57s auto_length
October 24, 2024 09:39 6m 57s
Choosing input/total tokens automatically based on available VRAM?
Nix Tests #401: Pull request #2673 synchronize by Narsil
October 24, 2024 09:39 9m 58s auto_length
October 24, 2024 09:39 9m 58s
Choosing input/total tokens automatically based on available VRAM?
Server Tests #3273: Pull request #2673 synchronize by Narsil
October 24, 2024 09:39 9m 8s auto_length
October 24, 2024 09:39 9m 8s
Simple updates.
Secret Leaks #1978: Commit 199973c pushed by Narsil
October 24, 2024 09:39 21s auto_length
October 24, 2024 09:39 21s