Actions: pytorch/torchchat

Showing runs from all workflows
17,448 workflow runs

Add new torchao experimental a8wxdq for fast ARM CPU quantization
Run the aoti runner with CUDA using stories #1813: Pull request #1145 synchronize by metascroy
September 18, 2024 20:52 9m 1s a8wxdq
Add new torchao experimental a8wxdq for fast ARM CPU quantization
Run the README instructions - with stories - on MPS/MacOS #1317: Pull request #1145 synchronize by metascroy
September 18, 2024 20:52 7m 55s a8wxdq
Add new torchao experimental a8wxdq for fast ARM CPU quantization
pull #2739: Pull request #1145 synchronize by metascroy
September 18, 2024 20:52 In progress a8wxdq
Add new torchao experimental a8wxdq for fast ARM CPU quantization
Run the README instructions - with stories #1428: Pull request #1145 synchronize by metascroy
September 18, 2024 20:52 In progress a8wxdq
Add new torchao experimental a8wxdq for fast ARM CPU quantization
Run parallel prefill #1703: Pull request #1145 synchronize by metascroy
September 18, 2024 20:52 In progress a8wxdq
Add new torchao experimental a8wxdq for fast ARM CPU quantization
Run the README instructions - with stories - on MacOS #1428: Pull request #1145 synchronize by metascroy
September 18, 2024 20:52 8m 43s a8wxdq
Add new torchao experimental a8wxdq for fast ARM CPU quantization
pull #2738: Pull request #1145 synchronize by metascroy
September 18, 2024 20:47 In progress a8wxdq
Add new torchao experimental a8wxdq for fast ARM CPU quantization
Run the README instructions - with stories #1427: Pull request #1145 synchronize by metascroy
September 18, 2024 20:47 In progress a8wxdq
Add new torchao experimental a8wxdq for fast ARM CPU quantization
Run parallel prefill #1702: Pull request #1145 synchronize by metascroy
September 18, 2024 20:47 In progress a8wxdq
Add new torchao experimental a8wxdq for fast ARM CPU quantization
Run the aoti runner with CUDA using stories #1812: Pull request #1145 synchronize by metascroy
September 18, 2024 20:47 9m 25s a8wxdq
Add new torchao experimental a8wxdq for fast ARM CPU quantization
Run the README instructions - with stories - on MPS/MacOS #1316: Pull request #1145 synchronize by metascroy
September 18, 2024 20:47 7m 59s a8wxdq
Add new torchao experimental a8wxdq for fast ARM CPU quantization
Run the README instructions - with stories - on MacOS #1427: Pull request #1145 synchronize by metascroy
September 18, 2024 20:47 8m 39s a8wxdq
Update Chat Max_seq_len to 1024
Run the aoti runner with CUDA using stories #1811: Pull request #1163 opened by Jack-Khuu
September 18, 2024 20:31 10m 1s update_chat_max_seq_len
Update Chat Max_seq_len to 1024
Run the README instructions - with stories #1426: Pull request #1163 opened by Jack-Khuu
September 18, 2024 20:31 17m 40s update_chat_max_seq_len
Update Chat Max_seq_len to 1024
Run the README instructions - with stories - on MPS/MacOS #1315: Pull request #1163 opened by Jack-Khuu
September 18, 2024 20:31 7m 18s update_chat_max_seq_len
Update Chat Max_seq_len to 1024
pull #2737: Pull request #1163 opened by Jack-Khuu
September 18, 2024 20:31 In progress update_chat_max_seq_len
Update Chat Max_seq_len to 1024
Run parallel prefill #1701: Pull request #1163 opened by Jack-Khuu
September 18, 2024 20:31 29m 45s update_chat_max_seq_len
Update Chat Max_seq_len to 1024
Run the README instructions - with stories - on MacOS #1426: Pull request #1163 opened by Jack-Khuu
September 18, 2024 20:31 8m 9s update_chat_max_seq_len
[Distributed] Separate prefill and decode
Run the aoti runner with CUDA using stories #1810: Pull request #1162 opened by kwen2501
September 18, 2024 08:46 9m 36s decode2
[Distributed] Separate prefill and decode
Run the README instructions - with stories #1425: Pull request #1162 opened by kwen2501
September 18, 2024 08:46 17m 19s decode2
[Distributed] Separate prefill and decode
Run parallel prefill #1700: Pull request #1162 opened by kwen2501
September 18, 2024 08:46 29m 2s decode2
[Distributed] Separate prefill and decode
Run the README instructions - with stories - on MPS/MacOS #1314: Pull request #1162 opened by kwen2501
September 18, 2024 08:46 2h 27m 39s decode2
[Distributed] Separate prefill and decode
pull #2736: Pull request #1162 opened by kwen2501
September 18, 2024 08:46 2h 25m 17s decode2
[Distributed] Separate prefill and decode
Run the README instructions - with stories - on MacOS #1425: Pull request #1162 opened by kwen2501
September 18, 2024 08:46 7m 58s decode2
[Distributed] Use Tensor Parallel instead of Sequence Parallel
Run the README instructions - with stories #1424: Pull request #1160 synchronize by kwen2501
September 18, 2024 08:36 17m 40s tp_not_sp