Actions: openai/evals

Showing runs from all workflows
270 workflow runs

Updates on existing evals; readmes; solvers
Run unit tests #1640: Pull request #1483 opened by ojaffe
March 13, 2024 09:45 2m 25s ojaffe:ollie/updates-20240313
Log model and usage stats in record.sampling
Run unit tests #1638: Pull request #1449 synchronize by JunShern
March 13, 2024 07:48 2m 29s jun/log-token-counts
Drop two datasets from steganography (#1481)
Run unit tests #1637: Commit 7e958fe pushed by JunShern
March 12, 2024 09:23 2m 50s main
Drop two datasets from steganography
Run unit tests #1636: Pull request #1481 opened by thesofakillers
March 12, 2024 07:54 2m 3s thesofakillers:steg-data
Drop two datasets from steganography
Run new evals #2211: Pull request #1481 opened by thesofakillers
March 12, 2024 07:54 2m 0s thesofakillers:steg-data
Drop two datasets from steganography
Run unit tests #1635: Pull request #1477 synchronize by thesofakillers
March 12, 2024 07:44 2m 16s thesofakillers:change-steg-datasets
Drop two datasets from steganography
Run new evals #2210: Pull request #1477 synchronize by thesofakillers
March 12, 2024 07:44 2m 17s thesofakillers:change-steg-datasets
Add info about logging and link to logviz
Run unit tests #1634: Pull request #1480 synchronize by JunShern
March 12, 2024 06:19 2m 25s jun/link-to-logviz
Add info about logging and link to logviz
Run unit tests #1633: Pull request #1480 opened by JunShern
March 12, 2024 05:58 2m 29s jun/link-to-logviz
Drop two datasets from steganography
Run unit tests #1632: Pull request #1477 synchronize by JunShern
March 12, 2024 04:45 2m 20s thesofakillers:change-steg-datasets
Drop two datasets from steganography
Run new evals #2209: Pull request #1477 synchronize by JunShern
March 12, 2024 04:45 2m 3s thesofakillers:change-steg-datasets
Investigate failing Assistants test
Run unit tests #1631: Pull request #1479 synchronize by JunShern
March 12, 2024 04:40 2m 14s JunShern:jun/test-failing-assistants
Investigate failing Assistants test
Run unit tests #1630: Pull request #1479 synchronize by JunShern
March 12, 2024 04:40 2m 24s JunShern:jun/test-failing-assistants
Investigate failing Assistants test
Run unit tests #1629: Pull request #1479 synchronize by JunShern
March 12, 2024 04:36 2m 23s JunShern:jun/test-failing-assistants
Investigate failing Assistants test
Run unit tests #1628: Pull request #1479 synchronize by JunShern
March 12, 2024 04:30 2m 22s JunShern:jun/test-failing-assistants
Investigate failing Assistants test
Run unit tests #1627: Pull request #1479 opened by JunShern
March 12, 2024 04:14 2m 24s JunShern:jun/test-failing-assistants
Investigating failing AssistantsSolver test
Run unit tests #1626: Pull request #1478 opened by JunShern
March 12, 2024 03:40 2m 31s jun/bugfix-assistants-test
Adding Indian Women Menstrual Health Chatbot Eval
Run unit tests #1619: Pull request #1430 synchronize by phalgunagopal
February 27, 2024 18:57 2m 14s cranberrydeveloper:main
Adding Indian Women Menstrual Health Chatbot Eval
Run new evals #2204: Pull request #1430 synchronize by phalgunagopal
February 27, 2024 18:57 2m 8s cranberrydeveloper:main
Suppress 'HTTP/1.1 200 OK' logs from openai library (#1468)
Run unit tests #1615: Commit 82ec660 pushed by etr2460
February 23, 2024 02:15 2m 25s main
Suppress 'HTTP/1.1 200 OK' logs from openai library
Run unit tests #1614: Pull request #1468 opened by JunShern
February 15, 2024 04:28 2m 15s jun/suppress-httpx-logs
Fix small typos and inconsistencies in README (#1464)
Run unit tests #1613: Commit 902f750 pushed by logankilpatrick
February 13, 2024 14:29 2m 12s main