Pull requests: openai/evals
Emergency department classification eval
#587
opened Apr 5, 2023 by
abhinayvyas
•
Review required
11 tasks done
Intracerebral Haemorrhage Prediction in Patients with Complex Chronic…
#688
opened Apr 15, 2023 by
odeak
•
Review required
11 of 12 tasks
Eval: Advanced emotion analysis for complex scenarios based on a Ph.D. dissertation
#836
opened Apr 27, 2023 by
tuanlemau
•
Review required
11 of 12 tasks
Now I have the change in place, it seems wrong.
#1209
opened Jun 21, 2023 by
CholoTook
•
Review required
[Resolves Issue #1228] Improve ModelGraded Evals Formatting for Increased GPT Compliance
#1258
opened Jun 28, 2023 by
douglasmonsky
•
Review required
[evals] Jailbreaks for refusal of harmful content
#291
opened Mar 17, 2023 by
devxpy
•
Review required
10 of 12 tasks
Add a new eval : chinese_literary_grace
#1375
opened Oct 7, 2023 by
Conghui-Niu
•
Review required
12 of 13 tasks
Valid Hanabi clues eval & update Includes to optionally take Exclusions
#1385
opened Oct 17, 2023 by
sjadler2004
•
Review required
13 tasks done
Choose completion function for evaluation of modelgraded evals
#1418
opened Nov 17, 2023 by
LoryPack
6 tasks done
Adding Indian Women Menstrual Health Chatbot Eval
#1430
opened Dec 11, 2023 by
cranberrydeveloper
13 tasks done
[Evals] Add eval for Dhivehi diacritical marks
#1495
opened Mar 16, 2024 by
aanaseer
11 of 12 tasks
Fix AttributeError: Update OpenAI error imports (Closes #1564)
#1577
opened Jan 27, 2025 by
SaiKrishna-KK
6 of 13 tasks
Add Eval: Interpreting balance sheet absolute changes
#1336
opened Aug 16, 2023 by
TensorTemplar
12 of 13 tasks