Skip to content

Pull requests: openai/evals

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Add gpt4facts Eval
#1363 opened Sep 25, 2023 by mmtmn Review required
13 tasks done
Add few-shot-abuse eval
#478 opened Mar 28, 2023 by githubman2718 Review required
12 tasks done
Race & Gender Bias
#630 opened Apr 10, 2023 by nickls Review required
11 of 12 tasks
commonsense morality
#750 opened Apr 22, 2023 by lucid-max Review required
12 tasks done
Nits
#1308 opened Jul 7, 2023 by mrzu Draft
5 of 6 tasks
MMLU eval
#1324 opened Jul 29, 2023 by Livegan Review required
13 tasks
[evals] Jailbreaks for refusal of harmful content
#291 opened Mar 17, 2023 by devxpy Review required
10 of 12 tasks
Add a new eval : chinese_literary_grace
#1375 opened Oct 7, 2023 by Conghui-Niu Review required
12 of 13 tasks
Deepcopy in recorder
#1376 opened Oct 12, 2023 by johny-b Review required
Add Eval: name well known security weaknesses
#1392 opened Oct 28, 2023 by ourmony Loading…
1 task
Choose completion function for evaluation of modelgraded evals
#1418 opened Nov 17, 2023 by LoryPack Loading…
6 tasks done
Adding Indian Women Menstrual Health Chatbot Eval
#1430 opened Dec 11, 2023 by cranberrydeveloper Loading…
13 tasks done
Extending to Azure OpenAI implementation
#1470 opened Feb 23, 2024 by pkt1583 Loading…
Add **kwargs to OpenAIChatCompletionFn
#1494 opened Mar 15, 2024 by ezraporter Loading…
[Evals] Add eval for Dhivehi diacritical marks
#1495 opened Mar 16, 2024 by aanaseer Loading…
11 of 12 tasks
Fix AttributeError: Update OpenAI error imports (Closes #1564)
#1577 opened Jan 27, 2025 by SaiKrishna-KK Loading…
6 of 13 tasks
Add Eval: Interpreting balance sheet absolute changes
#1336 opened Aug 16, 2023 by TensorTemplar Loading…
12 of 13 tasks
ProTip! Follow long discussions with comments:>50.