Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BFCL] Multi Turn Dataset Fix (Miss Func & Long Context) #728

Merged
merged 6 commits into from
Oct 31, 2024

Conversation

HuanzhiMao
Copy link
Collaborator

This PR updates the question and ground truth for the multi_turn_miss_func and multi_turn_long_context accordingly, since they are augmented from multi_turn_base and the fix for the base entries was finalized in #723.

Following #719, #722, #723 and #725, this is also part of the effort to thoroughly bug fix the multi turn categories. There will be one more PR coming for the multi_turn_miss_param category fix.


Co-authored-by: Charlie Cheng-Jie Ji 55744150+CharlieJCJ@users.noreply.github.com
Co-authored-by: Fanjia-Yan 78303449+Fanjia-Yan@users.noreply.github.com
Co-authored-by: VishnuSuresh27 112032533+VishnuSuresh27@users.noreply.github.com

@HuanzhiMao HuanzhiMao added the BFCL-Dataset BFCL Dataset-Related Issue label Oct 30, 2024
@HuanzhiMao HuanzhiMao marked this pull request as ready for review October 31, 2024 09:13
@HuanzhiMao HuanzhiMao merged commit 12935b0 into ShishirPatil:main Oct 31, 2024
@HuanzhiMao HuanzhiMao deleted the miss_func_long_context branch October 31, 2024 12:22
HuanzhiMao added a commit that referenced this pull request Oct 31, 2024
This PR updates the question and ground truth for the
`multi_turn_miss_param` category, since they are augmented from
`multi_turn_base` and the fix for the base entries was finalized in
#723.

Following #719, #722, #723, #725 and #728, this is also part of the
effort to thoroughly bug fix the multi turn categories.

---------

Co-authored-by: Charlie Cheng-Jie Ji
<55744150+CharlieJCJ@users.noreply.github.com>
Co-authored-by: Fanjia-Yan
<78303449+Fanjia-Yan@users.noreply.github.com>
Co-authored-by: VishnuSuresh27
<112032533+VishnuSuresh27@users.noreply.github.com>
Copy link
Collaborator

@Fanjia-Yan Fanjia-Yan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Miss func & Long Context has been exhaustively verified for completeness.

VishnuSuresh27 pushed a commit to VishnuSuresh27/gorilla that referenced this pull request Nov 11, 2024
…l#728)

This PR updates the question and ground truth for the
`multi_turn_miss_func` and `multi_turn_long_context` accordingly, since
they are augmented from `multi_turn_base` and the fix for the base
entries was finalized in ShishirPatil#723.

Following ShishirPatil#719, ShishirPatil#722, ShishirPatil#723 and ShishirPatil#725, this is also part of the effort to
thoroughly bug fix the multi turn categories. There will be one more PR
coming for the `multi_turn_miss_param` category fix.

---------

Co-authored-by: Charlie Cheng-Jie Ji
<55744150+CharlieJCJ@users.noreply.github.com>
Co-authored-by: Fanjia-Yan
<78303449+Fanjia-Yan@users.noreply.github.com>
Co-authored-by: VishnuSuresh27
<112032533+VishnuSuresh27@users.noreply.github.com>
VishnuSuresh27 pushed a commit to VishnuSuresh27/gorilla that referenced this pull request Nov 11, 2024
This PR updates the question and ground truth for the
`multi_turn_miss_param` category, since they are augmented from
`multi_turn_base` and the fix for the base entries was finalized in
ShishirPatil#723.

Following ShishirPatil#719, ShishirPatil#722, ShishirPatil#723, ShishirPatil#725 and ShishirPatil#728, this is also part of the
effort to thoroughly bug fix the multi turn categories.

---------

Co-authored-by: Charlie Cheng-Jie Ji
<55744150+CharlieJCJ@users.noreply.github.com>
Co-authored-by: Fanjia-Yan
<78303449+Fanjia-Yan@users.noreply.github.com>
Co-authored-by: VishnuSuresh27
<112032533+VishnuSuresh27@users.noreply.github.com>
HuanzhiMao added a commit that referenced this pull request Nov 19, 2024
This PR updates the leaderboard to reflect the change in score due to
the following PR merge:

1. #719
2. #722
3. #723
4. #728 
5. #732
6. #725
7. #712
8. #733
9. #720 
10. #760 
11. #761 
12. #767
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
BFCL-Dataset BFCL Dataset-Related Issue
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants