Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BFCL Chore] Quick fix change of decorators from @overrides to @override #797

Merged
merged 12 commits into from
Nov 28, 2024

Conversation

VishnuSuresh27
Copy link
Contributor

Changing decorators in oss handlers to stay consistent with trusted documentation https://github.com/mkorpela/overrides

VishnuSuresh27 and others added 11 commits November 25, 2024 23:53
Add new model `openbmb/MiniCPM3-4B-FC` to the leaderboard.

---------

Co-authored-by: Huanzhi (Hans) Mao <huanzhimao@gmail.com>
… CLI (ShishirPatil#795)

This PR adds the support for the `bfcl generate` and `bfcl evaluate`
command to take in multiple values for the `--model` and
`--test-category` arguments.

For example,
```
bfcl generate --model claude-3-5-sonnet-20241022-FC,gpt-4o-2024-08-06-FC --test-category parallel,multiple,exec_simple
```

---------

Co-authored-by: Huanzhi (Hans) Mao <huanzhimao@gmail.com>
ShishirPatil#796)

This PR updates the decoding logic for DeepSeek-Coder handler to fix its
performance issue in the irrelevance category.
The irrelevance category metric we use is that, either the `decode_ast`
should fail (error) or the decoded output is empty (eg, empty list or
empty string).

For the DeepSeek-Coder model, 
When it outputs a valid function call, the model response will be a list
of dictionaries `[{func1:{param1:val1,...}},{func2:{param2:val2,...}}]`,
so it's fine for `decode_ast` to just return it without any processing.
However, when the output is a message (not valid function call), under
the `_parse_query_response_prompting` logic, the model response will be
that message string, and in the current `decode_ast` implementation,
that string will just be treated as the decoded output, and it would
fail both the metric for the irrelevance category, which is not ideal.
@HuanzhiMao HuanzhiMao changed the title [BFCL Chore] Quick fix change of decorators from overrides to override [BFCL Chore] Quick fix change of decorators from @overrides to @override Nov 28, 2024
Copy link
Collaborator

@HuanzhiMao HuanzhiMao left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Thanks @VishnuSuresh27

@HuanzhiMao HuanzhiMao merged commit e3da4ce into ShishirPatil:main Nov 28, 2024
@HuanzhiMao HuanzhiMao added the BFCL-General General BFCL Issue label Nov 28, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
BFCL-General General BFCL Issue
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants