-
Notifications
You must be signed in to change notification settings - Fork 109
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update huggingface-hub for compatibility with datasets 2.18 #84
Conversation
Edit: just talked with @albertvillanova , and if we bump |
0.21.2 or larger (latest is 0.21.3). |
@@ -237,6 +237,7 @@ Summary: create a **line summary** of your evaluation, in `src/lighteval/tasks/t | |||
- `metric` (list), the metrics you want to use for your evaluation (see next section for a detailed explanation) | |||
- `output_regex` (str), A regex string that will be used to filter your generation. (Genrative metrics will only select tokens that are between the first and the second sequence matched by the regex. For example, for a regex matching `\n` and a generation `\nModel generation output\nSome other text` the metric will only be fed with `Model generation output`) | |||
- `frozen` (bool), for now is set to False, but we will steadily pass all stable tasks to True. | |||
- `trust_dataset` (bool), set to True if you trust the dataset. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
(nit) why not call it "trust_remote_code" as the parameter name in datasets
?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
To not confuse it with trust_remote_code required in models, initially ^^
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm OK with adding it, feel free to open an issue :)
No description provided.