chore(deps): update dependency datasets to v2 - autoclosed #3
This PR contains the following updates:
datasets: ==1.11.0 -> ==2.21.0
Release Notes
huggingface/datasets (datasets)
v2.21.0
Compare Source
Features
What's Changed
HF_HUB_OFFLINE instead of HF_DATASETS_OFFLINE by @Wauplin in https://github.com/huggingface/datasets/pull/6968
New Contributors
Full Changelog: huggingface/datasets@2.20.0...2.21.0
v2.20.0
Compare Source
Important
trust_remote_code=True by @lhoestq in https://github.com/huggingface/datasets/pull/6954
trust_remote_code=True to be used
Datasets features
checkpoint and resume an iterable dataset (e.g. when streaming):
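The checkpoint-and-resume feature noted for v2.20.0 can be sketched roughly as follows. This is a minimal illustration, not the release notes' own example: the helper names `iterate_with_checkpoint`, `resume`, and `demo` are hypothetical, and the sketch assumes the dataset object exposes `state_dict()` and `load_state_dict()` as `datasets.IterableDataset` does from v2.20.0.

```python
def iterate_with_checkpoint(iterable_dataset, stop_after):
    """Consume `stop_after` examples, then snapshot the iteration state.

    `iterable_dataset` is assumed to expose `state_dict()`, as
    `datasets.IterableDataset` does from v2.20.0.
    """
    seen = []
    state = None
    for idx, example in enumerate(iterable_dataset):
        seen.append(example)
        if idx + 1 == stop_after:
            # Snapshot taken mid-iteration, right after the last consumed example.
            state = iterable_dataset.state_dict()
            break
    return seen, state


def resume(iterable_dataset, state):
    """Restore a snapshot and return the examples that remain after it."""
    iterable_dataset.load_state_dict(state)
    return list(iterable_dataset)


def demo():
    # Requires `pip install "datasets>=2.20.0"`. Imported lazily so the
    # helpers above stay importable without the library.
    from datasets import Dataset

    ds = Dataset.from_dict({"a": range(6)}).to_iterable_dataset(num_shards=3)
    seen, state = iterate_with_checkpoint(ds, stop_after=3)
    rest = resume(ds, state)
    print("before checkpoint:", seen)
    print("after resume:", rest)
```

Calling `demo()` in an environment with `datasets` installed prints the examples seen before the checkpoint and those yielded after restoring it. The state is a plain dictionary, so it could presumably be saved alongside a training checkpoint.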
General improvements and bug fixes
.pth support for torch tensors by @lhoestq in https://github.com/huggingface/datasets/pull/6920
dataset_module_factory by @Wauplin in https://github.com/huggingface/datasets/pull/6959
New Contributors
Full Changelog: huggingface/datasets@2.19.0...2.20.0
v2.19.2
Compare Source
Bug fixes
Full Changelog: huggingface/datasets@2.19.1...2.19.2
v2.19.1
Compare Source
Bug fixes
Full Changelog: huggingface/datasets@2.19.0...2.19.1
v2.19.0
Compare Source
Dataset Features
.to_polars()
fsspec support for to_json, to_csv, and to_parquet by @alvarobartt in https://github.com/huggingface/datasets/pull/6096
mode parameter to Image feature by @mariosasko in https://github.com/huggingface/datasets/pull/6735
datasets-cli convert_to_parquet <dataset_id>
General improvements and bug fixes
remove_columns/rename_columns doc fixes by @mariosasko in https://github.com/huggingface/datasets/pull/6772
uv in CI by @mariosasko in https://github.com/huggingface/datasets/pull/6779
_check_legacy_cache2 by @lhoestq in https://github.com/huggingface/datasets/pull/6792
DatasetBuilder._split_generators incomplete type annotation by @JonasLoos in https://github.com/huggingface/datasets/pull/6799
CachedDatasetModuleFactory and Cache by @izhx in https://github.com/huggingface/datasets/pull/6754
os.path.relpath in resolve_patterns by @mariosasko in https://github.com/huggingface/datasets/pull/6815
Dataset.__getitem__ by @mariosasko in https://github.com/huggingface/datasets/pull/6817
New Contributors
Full Changelog: huggingface/datasets@2.18.0...2.19.0
v2.18.0
Compare Source
Dataset features
General improvements and bug fixes
num_workers could lead to incorrect shard assignments to workers and cause errors
xlistdir by @mariosasko in https://github.com/huggingface/datasets/pull/6698
New Contributors
Full Changelog: huggingface/datasets@2.17.1...2.18.0
v2.17.1
Compare Source
Bug Fixes
arrow_writer.py from #6636 by @bryant1410 in https://github.com/huggingface/datasets/pull/6664
Full Changelog: huggingface/datasets@2.17.0...2.17.1
v2.17.0
Compare Source
Dataset Features
General improvements and bug fixes
drop_last_batch in map after shuffling or sharding by @lhoestq in https://github.com/huggingface/datasets/pull/6575
setup.cfg to pyproject.toml by @mariosasko in https://github.com/huggingface/datasets/pull/6619
tqdm bars in non-interactive environments by @mariosasko in https://github.com/huggingface/datasets/pull/6627
with_rank param to Dataset.filter by @mariosasko in https://github.com/huggingface/datasets/pull/6608
New Contributors
Full Changelog: huggingface/datasets@2.16.1...2.17.0
v2.16.1
Compare Source
Bug fixes
cache_dir to load_dataset
load_dataset("ted_talks_iwslt", language_pair=("ja", "en"), year="2015")
Full Changelog: huggingface/datasets@2.16.0...2.16.1
v2.16.0
Compare Source
Security features
https://hf.co/datasets/<repo_id>. A warning is shown to let the user know about the custom code, and they can avoid this message in future by passing the argument trust_remote_code=True.
trust_remote_code=True will be mandatory to load these datasets from the next major release of datasets.
HF_DATASETS_TRUST_REMOTE_CODE=0 you can already disable custom code by default without waiting for the next release of datasets
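For reviewers checking how this upgrade affects existing loading code, the opt-out and opt-in described above might look like the sketch below. The wrapper `load_script_dataset` is a hypothetical helper, not part of `datasets`. The sketch assumes the library reads `HF_DATASETS_TRUST_REMOTE_CODE` when it is first imported, so the variable is set beforehand.

```python
import os

# Disable dataset loading scripts by default. Set before `datasets` is imported,
# on the assumption that the library reads this variable at import time.
os.environ["HF_DATASETS_TRUST_REMOTE_CODE"] = "0"


def load_script_dataset(repo_id, trust=False):
    """Load a Hub dataset, running its loading script only when `trust` is True.

    Hypothetical wrapper for illustration; it forwards to `load_dataset` with
    the `trust_remote_code` argument named in the release notes.
    """
    # Lazy import: requires `datasets` (2.16.0 or later) to be installed.
    from datasets import load_dataset

    return load_dataset(repo_id, trust_remote_code=trust)
```

With the variable set to 0, a call such as `load_script_dataset("<repo_id>", trust=True)` is an explicit, per-call opt-in for a repository whose script has been reviewed.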
Configuration
📅 Schedule: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined).
🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.
♻ Rebasing: Whenever PR is behind base branch, or you tick the rebase/retry checkbox.
🔕 Ignore: Close this PR and you won't be reminded about this update again.
This PR was generated by Mend Renovate. View the repository job log.