Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

pens dataset is broken #406

Open
neubig opened this issue Dec 27, 2022 · 0 comments
Open

pens dataset is broken #406

neubig opened this issue Dec 27, 2022 · 0 comments
Labels
bug Something isn't working

Comments

@neubig
Copy link
Collaborator

neubig commented Dec 27, 2022

>>> datalabs.load_dataset("pens", "document")
Couldn't find a directory or a dataset named 'pens' in this version. It was picked from the master branch on github instead.
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/Users/gneubig/opt/anaconda3/envs/explainaboard_web/lib/python3.10/site-packages/datalabs/load.py", line 2144, in load_dataset
    builder_instance.download_and_prepare(
  File "/Users/gneubig/opt/anaconda3/envs/explainaboard_web/lib/python3.10/site-packages/datalabs/builder.py", line 747, in download_and_prepare
    self._download_and_prepare(
  File "/Users/gneubig/opt/anaconda3/envs/explainaboard_web/lib/python3.10/site-packages/datalabs/builder.py", line 844, in _download_and_prepare
    split_generators = self._split_generators(dl_manager, **split_generators_kwargs)
  File "/Users/gneubig/.cache/expressai/modules/datasets_modules/datalab/pens/5b1bd06733f02fe4e836301c6e3c1ad740fbdd33409b9308a2fceedfdb6f766c/pens.py", line 87, in _split_generators
    f_path = dl_manager.download_and_extract(
  File "/Users/gneubig/opt/anaconda3/envs/explainaboard_web/lib/python3.10/site-packages/datalabs/utils/download_manager.py", line 322, in download_and_extract
    return self.extract(self.download(url_or_urls))
  File "/Users/gneubig/opt/anaconda3/envs/explainaboard_web/lib/python3.10/site-packages/datalabs/utils/download_manager.py", line 221, in download
    downloaded_path_or_paths = map_nested(
  File "/Users/gneubig/opt/anaconda3/envs/explainaboard_web/lib/python3.10/site-packages/datalabs/utils/py_utils.py", line 297, in map_nested
    return function(data_struct)
  File "/Users/gneubig/opt/anaconda3/envs/explainaboard_web/lib/python3.10/site-packages/datalabs/utils/download_manager.py", line 248, in _download
    return cached_path(url_or_filename, download_config=download_config)
  File "/Users/gneubig/opt/anaconda3/envs/explainaboard_web/lib/python3.10/site-packages/datalabs/utils/file_utils.py", line 344, in cached_path
    output_path = get_from_cache(
  File "/Users/gneubig/opt/anaconda3/envs/explainaboard_web/lib/python3.10/site-packages/datalabs/utils/file_utils.py", line 720, in get_from_cache
    raise ConnectionError(f"Couldn't reach {url} ({repr(head_error)})")
ConnectionError: Couldn't reach https://msrshare.blob.core.windows.net/msr/training_set.zip (ConnectionError(MaxRetryError("HTTPSConnectionPool(host='msrshare.blob.core.windows.net', port=443): Max retries exceeded with url: /msr/training_set.zip (Caused by NewConnectionError('<urllib3.connection.HTTPSConnection object at 0x7f82f9bf3f70>: Failed to establish a new connection: [Errno 8] nodename nor servname provided, or not known'))")))
@neubig neubig added the bug Something isn't working label Dec 27, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant