-
Notifications
You must be signed in to change notification settings - Fork 46
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
4.0.0 rc1 #281
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Reading error message (http status, id and label) from the response and return the status, id and label that were saved in the memory.
Update get_eurostat_json.R
Switch from httr to httr2
Require always sf for `get_eurostat_geospatial()`
Implement data.table functions for handling big objects in memory
Update package vignettes and articles on eurostat 4.0.0
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This is the first release candidate for the 4.0.0 CRAN release. All sort of feedback is very welcome.
Changes as they are listed in the package NEWS:
eurostat 4.0.0
Major updates
get_eurostat_interactive()
for interactively searching and downloading data from Eurostat SDMX API. The function aims to make good data citation practices more prominently visible and also make it easier to explore what different arguments inget_eurostat()
function do.fixity_checksum()
to easily calculate a fixity checksum for datasets downloaded from Eurostat. The fixity checksum can, for example, be saved in research notes and reported in as part of data appendices. Printing the fixity checksum is encouraged by including an option to print it in everyget_eurostat_interactive()
query.get_eurostat()
use.data.table
argument. This is especially useful with big datasets that would otherwise take a long time to go through the different data cleaning functions or crash R with their large memory footprint. (issue Better performance with big datasets #277, PR Implement data.table functions for handling big objects in memory #278)httr
package tohttr2
(issue Switch from httr to httr2 #273, PR Switch from httr to httr2 #276)list_eurostat_cache_items()
. (Affects issues mentioned in cache filtered tables, suggestion of functionality #144, Cached datasets #257, get_eurostat() does not save .rds files #258, fixed in PR Improved cache handling #267).eurostatTOC
object (returned byget_eurostat_toc()
) now use dots instead of spaces in the style ofbase::make.names()
, e.g. turninglast update of data
tolast.update.of.data
(PR Enhance search_eurostat(), TOC handling, documentation #271).eurostatTOC
object includes a new hierarchy column that represents the position of each folder, dataset and table in the folder structure.search_eurostat()
includes the option to search Table of Content items by dataset codes in addition to titles. This makes it possible to make further queries from similar datasets (e.g. "nama_10_gdp", "nama_10r_2gdp", "nama_10r_3popgdp") that might have different titles.label_eurostat_tables()
has been rewritten to use the new SDMX API instead oftable_dic.dic
file in Eurostat Bulk Download Listingget_eurostat_geospatial()
now leverages ongiscoR::gisco_get_nuts()
fordownloading geospatial data (PR Leverage on giscoR #264, thanks to @dieghernan):
"spdf"
output class soft-deprecated, it would return asf
object with a message.make_valid
parameter soft-deprecated....
to the function so additional parametes can be passed togiscoR::gisco_get_nuts()
.eurostat_geodata_60_2016
updated.get_eurostat_geospatial()
now requires sf package to work at all (PR Require always sf forget_eurostat_geospatial()
#280, thanks to @dieghernan)Minor updates
clean_eurostat_toc()
for easy removal of TOC objects from .EurostatEnv environment. (PR Implement data.table functions for handling big objects in memory #278)get_eurostat()
andget_eurostat_json()
functions. The documentation now warns users about potential problems caused bytime
/TIME_PERIOD
parameters when used to query datasets that contain quarterly data (issue get_eurostat_json fails with time / time_period parameter on quarterly data #260)get_eurostat_dic()
andlabel_eurostat()
functions.get_bibentry()
now outputs "Accessed YYYY-MM-DD" and "dataset last updated YYYY-MM-DD" in note field as otherwise it would be sporadically printed or not at all printed fromurldate
field.check_lang()
sp
,methods
andbroom
packages from dependencies.giscoR
to Suggests.New features
get_eurostat()
function now explicity accepts a 'lang' argument, for passing onwards toget_eurostat_json()
andlabel_eurostat()
(PR Fix title-code mismatch, add attribution #270)get_eurostat_folder()
for downloading all datasets in a folder. The function is limited to downloading folders that contain at maximum 20 datasets. This function relies on new internal helper functions:toc_count_whitespace()
,toc_determine_hierarchy()
,toc_count_children()
andtoc_list_children()
. (PR Fix title-code mismatch, add attribution #270)get_eurostat_toc()
andset_eurostat_toc()
now have experimental features that support downloading TOCs in French and German as well. This support, in turn, is leveraged inget_bibentry()
which now has a language parameter:lang
(PR Fix title-code mismatch, add attribution #270)get_eurostat_toc()
,search_eurostat()
now supports searching from French and German TOC-files as well (PR Fix title-code mismatch, add attribution #270)Deprecated and defunct
grepEurostatTOC()
is completely marked as defunct and is enroute to being removed from the package assearch_eurostat()
is now the only way to fetch Eurostat TOC items and search (grep) themlabel_eurostat_vars()
has been marked as deprecated in favour of a new (temporary) functionlabel_eurostat_vars2()
which uses the new SDMX API to retrieve names for dataset columns. The old function will be completely removed after October 2023 when Eurostat Bulk Download Listing website is retired andlabel_eurostat_vars2
will be renamed tolabel_eurostat_vars()
. Function evolution is subject to ongoing Eurostat API developments.Bug fixes
get_bibentry()
returns correct codes for titles and warns the user if some / all of the requested codes were not found in the TOC (PR Fix title-code mismatch, add attribution #270)get_bibentry()
uses the date field with the internal BibEntry format that can be easily translated to other formats: bibtex, bibentry (PR Fix title-code mismatch, add attribution #270)get_bibentry()
now outputs dataset codes in titles correctly so thatbibtex
andbiblatex
entries can be copypasted into bibliographies without adding escape characters manually (PR Fix title-code mismatch, add attribution #270)eurotime2date()
when handling big datasets containing weekly data and tens of millions of rows (dataset used for testing mentioned in issue Problem with week time code #200).