Convert small data sets to "static" versions #137

cbeleites · 2020-05-21T19:54:03Z

bryanhanson · 2020-05-21T21:05:40Z

On the third point, why not use the static data in the vignettes? That would simplify things a great deal.

cbeleites · 2020-05-21T21:14:51Z

@bryanhanson: thanks for spotting. I was thinking of the vignettes that import the real spectra.

(Future) vignettes that are shipped with hyperSpec can and should use the data sets as in hyperSpec.

related: #138

GegznaV · 2020-05-21T21:43:56Z

~~Why is chondro not on this list?~~ (I found the answer)

The description of the datasets may contain a sentence such as "The dataset provided with the package is a subset of... (some real data)". To illustrate the ideas/functionality, we can even use artificial spectra. Just a thought, but maybe some functions that simulate spectra could be included?

GegznaV · 2020-05-21T21:45:39Z

> (Future) vignettes that are shipped with hyperSpec can and should use the data sets as in hyperSpec.

I agree. But the links to the original dataset could also be provided.

cbeleites · 2020-05-21T21:58:41Z

chondro is not on this list because it is too big and needs to be dealt with in a different manner (#129). The solution for chondro is twofold:

- For the purpose of examples in the help pages, @bryanhanson's new FauxCell will take the role of chondro to illustrate a spectral map/image.
- @ximeg is preparing a separate data package that contains the original data of chondro. The vignette (naturally) moves into the data package. The vignette already depends on the original data: the PCA-compressed version shows substantial compression artifacts that mess up the workflow. But the best lossless compression of the original data that we can easily use (xz) still takes several MB, so there is no way to ship it other than in a data package.

cbeleites · 2020-05-21T22:01:35Z

> links to the original dataset could also be provided.

There is a `@source` Roxygen tag that can be used. I was thinking of putting something along these lines, e.g. for barbiturates: "This data set was prepared from the first few subfiles of `BARBITURATES.SPC`. For more information and the original file, see package 'hyperSpec_import_spc'."
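As a rough sketch of how that could look in the roxygen block that documents the data set: the title, the description wording and the `@format` placeholder below are assumptions for illustration, and the package name in `@source` is the tentative one from this thread, not a confirmed name.

```r
#' Barbiturates spectra (static subset)
#'
#' Illustrative documentation stub; the title and description wording
#' are assumptions, not the text actually used in hyperSpec.
#'
#' @format (describe the object here)
#' @source This data set was prepared from the first few subfiles of
#'   `BARBITURATES.SPC`. For more information and the original file,
#'   see package 'hyperSpec_import_spc' (tentative package name).
"barbiturates"
```

roxygen2 renders `@source` as a "Source" section on the help page of the data set, so the pointer to the original file stays next to the description of the shortened data.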

GegznaV · 2020-05-21T22:08:01Z

> see package 'hyperSpec_import_spc'

Are the names of the new packages confirmed? Was there a discussion on that? I would prefer shorter names such as hy.import, hy.manager, hy.plot, hyperImport, hyperWrangler, hyperPlot, etc.

cbeleites · 2020-05-22T11:26:34Z

no, that discussion was postponed to the next video call (Monday 7 pm EEST).

I opened an issue, though: #140

GegznaV · 2020-06-11T09:19:30Z

I'm doing some experimentation on this topic. There are some (in my opinion) unnecessary entries in .Rbuildignore and/or .gitignore, which may have led to the unsuccessful results of Claudia's experimentation with data (she talked about that this Monday).

cbeleites · 2020-06-11T14:57:04Z

I had not been experimenting with the data sets in this issue but with fauxCell. The small data sets here are unproblematic: they are already standard .rda files and behave as expected. fauxCell is totally different since it is a variable created by the package source code. An explanation of what I tried is at the end of #114. I don't think it is related to .gitignore (or .Rbuildignore).
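For reference, a minimal sketch of what "static" means here, using only base R and the `tools` package. The object `small_spectra` and the temporary file path are hypothetical stand-ins; in a package the file would live under `data/`.

```r
# Hypothetical stand-in for a small data set; a real hyperSpec object
# would be saved in the same way.
set.seed(1)
small_spectra <- matrix(rnorm(5 * 200), nrow = 5)

# Temporary location for the demo; in a package: data/small_spectra.rda
rda_file <- file.path(tempdir(), "small_spectra.rda")

# Write a standard .rda file with xz compression
save(small_spectra, file = rda_file, compress = "xz")

# Report file size and the compression that was actually used
info <- tools::checkRdaFiles(rda_file)
print(info)

# Load into a fresh environment to confirm the round trip is lossless
env <- new.env()
load(rda_file, envir = env)
stopifnot(identical(env$small_spectra, small_spectra))
```

A file written this way is loaded by `data()` or lazy loading like any other data set, with no build-time step that generates it.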

You are right that .gitignore will need to be changed: at the moment it reflects which files are created or copied into their place in the package directory tree by make.
I added cleaning up .gitignore and the Makefiles to the TODO list at the top of the issue.

see #137

cbeleites · 2020-07-18T09:18:22Z

Fixed documentation - this closes the issue.

cbeleites self-assigned this May 21, 2020

cbeleites mentioned this issue May 21, 2020

Get (mostly) rid of make #132

Closed

cbeleites changed the title ~~Convert data sets to "static" versions~~ Convert small data sets to "static" versions May 21, 2020

GegznaV added the Topic: datasets 📅 Related to datasets in hyperSpec label May 21, 2020

cbeleites added question ❔ and removed question ❔ labels May 22, 2020

cbeleites added this to the Version 1.0 milestone Jun 11, 2020

This was referenced Jun 12, 2020

Removed make, static datasets, Rmd vignettes, enabled CI and pkgdown #153

Closed

Feature/131 Enable R-CMD-check on Git-Hub Actions #159

Merged

GegznaV mentioned this issue Jun 19, 2020

Feature/78, 79, etc. Vignettes translated to R Markdown #147

Merged

5 tasks

cbeleites added a commit that referenced this issue Jul 5, 2020

Shorten barbiturates data

6e31e7d

see #137

cbeleites closed this as completed Jul 18, 2020
