Use case for mutate(clean = TRUE) #302

ijlyttle · 2014-03-06T13:28:12Z

Per Hadley's suggestion, this was moved from the manipulatr google group. link

Let's say I get a csv file with a bunch of variables named "XPF102", "FGR24D", and so on. There may be twenty of these.

From my data-dictionary, I see that "XPF102" is pressure in psi, "FGR24D" is a part-per-million of contaminant, and so on.

I have used plyr::summarise() to do two things at once (this could be what you are trying to get away from in dplyr):

data_new <- plyr::summarise(
  data_old,
  pressure = XPF102 * 6894.75729,  # convert to Pa from psi (I know a function is more appropriate here)
  concentration_contaminant = FGR24D / 1.e6, # convert to proportion from parts-per-million
  ...
)

Using dplyr, I might have to do this:

data_new <- 
  data_old %.%
  mutate(
    pressure = XPF102 * 6894.75729,
    concentration_contaminant = FGR24D / 1.e6, 
    ...
  ) %.%
  select(
    pressure,
    concentration_contaminant,
    ...
  )

Doing this, I seem to have the opportunity to mistype a variable name by violating DRY.

Hadley suggested:

Maybe an option to mutate like clean = TRUE?

I like the idea. My only (relatively uneducated) concern is if the user wants to name a variable clean, but I'm sure there is a clever way to avoid that.

Thanks,

Ian

The text was updated successfully, but these errors were encountered:

hadley · 2014-07-28T16:00:34Z

I'm now slightly leaning towards a new verb called transmute().

ijlyttle · 2014-07-28T17:27:43Z

FWIW, I like it.

piccolbo · 2014-08-01T17:53:46Z

When I picked transmute for plyrmr I thought it was odd enough that there would be no sharing of it. I have no problem with transmute popping up in dplyr, quite the opposite, the only problem is that transmute in plyrmr is uber-general and allows you to do thing like multi-row summaries (e.g. quantiles) or expansions, like splitting a line of text into words. It evaluates the ... arguments in an expanded environment and binds them together in a data frame (vectors and data.frames and lists are all allowed), applies fractional recycling like cbind does and returns the result. The right name for this was probably transform but that was taken for a much more constrained operation. So I went with transmute, but now you need that name. Fine. So what do you suggest? I am willing to pick anything that will suggest absolute freedom in assembling the result, is in the dictionary and won't appear in dplyr in the next century.

hadley · 2014-08-01T18:27:16Z

It might be ok - what does the signature of plyrmr::transmute look like? We might be able to share the same generic.

piccolbo · 2014-08-01T18:44:00Z

function(.data, ..., .cbind = FALSE, .columns = NULL, .envir = parent.frame())

The problem I think is more the semantic difference. I know we had this discussion before, but it's not settled. Use case

mtcars %>% transmute(quantile(mpg), quantile(hp))

hadley mentioned this issue Mar 26, 2014

Could select() substitute mutate() to add new variable? #355

Closed

hadley added the enhancement label Aug 1, 2014

hadley modified the milestones: 0.3.1, 0.3 Aug 1, 2014

hadley self-assigned this Aug 1, 2014

hadley closed this as completed in 92690cf Aug 1, 2014

krlmlr pushed a commit to krlmlr/dplyr that referenced this issue Mar 2, 2016

First pass at transmute implementation. Closes tidyverse#302

7daa9f7

lock bot locked as resolved and limited conversation to collaborators Jun 10, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use case for mutate(clean = TRUE) #302

Use case for mutate(clean = TRUE) #302

ijlyttle commented Mar 6, 2014

hadley commented Jul 28, 2014

ijlyttle commented Jul 28, 2014

piccolbo commented Aug 1, 2014

hadley commented Aug 1, 2014

piccolbo commented Aug 1, 2014

Use case for mutate(clean = TRUE) #302

Use case for mutate(clean = TRUE) #302

Comments

ijlyttle commented Mar 6, 2014

hadley commented Jul 28, 2014

ijlyttle commented Jul 28, 2014

piccolbo commented Aug 1, 2014

hadley commented Aug 1, 2014

piccolbo commented Aug 1, 2014