How to create a jagged cross() transform #697

htlin · 2019-01-31T18:35:04Z

I am hoping to do a cross() transform but I wouldn't want a complete cross product - rather a jagged version instead, e.g.:

plan <- drake_plan(
  s_load = target(
    load_csv(group, rep),
    transform = cross(
      group = c("G1", "G2"),
      rep = c("R1", "R2", "R3", "R4", "R5", "R6")
    )
  )
)

For example, my group G1 has rep R1-R6, but G2 only has R1-R4 which is missing R5-R6.
My function load_csv is searching for input files to read, in this case Gx_Ry.csv for example, but I don't have G2_R5.csv and G2_R6.csv and so it fails with files not found for those two targets.
Any recommendations would be appreciated, thanks!

The text was updated successfully, but these errors were encountered:

wlandau · 2019-01-31T19:14:30Z

Another nice one for the FAQ. Fortunately, this is straightforward if you create your own grid in advance and then use map().

library(drake)
library(tidyverse)
  
grid <- crossing(
  group = c("G1", "G2"),
  rep = c("R1", "R2", "R3", "R4", "R5", "R6")
) %>%
  filter(!(group == "G2" & rep %in% c("R5", "R6")))

drake_plan(
  s_load = target(
    load_csv(group, rep),
    transform = map(
      group = !!grid$group,
      rep = !!grid$rep
    )
  )
)
#> # A tibble: 10 x 2
#>    target           command                   
#>    <chr>            <chr>                     
#>  1 s_load_.G1._.R1. "load_csv(\"G1\", \"R1\")"
#>  2 s_load_.G1._.R2. "load_csv(\"G1\", \"R2\")"
#>  3 s_load_.G1._.R3. "load_csv(\"G1\", \"R3\")"
#>  4 s_load_.G1._.R4. "load_csv(\"G1\", \"R4\")"
#>  5 s_load_.G1._.R5. "load_csv(\"G1\", \"R5\")"
#>  6 s_load_.G1._.R6. "load_csv(\"G1\", \"R6\")"
#>  7 s_load_.G2._.R1. "load_csv(\"G2\", \"R1\")"
#>  8 s_load_.G2._.R2. "load_csv(\"G2\", \"R2\")"
#>  9 s_load_.G2._.R3. "load_csv(\"G2\", \"R3\")"
#> 10 s_load_.G2._.R4. "load_csv(\"G2\", \"R4\")"

Created on 2019-01-31 by the reprex package (v0.2.1.9000)

htlin · 2019-01-31T22:19:39Z

Nice! Thanks for the solution.
Another thought I have now is that, can I make a target that tries to find all available files, and then dynamically generate (like yield in Python perhaps) named targets accordingly?

wlandau · 2019-02-01T19:33:18Z

Sounds like #685, which many people have requested. In drake the plan needs to be fully written out before you call make(), which may limit what I think you are describing.

But if the files you mention are all available before you write the plan, then yes, you can write a plan whose target names are automatically generated.

library(drake)
files <- list.files("dir")
plan <- drake_plan(s_load = target(load_csv(file), transform = map(file = !!files)))

wlandau · 2019-02-07T21:51:04Z

#720 will make custom grids easier. Check this out:

library(drake)
library(tidyverse)

grid <- crossing(
  group = c("G1", "G2"),
  rep = c("R1", "R2", "R3", "R4", "R5", "R6")
) %>%
  filter(!(group == "G2" & rep %in% c("R5", "R6")))

drake_plan(
  s_load = target(
    load_csv(group, rep),
    transform = map(.data = !!grid)
  )
)
#> # A tibble: 10 x 2
#>    target           command             
#>    <chr>            <expr>              
#>  1 s_load_.G1._.R1. load_csv("G1", "R1")
#>  2 s_load_.G1._.R2. load_csv("G1", "R2")
#>  3 s_load_.G1._.R3. load_csv("G1", "R3")
#>  4 s_load_.G1._.R4. load_csv("G1", "R4")
#>  5 s_load_.G1._.R5. load_csv("G1", "R5")
#>  6 s_load_.G1._.R6. load_csv("G1", "R6")
#>  7 s_load_.G2._.R1. load_csv("G2", "R1")
#>  8 s_load_.G2._.R2. load_csv("G2", "R2")
#>  9 s_load_.G2._.R3. load_csv("G2", "R3")
#> 10 s_load_.G2._.R4. load_csv("G2", "R4")

^{Created on 2019-02-07 by the reprex package (v0.2.1.9000)}

wlandau self-assigned this Jan 31, 2019

wlandau added type: faq topic: api labels Jan 31, 2019

wlandau closed this as completed Jan 31, 2019

This was referenced Feb 5, 2019

map() back to original variables after after combine() #710

Closed

Try to break the DSL #717

Closed

brendanf mentioned this issue Feb 7, 2019

Add .data argument to map #719

Closed

wlandau mentioned this issue Feb 7, 2019

map(.data = your_grid) #720

Merged

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to create a jagged cross() transform #697

How to create a jagged cross() transform #697

htlin commented Jan 31, 2019

wlandau commented Jan 31, 2019

htlin commented Jan 31, 2019

wlandau commented Feb 1, 2019

wlandau commented Feb 7, 2019

How to create a jagged cross() transform #697

How to create a jagged cross() transform #697

Comments

htlin commented Jan 31, 2019

wlandau commented Jan 31, 2019

htlin commented Jan 31, 2019

wlandau commented Feb 1, 2019

wlandau commented Feb 7, 2019