datawizard versions of separate() and unite() #423

jmgirard · 2023-05-25T19:11:33Z

I'm hoping to retool my packages soon to use datawizard instead of dplyr and tidyr. However, a few things I need that don't seem to exist yet are datawizard versions of separate() and unite() from tidyr. Any plans to create these?

etiennebacher · 2023-05-25T19:26:38Z

I think unite() should be fairly easy to implement, but separate() is a bit trickier because of the arguments fill and extra. I started working on implementing separate() in poorman (cf nathaneastwood/poorman#107) but it's still WIP (and I don't know when I'll get back at it). It might provide a starting point if someone else wants to do it.

Fixes #423

strengejacke · 2023-05-25T21:09:06Z

library(datawizard)

d <- data.frame(
  x = c(NA, 1:3),
  y = c(letters[1:3], NA_character_),
  z = 6:9,
  m = c("X", NA_character_, "Y", "Z")
)
d
#>    x    y z    m
#> 1 NA    a 6    X
#> 2  1    b 7 <NA>
#> 3  2    c 8    Y
#> 4  3 <NA> 9    Z

data_unite(d, new_column = "xyz")
#>         xyz
#> 1  NA_a_6_X
#> 2  1_b_7_NA
#> 3   2_c_8_Y
#> 4  3_NA_9_Z
data_unite(d, new_column = "xyz", remove_na = TRUE)
#>        xyz
#> 1    a_6_X
#> 2    1_b_7
#> 3  2_c_8_Y
#> 4    3_9_Z

data_unite(d, new_column = "x")
#>           x
#> 1  NA_a_6_X
#> 2  1_b_7_NA
#> 3   2_c_8_Y
#> 4  3_NA_9_Z
data_unite(d, new_column = "x", remove_na = TRUE)
#>          x
#> 1    a_6_X
#> 2    1_b_7
#> 3  2_c_8_Y
#> 4    3_9_Z

data_unite(d, new_column = "x", append = TRUE)
#> The name for `new_column` already exists as variable name in the data.
#>   This variable will be replaced by `new_column`.
#>           x    y z    m
#> 1  NA_a_6_X    a 6    X
#> 2  1_b_7_NA    b 7 <NA>
#> 3   2_c_8_Y    c 8    Y
#> 4  3_NA_9_Z <NA> 9    Z
data_unite(d, new_column = "x", remove_na = TRUE, append = TRUE)
#> The name for `new_column` already exists as variable name in the data.
#>   This variable will be replaced by `new_column`.
#>          x    y z    m
#> 1    a_6_X    a 6    X
#> 2    1_b_7    b 7 <NA>
#> 3  2_c_8_Y    c 8    Y
#> 4    3_9_Z <NA> 9    Z

data_unite(d, new_column = "xyz", append = TRUE)
#>    x    y z    m       xyz
#> 1 NA    a 6    X  NA_a_6_X
#> 2  1    b 7 <NA>  1_b_7_NA
#> 3  2    c 8    Y   2_c_8_Y
#> 4  3 <NA> 9    Z  3_NA_9_Z
data_unite(d, new_column = "xyz", remove_na = TRUE, append = TRUE)
#>    x    y z    m      xyz
#> 1 NA    a 6    X    a_6_X
#> 2  1    b 7 <NA>    1_b_7
#> 3  2    c 8    Y  2_c_8_Y
#> 4  3 <NA> 9    Z    3_9_Z

data_unite(d, new_column = "x2")
#>          x2
#> 1  NA_a_6_X
#> 2  1_b_7_NA
#> 3   2_c_8_Y
#> 4  3_NA_9_Z
data_unite(d, select = c("x", "z"), new_column = "new")
#>      y    m  new
#> 1    a    X NA_6
#> 2    b <NA>  1_7
#> 3    c    Y  2_8
#> 4 <NA>    Z  3_9
data_unite(d, select = c("x", "z"), new_column = "new", append = TRUE)
#>    x    y z    m  new
#> 1 NA    a 6    X NA_6
#> 2  1    b 7 <NA>  1_7
#> 3  2    c 8    Y  2_8
#> 4  3 <NA> 9    Z  3_9
data_unite(d, select = c("x", "z"), new_column = "new", append = TRUE, remove_na = TRUE)
#>    x    y z    m new
#> 1 NA    a 6    X   6
#> 2  1    b 7 <NA> 1_7
#> 3  2    c 8    Y 2_8
#> 4  3 <NA> 9    Z 3_9

data_unite(d, new_column = "xyz", separator = ".")
#>         xyz
#> 1  NA.a.6.X
#> 2  1.b.7.NA
#> 3   2.c.8.Y
#> 4  3.NA.9.Z

^{Created on 2023-05-25 with reprex v2.0.2}

strengejacke · 2023-05-26T15:25:28Z

@jmgirard data_unite() is implemented, feel free to test. data_separate() will follow in a separate (haha) PR.

Fixes #423

strengejacke added the Feature idea 🔥 label May 25, 2023

strengejacke self-assigned this May 25, 2023

strengejacke added a commit that referenced this issue May 25, 2023

datawizard versions of separate() and unite()

80e9ee4

Fixes #423

strengejacke mentioned this issue May 25, 2023

datawizard version of unite() #424

Merged

strengejacke closed this as completed in 49a4ea1 May 26, 2023

strengejacke reopened this May 26, 2023

etiennebacher mentioned this issue May 30, 2023

Release checklist for 0.8.0 #429

Closed

4 tasks

strengejacke added a commit that referenced this issue Jun 9, 2023

datawizard versions of separate() and unite()

100bbbe

Fixes #423

strengejacke mentioned this issue Jun 9, 2023

Implement data_separate() #431

Merged

strengejacke closed this as completed in #431 Jun 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

datawizard versions of separate() and unite() #423

datawizard versions of separate() and unite() #423

jmgirard commented May 25, 2023

etiennebacher commented May 25, 2023 •

edited

Loading

strengejacke commented May 25, 2023 •

edited

Loading

strengejacke commented May 26, 2023

datawizard versions of separate() and unite() #423

datawizard versions of separate() and unite() #423

Comments

jmgirard commented May 25, 2023

etiennebacher commented May 25, 2023 • edited Loading

strengejacke commented May 25, 2023 • edited Loading

strengejacke commented May 26, 2023

etiennebacher commented May 25, 2023 •

edited

Loading

strengejacke commented May 25, 2023 •

edited

Loading