-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
rows_patch(x, y, ...)
shows unhelpful error message when nrow(y) > nrow(x)
#5699
Comments
rows_patch(x, y, ...)
shows unhelpful error message when nrow(y) > nrow(x)
I think the error is good. The code attempts to patch rows that don't exist on the left-hand side, this is an error for x = data.frame(id = c(1, 2, 3), foo = c("a", "b", NA))
y = data.frame(id = c(1, 3, 5, 7), foo = c("a", "c", "e", "g"))
dplyr::rows_upsert(x, y, by = "id")
#> id foo
#> 1 1 a
#> 2 2 b
#> 3 3 c
#> 4 5 e
#> 5 7 g Created on 2021-06-24 by the reprex package (v2.0.0) |
I'm commenting here, as its the cleanest place I can see, rather than in the PR that closes it. I'm having a bit of hard time wrapping my head around the erroring up At least, that is a feature that is very meaningful to me. |
We typically recommend that you open a new issue that references the old one rather than adding on to conversation of an already old issue. That is much easier for us to track! Also, you can do that in the dev version with library(dplyr)
x <- tibble(a = 1:3, b = letters[c(1:2, NA)], c = 0.5 + 0:2)
x
#> # A tibble: 3 × 3
#> a b c
#> <int> <chr> <dbl>
#> 1 1 a 0.5
#> 2 2 b 1.5
#> 3 3 <NA> 2.5
y <- tibble(a = 3:4, b = "z")
y
#> # A tibble: 2 × 2
#> a b
#> <int> <chr>
#> 1 3 z
#> 2 4 z
rows_patch(x, y, by = "a")
#> Error in `rows_patch()`:
#> ! `y` must contain keys that already exist in `x`.
#> ℹ The following rows in `y` have keys that don't exist in `x`: `c(2)`.
#> ℹ Use `unmatched = "ignore"` if you want to ignore these `y` rows.
#> Backtrace:
#> ▆
#> 1. ├─dplyr::rows_patch(x, y, by = "a")
#> 2. └─dplyr:::rows_patch.data.frame(x, y, by = "a") at dplyr/R/rows.R:249:2
#> 3. └─dplyr:::rows_check_y_unmatched(x_key, y_key, unmatched) at dplyr/R/rows.R:281:2
#> 4. └─rlang::abort(message, call = error_call) at dplyr/R/rows.R:578:6
rows_patch(x, y, by = "a", unmatched = "ignore")
#> # A tibble: 3 × 3
#> a b c
#> <int> <chr> <dbl>
#> 1 1 a 0.5
#> 2 2 b 1.5
#> 3 3 z 2.5 Created on 2022-11-11 with reprex v2.0.2.9000 |
Sorry, my bad. Will remember in the future. Thanks! This is the functionality I was after. |
When I try to patch a data frame (
x
in the below example) using another data frame with more rows (y
), an error is thrown with an unhelpful message. It took some time to figure out that the row number difference was the source of the error.rlang::last_error()
andrlang::last_trace()
didn't help.Is there any reason a data frame should not be patched by a data frame with more rows? I could be missing some theory here. If there is a good reason why this should not be allowed, can I add that to the documentation for
rows_patch()
and friends?Example:
Created on 2021-01-20 by the reprex package (v0.3.0)
Session info
Expected output:
The text was updated successfully, but these errors were encountered: