-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
tidyselect + where VS if_any/if_all, and error messages #5732
Comments
🤔 I believe you are led on a wrong paths, and that We tried to have |
This is a tricky one, perhaps Perhaps the default for library(dplyr, warn.conflicts = FALSE)
library(palmerpenguins)
penguins %>%
filter(if_any(.fns = ~.x == 39.1))
#> # A tibble: 1 x 8
#> species island bill_length_mm bill_depth_mm flipper_length_… body_mass_g sex
#> <fct> <fct> <dbl> <dbl> <int> <int> <fct>
#> 1 Adelie Torge… 39.1 18.7 181 3750 male
#> # … with 1 more variable: year <int> Created on 2021-02-17 by the reprex package (v0.3.0) Perhaps Perhaps the call to Or perhaps the documentation should include some examples to explain that the initial code is off label ? |
I agree that it's slightly confusing that the error suggests
Perhaps if a column is not indicated, the error could suggest something like you mentioned above:
|
It's going to be hard to give a more specific error message for this case, but I wonder if in this error: library(dplyr)
mtcars %>%
filter(if_any(~.x == 39.1))
#> Error: Problem with `filter()` input `..1`.
#> x Formula shorthand must be wrapped in `where()`.
#>
#> # Bad
#> data %>% select(~.x == 39.1)
#>
#> # Good
#> data %>% select(where(~.x == 39.1))
#> ℹ Input `..1` is `if_any(~.x == 39.1)`. We could at least not mention
Or even just suppressing the error would be better than the current behaviour. That at least gets you to: mtcars %>%
filter(if_any(where(~.x == 39.1)))
#> Error: Problem with `filter()` input `..1`.
#> x `where()` must be used with functions that return `TRUE` or `FALSE`.
#> ℹ Input `..1` is `if_any(where(~.x == 39.1))`. And that message really needs to emphasise a "single" I think that's the best we can do; this is the downside NSE. |
* try an instrumenting approach closes #5732 * try not to instrument error but detect misplaced ~ * only do it in if_any() and if_all() * across() also aborts on formula .cols * Update R/across.R Co-authored-by: Lionel Henry <[email protected]> * mention next step * simplify error * + test * rebase hickup * use context_local() / context_peek() * using expect_snapshot() + expect_error() combo * full message with expanded if_any()/if_all() Co-authored-by: Lionel Henry <[email protected]>
Number one, extremely happy to have the new features for
filter
and/ormutate()
withif_any
/if_all
!!! This feature was not something I expected but am so excited to have!!!🥳
There is one problem I'd like to raise and one question:
Problem: The error message for using
if_any()
without a column selector results in some potentially bad recommendations (related to my question as well).Question: What is the "correct" way of
select
-ing columns based on values within that column?While I understand the use of
if_any
withfilter()
, I'm also interested in the similar option forselect()
. when cleaning data or working with "wide" data that I'm putting into a table I fairly often want toselect()
specific columns that contain a specific value/string/range/etc. IE just knowing it's numeric or text is not sufficient, but rather I'm interested in specifics of the column's contents.I have a reprex below that shows some of my testing steps in trying to answer this question.
Created on 2021-02-03 by the reprex package (v0.3.0)
The text was updated successfully, but these errors were encountered: