Implement `filter_out()` #7775

DavisVaughan · 2025-11-25T19:30:22Z

Closes #6560
Closes #6891

BECAUSE THIS IS HARD Y'ALL

DavisVaughan · 2025-11-25T19:34:52Z

NEWS.md

+* New experimental `filter_out()` companion to `filter()`.
+
+  * Use `filter()` when specifying rows to _keep_.
+
+  * Use `filter_out()` when specifying rows to _drop_.
+
+  `filter_out()` simplifies cases where you would have previously used a `filter()` to drop rows. It is particularly useful when missing values are involved. For example, to drop rows where the `count` is zero:
+
+  ```r
+  df |> filter(count != 0 | is.na(count))
+
+  df |> filter_out(count == 0)
+  ```
+
+  With `filter()`, you must provide a "negative" condition of `!= 0` and must explicitly guard against accidentally dropping rows with `NA`. With `filter_out()`, you directly specify rows to drop and you don't have to guard against dropping rows with `NA`, which tends to result in much clearer code.
+
+  This work is a result of [Tidyup 8: Expanding the `filter()` family](https://github.com/tidyverse/tidyups/pull/30), with a lot of great feedback from the community (#6560, #6891).
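
To make the `NA` handling described in the NEWS bullet concrete, here is a minimal base R sketch of the logical vectors involved. It does not call dplyr; it only assumes the semantics stated above, namely that a keep-style filter retains a row only when the condition is `TRUE`, and a drop-style filter removes a row only when the condition is `TRUE`. The variable names are illustrative.

```r
count <- c(0, 2, NA, 5)

# Comparisons with NA yield NA, and a keep-style filter treats NA as "do not keep".
negated <- count != 0                  # FALSE TRUE NA   TRUE
guarded <- count != 0 | is.na(count)   # FALSE TRUE TRUE TRUE

# A drop-style filter removes only rows where the condition is exactly TRUE,
# so the NA row survives without an explicit is.na() guard.
to_drop  <- count == 0                 # TRUE  FALSE NA   FALSE
survives <- !(to_drop %in% TRUE)       # FALSE TRUE  TRUE TRUE
```

Under these assumptions, `survives` matches `guarded`, which is why the `filter_out(count == 0)` form needs no `is.na()` clause.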


DavisVaughan · 2025-11-25T19:37:43Z

R/filter.R

+#' @rdname filter
+#' @export
+filter_out <- function(.data, ..., .by = NULL, .preserve = FALSE) {
+  check_by_typo(...)
+  check_not_both_by_and_preserve({{ .by }}, .preserve)
+  UseMethod("filter_out")
+}


The actual implementation is quite simple:

- Extract out a common `filter_impl()`
- Provide `.verb = "filter"` or `.verb = "filter_out"`
- If `filter_out`, provide `invert = TRUE` to the C implementation of filter, which inverts the final result on the way out
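
The shape described above can be sketched in plain R roughly as follows. This is an illustration under stated assumptions, not the code in this PR: `filter_impl_sketch()` and its arguments are hypothetical names, the condition is passed in as an already-evaluated logical vector, and the inversion happens in R here rather than in the compiled code.

```r
# Hypothetical shared helper: both verbs compute the same `keep` vector,
# and the "filter_out" path flips it just before subsetting.
filter_impl_sketch <- function(data, cond, verb = c("filter", "filter_out")) {
  verb <- match.arg(verb)
  invert <- identical(verb, "filter_out")

  # A row matches only when the condition is TRUE; NA counts as "no match".
  keep <- !is.na(cond) & cond

  # Invert the final result on the way out, after NA has been resolved.
  if (invert) {
    keep <- !keep
  }

  data[keep, , drop = FALSE]
}

df <- data.frame(x = c(1, NA, 3))
kept <- filter_impl_sketch(df, df$x > 2, verb = "filter")      # row with x = 3
rest <- filter_impl_sketch(df, df$x > 2, verb = "filter_out")  # rows with x = 1 and x = NA
```

Inverting only after `NA` has been resolved to "no match" is what lets rows with a missing condition survive the drop-style call in this sketch.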

DavisVaughan · 2025-11-25T19:39:12Z

R/filter.R

@@ -1,17 +1,108 @@
-#' Keep rows that match a condition


Did a full doc overhaul. Things worth spending time on:

- The Description section
  - Note that I've declared `filter_out()` as experimental
- `@section Missing values:`
- Examples

DavisVaughan · 2025-11-25T19:39:44Z

src/filter.cpp

+  if (LOGICAL_ELT(invert, 0)) {
+    for (R_xlen_t i = 0; i < n; ++i) {
+      p_keep[i] = !p_keep[i];
+    }
+  }


DavisVaughan · 2025-11-25T19:40:52Z

tests/testthat/test-filter.R

I looked carefully at all `filter()` tests and added a corresponding `filter_out()` test wherever the test wasn't too niche and checked some kind of invariant (0-row behavior, no-input behavior, etc.).
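
One invariant of this kind is that, for the same condition, the rows kept and the rows dropped partition the input, including when the input has zero rows. A minimal base R sketch of such a check follows. It uses neither testthat nor dplyr, and `keep_idx()`, `drop_idx()`, and `is_partition()` are hypothetical helpers that assume `NA` conditions count as not matching.

```r
# Row indices a keep-style filter would retain.
keep_idx <- function(cond) which(!is.na(cond) & cond)

# Row indices a drop-style filter would retain (everything not matched).
drop_idx <- function(cond) which(!(!is.na(cond) & cond))

# TRUE when the two index sets are disjoint and together cover every row.
is_partition <- function(cond) {
  kept <- keep_idx(cond)
  rest <- drop_idx(cond)
  length(intersect(kept, rest)) == 0L &&
    identical(sort(c(kept, rest)), seq_len(length(cond)))
}

ok_mixed <- is_partition(c(TRUE, FALSE, NA))
ok_empty <- is_partition(logical(0))   # zero-row input
ok_na    <- is_partition(c(NA, NA))    # every condition missing
```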

DavisVaughan · 2025-11-25T19:41:40Z

vignettes/colwise.Rmd


 `across()` doesn't work with `select()` or `rename()` because they already use tidy select syntax; if you want to transform column names with a function, you can use  `rename_with()`.

-### filter()


Tweaked the vignettes only where it felt like using `filter_out()` was a noticeable improvement.

DavisVaughan added 13 commits November 25, 2025 14:30

- Implement filter_out() (92cf882)
- Document filter_out() (822c679)
- Revise examples (dee6b99)
- Overhaul tests to include filter_out() (3c1c486)
- Add to list of .by supported verbs (f40e8e0)
- Mention in extending dplyr docs (584fdc0)
- Mention in across() example (be08beb)
- Test alongside pick() (443eb14)
- Test alongside rowwise() (c79eb82)
- Use filter_out() in a few vignettes (ce87a10)
- Mark filter_out() as experimental (4901aa4)
- NEWS bullet (2e377e7)
- Add complement test (3155f84)

DavisVaughan force-pushed the feature/filter-out-2 branch from ea193ab to 3155f84 on November 25, 2025 19:31

Fix NEWS example

6054744

BECAUSE THIS IS HARD Y'ALL


DavisVaughan requested a review from hadley November 25, 2025 19:44

DavisVaughan mentioned this pull request Nov 26, 2025

Implement when_any() and when_all() #7777

Open
