-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: Add regexp_count function #12970
Conversation
…v/xinli/regexp_count # Conflicts: # datafusion/functions/benches/regx.rs # docs/source/user-guide/sql/scalar_functions.md
…re/regexp_count # Conflicts: # datafusion/functions/src/regex/regexpcount.rs
# Conflicts: # datafusion/functions/src/regex/mod.rs # docs/source/user-guide/sql/scalar_functions.md
Note this PR does not resolve the performance degradation seen in this function as compared to regexp_match or regexp_like. (see #12080 (comment) for a benchmark) I believe that should be looked at in a followup ticket. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you @Omega359 -- this looks great. I will merge it in and file a follow on ticket for enhancing its performance
FYI @xinlifoobar |
Filed #13011 |
Which issue does this PR close?
Closes #12079 and part of #11946. Followup to PR #12080 by #xinlifoobar with some additional work (documentation, minor fixes, scalar test fixes).
Rationale for this change
Add an additional useful regexp function to datafusion
What changes are included in this PR?
Code, tests, documentation.
Are these changes tested?
Yes.
Are there any user-facing changes?
Docs updated to include the new udf.