@RyanL1997
Collaborator

Description

Manual backport

Related Issues

Check List

  • New functionality includes testing.
  • New functionality has been documented.
  • New functionality has javadoc added.
  • New functionality has a user manual doc added.
  • New PPL command checklist all confirmed.
  • API changes companion pull request created.
  • Commits are signed per the DCO using --signoff or -s.
  • Public documentation issue/PR created.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

…ject#4109)

* rex - initial implementation

Signed-off-by: Jialiang Liang <[email protected]>

* stop using utils

Signed-off-by: Jialiang Liang <[email protected]>

* fix spotless check

Signed-off-by: Jialiang Liang <[email protected]>

* offset_field - initial implementation

Signed-off-by: Jialiang Liang <[email protected]>

* max_match - initial implementation

Signed-off-by: Jialiang Liang <[email protected]>

* sed - initial implementation

Signed-off-by: Jialiang Liang <[email protected]>

* fix name capture group for extraction

Signed-off-by: Jialiang Liang <[email protected]>

* add rex rst doc

Signed-off-by: Jialiang Liang <[email protected]>

* IT - initial setup

Signed-off-by: Jialiang Liang <[email protected]>

* add an analyzer test for legacy engine

Signed-off-by: Jialiang Liang <[email protected]>

* Add UT for rex

Signed-off-by: Jialiang Liang <[email protected]>

* sed - add pushdown for sed and explain IT and IT with fix

Signed-off-by: Jialiang Liang <[email protected]>

* anonymizer - add rex for anonymizer and test

Signed-off-by: Jialiang Liang <[email protected]>

* Add cross cluster IT for rex

Signed-off-by: Jialiang Liang <[email protected]>

* peng - resolve comments for rst doc 0

Signed-off-by: Jialiang Liang <[email protected]>

* peng - address some comments 1

Signed-off-by: Jialiang Liang <[email protected]>

* peng - resolve comment in rst doc to add a java doc link

Signed-off-by: Jialiang Liang <[email protected]>

* kai - modify the bin ast builder test

Signed-off-by: Jialiang Liang <[email protected]>

* peng - fix the extraction behavior without filter even when there is zero match

Signed-off-by: Jialiang Liang <[email protected]>

* fix rex explain no pushdown

Signed-off-by: Jialiang Liang <[email protected]>

* change the offset val output format

Signed-off-by: Jialiang Liang <[email protected]>

* fix rst file

Signed-off-by: Jialiang Liang <[email protected]>

* peng - SWITCH TO USE CALCITE NATIVE OPERATORS

Signed-off-by: Jialiang Liang <[email protected]>

* Peng - fix tests after operator change

Signed-off-by: Jialiang Liang <[email protected]>

* support mode=extract and update doc

Signed-off-by: Jialiang Liang <[email protected]>

* fix the issue after rebase

Signed-off-by: Jialiang Liang <[email protected]>

* peng - enforce specifying field in antlr for now

Signed-off-by: Jialiang Liang <[email protected]>

* relocate rex cmd IT

Signed-off-by: Jialiang Liang <[email protected]>

* peng - simplify visitFunction

Signed-off-by: Jialiang Liang <[email protected]>

* peng - add UT for RexExtractMultiFunction

Signed-off-by: Jialiang Liang <[email protected]>

* peng - add UT RexOffsetFunction

Signed-off-by: Jialiang Liang <[email protected]>

* fix some tests

Signed-off-by: Jialiang Liang <[email protected]>

* DECOUPLE SED + OFFSET FIELD

Signed-off-by: Jialiang Liang <[email protected]>

* Improve error handling for extract

Signed-off-by: Jialiang Liang <[email protected]>

* add this rex rst into index

Signed-off-by: Jialiang Liang <[email protected]>

* fix return type in extract multi

Signed-off-by: Jialiang Liang <[email protected]>

* add rex doc into doc test

Signed-off-by: Jialiang Liang <[email protected]>

* fix doc test

Signed-off-by: Jialiang Liang <[email protected]>

* Fix linting

Signed-off-by: Jialiang Liang <[email protected]>

* fix rebase issue

Signed-off-by: Jialiang Liang <[email protected]>

* fix regex anonymizer tests

Signed-off-by: Jialiang Liang <[email protected]>

* fix analyzer test and setup to use util function

Signed-off-by: Jialiang Liang <[email protected]>

* lint fix

Signed-off-by: Jialiang Liang <[email protected]>

* fix doc test

Signed-off-by: Jialiang Liang <[email protected]>

* Add max match limit implementation

Signed-off-by: Jialiang Liang <[email protected]>

* fix anonymizer test

Signed-off-by: Jialiang Liang <[email protected]>

* peng - simplify if

Signed-off-by: Jialiang Liang <[email protected]>

* peng - make extract multi to only handle the case of max_match > 1

Signed-off-by: Jialiang Liang <[email protected]>

---------

Signed-off-by: Jialiang Liang <[email protected]>
ykmr1224 previously approved these changes Sep 5, 2025
@RyanL1997
Collaborator Author

The doc tests failed because the sample data used for doctest on this branch differs from the data used on main.

Signed-off-by: Jialiang Liang <[email protected]>
Signed-off-by: Jialiang Liang <[email protected]>
@Swiddis Swiddis merged commit f658d5b into opensearch-project:2.19-dev Sep 9, 2025
42 checks passed

4 participants