`<regex>`: Perform simplified stack unwinding for lookahead assertions when the asserted pattern matches #5835

muellerj2 · 2025-11-09T00:10:07Z

This implements simplified backtracking for the case when the pattern of a lookahead assertion matches. It's kind of the equivalent of #5828 for lookahead assertions, though it's more complicated while being much less practically relevant. But I need this for the next PRs that will greatly reduce the number of allocations the matcher performs.

When the pattern in a lookahead assertion matches, we know that the lookahead assertion as a whole succeeded (positive lookahead) or failed (negative lookahead). We can then mostly skip the stack unwinding up until the stack frame that was pushed at the start of the lookahead assertion, except for the effects these stack frames have on the stack counter, because no stack unwinding opcode translates does any other work when a pattern matched in ECMAScript mode (and ECMAScript is the only regex grammar that supports lookahead assertions). Much of the work at the end of a lookahead assertion is now also handled when processing the `_N_end_assert` node and no longer when processing the unwinding opcodes `_After_assert` and `_After_neg_assert`.
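As a rough illustration of the idea in the paragraph above (a toy model, not the actual `<regex>` code), the following sketch pops frames down to the assertion's own frame and only adjusts a usage counter on the way. `ToyOpcode::Other`, `ToyFrame`, `cost`, `ToyMatcher`, `push`, and `end_assert` are hypothetical names; only `After_assert` and `After_neg_assert` echo the opcodes named in the description, and the notion that each frame contributes a fixed amount to the counter is an assumption made for the sketch.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical opcodes; only the two assertion names echo the PR description.
enum class ToyOpcode { After_assert, After_neg_assert, Other };

struct ToyFrame {
    ToyOpcode op;
    std::size_t cost; // assumed: how much this frame added to the usage counter
};

struct ToyMatcher {
    std::vector<ToyFrame> frames;
    std::size_t usage_count = 0;

    void push(ToyOpcode op, std::size_t cost) {
        frames.push_back({op, cost});
        usage_count += cost;
    }

    // Called once the asserted pattern has matched. Pops every frame down to
    // and including the nearest assertion frame, undoing only its effect on
    // the counter. Returns true for a positive assertion (it succeeded) and
    // false for a negative one (it failed).
    bool end_assert() {
        while (!frames.empty()) {
            const ToyFrame f = frames.back();
            frames.pop_back();
            usage_count -= f.cost; // the only per-frame work in this model
            if (f.op == ToyOpcode::After_assert) {
                return true;
            }
            if (f.op == ToyOpcode::After_neg_assert) {
                return false;
            }
        }
        assert(false && "no assertion frame on the stack");
        return false;
    }
};
```

In this model, frames pushed before the assertion started are left untouched, so matching (or ordinary backtracking) can continue from there.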

You might notice that we could actually avoid the new loop in `_N_end_assert` if we kept track of the stack usage counts and the positions of the `_After_assert` and `_After_neg_assert` stack frames. But I will have to add a variant of this loop in the PR after the next one anyway, so it doesn't seem worth it to spend much effort on avoiding this loop.
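For comparison, here is a minimal sketch of what the loop-free alternative mentioned above might look like in a toy model. It is not what this PR does, and every name in it (`AssertMark`, `MarkedMatcher`, `push_frame`, `begin_assert`, `end_assert`) is hypothetical. The sketch records the frame position and the counter value when an assertion starts, then restores both in one step when the asserted pattern matches.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical bookkeeping record, one per currently open assertion.
struct AssertMark {
    std::size_t frame_pos;    // index at which the assertion's frame was pushed
    std::size_t usage_before; // counter value just before that push
    bool negated;             // true for a negative lookahead
};

struct MarkedMatcher {
    std::vector<std::size_t> frame_costs; // one assumed cost per stack frame
    std::vector<AssertMark> marks;
    std::size_t usage_count = 0;

    void push_frame(std::size_t cost) {
        frame_costs.push_back(cost);
        usage_count += cost;
    }

    void begin_assert(bool negated, std::size_t cost) {
        marks.push_back({frame_costs.size(), usage_count, negated});
        push_frame(cost);
    }

    // Called once the asserted pattern has matched: truncate the frame stack
    // and restore the counter without visiting individual frames.
    bool end_assert() {
        assert(!marks.empty());
        const AssertMark mark = marks.back();
        marks.pop_back();
        frame_costs.resize(mark.frame_pos);
        usage_count = mark.usage_before;
        return !mark.negated;
    }
};
```

The sketch leaves out the case where the asserted pattern fails and the assertion's frame is popped by ordinary unwinding; a real design would also have to keep `marks` in sync there.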

StephanTLavavej · 2025-11-10T23:38:31Z

> because no stack unwinding opcode translates does any other work when a pattern matched in ECMAScript mode

I can't quite parse this - was "translates" a spurious word introduced during editing?

stl/inc/regex

StephanTLavavej · 2025-11-11T20:09:30Z

I'm mirroring this to the MSVC-internal repo - please notify me if any further changes are pushed.

StephanTLavavej · 2025-11-12T17:07:13Z

💚 😻 🎉

`<regex>`: Perform simplified stack unwinding for lookahead assertions when the pattern matches

aeecd09

muellerj2 requested a review from a team as a code owner November 9, 2025 00:10

github-project-automation bot added this to STL Code Reviews Nov 9, 2025

github-project-automation bot moved this to Initial Review in STL Code Reviews Nov 9, 2025

StephanTLavavej added the enhancement and regex labels Nov 9, 2025

StephanTLavavej self-assigned this Nov 9, 2025

StephanTLavavej added 2 commits November 10, 2025 15:40

Remove unnecessary scope.

cb3c283

Make wrapping prettier.

685121b

StephanTLavavej reviewed Nov 10, 2025

stl/inc/regex (outdated, resolved)

stl/inc/regex (outdated, resolved)

StephanTLavavej approved these changes Nov 10, 2025

StephanTLavavej removed their assignment Nov 10, 2025

StephanTLavavej moved this from Initial Review to Ready To Merge in STL Code Reviews Nov 10, 2025

StephanTLavavej moved this from Ready To Merge to Merging in STL Code Reviews Nov 11, 2025

StephanTLavavej merged commit d806de4 into microsoft:main Nov 12, 2025
41 checks passed

github-project-automation bot moved this from Merging to Done in STL Code Reviews Nov 12, 2025

muellerj2 mentioned this pull request Nov 13, 2025

<regex>: Remove capture extent vectors from stack frames #5865

Merged
