Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Std.regex: incorrect values of look-around captures #10662

Open
Viorel opened this issue Mar 5, 2025 · 1 comment
Open

Std.regex: incorrect values of look-around captures #10662

Viorel opened this issue Mar 5, 2025 · 1 comment

Comments

@Viorel
Copy link

Viorel commented Mar 5, 2025

The next example, checked with dmd 2.109.1, uses a combination of lookbehind and lookahead assertions:

import std.stdio;
import std.regex;

void main()
{
    auto re = regex(r"(?<=(..)(?=(..)))..cde");
    auto captures = std.regex.matchFirst("12345abcde", re);
    writeln(captures[0]); // "abcde" as expected
    writeln(captures[1]); // "45" as expected
    writeln(captures[2]); // nothing, but "ab" is expected
}

The value of captures[2] should be “ab”, but it is null. (Other prominent engines, in various languages, give correct results).

According to documentation, the std.regex library should support “arbitrary length and complexity lookbehind, including lookahead in lookbehind and vice-versa”.

The modified patterns, such as (?<=(..))(?=(..))..cde, seem to work correctly.

@0xEAB
Copy link
Member

0xEAB commented Mar 8, 2025

Verified with regex101, we’re apparently the outlier here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants