Skip to content

Commit

Permalink
Add tests for RegExp modifiers
Browse files Browse the repository at this point in the history
  • Loading branch information
rbuckton authored and ptomato committed Mar 7, 2024
1 parent 9e03c40 commit 47b1f5e
Show file tree
Hide file tree
Showing 61 changed files with 3,284 additions and 0 deletions.
Original file line number Diff line number Diff line change
@@ -0,0 +1,46 @@
// Copyright 2023 Ron Buckton. All rights reserved.
// This code is governed by the BSD license found in the LICENSE file.

/*---
author: Ron Buckton
description: >
Adding dotAll (`s`) modifier does not affect RegExp instance `dotAll` property.
info: |
Runtime Semantics: CompileAtom
The syntax-directed operation CompileAtom takes arguments direction (forward or backward) and modifiers (a Modifiers Record) and returns a Matcher.
Atom :: `(` `?` RegularExpressionFlags `:` Disjunction `)`
1. Let addModifiers be the source text matched by RegularExpressionFlags.
2. Let removeModifiers be the empty String.
3. Let newModifiers be UpdateModifiers(modifiers, CodePointsToString(addModifiers), removeModifiers).
4. Return CompileSubpattern of Disjunction with arguments direction and newModifiers.
UpdateModifiers ( modifiers, add, remove )
The abstract operation UpdateModifiers takes arguments modifiers (a Modifiers Record), add (a String), and remove (a String) and returns a Modifiers. It performs the following steps when called:
1. Let dotAll be modifiers.[[DotAll]].
2. Let ignoreCase be modifiers.[[IgnoreCase]].
3. Let multiline be modifiers.[[Multiline]].
4. If add contains "s", set dotAll to true.
5. If add contains "i", set ignoreCase to true.
6. If add contains "m", set multiline to true.
7. If remove contains "s", set dotAll to false.
8. If remove contains "i", set ignoreCase to false.
9. If remove contains "m", set multiline to false.
10. Return the Modifiers Record { [[DotAll]]: dotAll, [[IgnoreCase]]: ignoreCase, [[Multiline]]: multiline }.
esid: sec-compileatom
features: [regexp-modifiers]
---*/

var re1 = /(?s:)/;
assert(!re1.dotAll, "RegExp instance dotAll flag should not be set");

var re2 = new RegExp("(?s:)");
assert(!re2.dotAll, "RegExp instance dotAll flag should not be set");

var re3 = /(?s-:)/;
assert(!re3.dotAll, "RegExp instance dotAll flag should not be set");

var re4 = new RegExp("(?s-:)");
assert(!re4.dotAll, "RegExp instance dotAll flag should not be set");
Original file line number Diff line number Diff line change
@@ -0,0 +1,64 @@
// Copyright 2023 Ron Buckton. All rights reserved.
// This code is governed by the BSD license found in the LICENSE file.

/*---
author: Ron Buckton
description: >
Adding dotAll (`s`) modifier in group should not affect ignoreCase (`i`) flag.
info: |
Runtime Semantics: CompileAtom
The syntax-directed operation CompileAtom takes arguments direction (forward or backward) and modifiers (a Modifiers Record) and returns a Matcher.
Atom :: `(` `?` RegularExpressionFlags `:` Disjunction `)`
1. Let addModifiers be the source text matched by RegularExpressionFlags.
2. Let removeModifiers be the empty String.
3. Let newModifiers be UpdateModifiers(modifiers, CodePointsToString(addModifiers), removeModifiers).
4. Return CompileSubpattern of Disjunction with arguments direction and newModifiers.
Atom :: `(` `?` RegularExpressionFlags `-` RegularExpressionFlags `:` Disjunction `)`
1. Let addModifiers be the source text matched by the first RegularExpressionFlags.
2. Let removeModifiers be the source text matched by the second RegularExpressionFlags.
3. Let newModifiers be UpdateModifiers(modifiers, CodePointsToString(addModifiers), CodePointsToString(removeModifiers)).
4. Return CompileSubpattern of Disjunction with arguments direction and newModifiers.
UpdateModifiers ( modifiers, add, remove )
The abstract operation UpdateModifiers takes arguments modifiers (a Modifiers Record), add (a String), and remove (a String) and returns a Modifiers. It performs the following steps when called:
1. Let dotAll be modifiers.[[DotAll]].
2. Let ignoreCase be modifiers.[[IgnoreCase]].
3. Let multiline be modifiers.[[Multiline]].
4. If add contains "s", set dotAll to true.
5. If add contains "i", set ignoreCase to true.
6. If add contains "m", set multiline to true.
7. If remove contains "s", set dotAll to false.
8. If remove contains "i", set ignoreCase to false.
9. If remove contains "m", set multiline to false.
10. Return the Modifiers Record { [[DotAll]]: dotAll, [[IgnoreCase]]: ignoreCase, [[Multiline]]: multiline }.
esid: sec-compileatom
features: [regexp-modifiers]
---*/

var re1 = /(?s:.es)/;
assert(re1.test("aes"), "s should match s in modified group");
assert(re1.test("\nes"), "s should match s in modified group");
assert(!re1.test("aeS"), "s should not match S in modified group");
assert(!re1.test("\neS"), "s should not match S in modified group");

var re2 = /(?s:.es)/i;
assert(re2.test("aes"), "s should match s in modified group");
assert(re2.test("aeS"), "s should match S in modified group");
assert(re2.test("\nes"), "s should match s in modified group");
assert(re2.test("\neS"), "s should match S in modified group");

var re3 = /(?s-:.es)/;
assert(re3.test("aes"), "s should match s in modified group");
assert(re3.test("\nes"), "s should match s in modified group");
assert(!re3.test("aeS"), "s should not match S in modified group");
assert(!re3.test("\neS"), "s should not match S in modified group");

var re4 = /(?s-:.es)/i;
assert(re4.test("aes"), "s should match s in modified group");
assert(re4.test("aeS"), "s should match S in modified group");
assert(re4.test("\nes"), "s should match s in modified group");
assert(re4.test("\neS"), "s should match S in modified group");
Original file line number Diff line number Diff line change
@@ -0,0 +1,56 @@
// Copyright 2023 Ron Buckton. All rights reserved.
// This code is governed by the BSD license found in the LICENSE file.

/*---
author: Ron Buckton
description: >
Adding dotAll (`s`) modifier in group should not affect multiline (`m`) flag.
info: |
Runtime Semantics: CompileAtom
The syntax-directed operation CompileAtom takes arguments direction (forward or backward) and modifiers (a Modifiers Record) and returns a Matcher.
Atom :: `(` `?` RegularExpressionFlags `:` Disjunction `)`
1. Let addModifiers be the source text matched by RegularExpressionFlags.
2. Let removeModifiers be the empty String.
3. Let newModifiers be UpdateModifiers(modifiers, CodePointsToString(addModifiers), removeModifiers).
4. Return CompileSubpattern of Disjunction with arguments direction and newModifiers.
Atom :: `(` `?` RegularExpressionFlags `-` RegularExpressionFlags `:` Disjunction `)`
1. Let addModifiers be the source text matched by the first RegularExpressionFlags.
2. Let removeModifiers be the source text matched by the second RegularExpressionFlags.
3. Let newModifiers be UpdateModifiers(modifiers, CodePointsToString(addModifiers), CodePointsToString(removeModifiers)).
4. Return CompileSubpattern of Disjunction with arguments direction and newModifiers.
UpdateModifiers ( modifiers, add, remove )
The abstract operation UpdateModifiers takes arguments modifiers (a Modifiers Record), add (a String), and remove (a String) and returns a Modifiers. It performs the following steps when called:
1. Let dotAll be modifiers.[[DotAll]].
2. Let ignoreCase be modifiers.[[IgnoreCase]].
3. Let multiline be modifiers.[[Multiline]].
4. If add contains "s", set dotAll to true.
5. If add contains "i", set ignoreCase to true.
6. If add contains "m", set multiline to true.
7. If remove contains "s", set dotAll to false.
8. If remove contains "i", set ignoreCase to false.
9. If remove contains "m", set multiline to false.
10. Return the Modifiers Record { [[DotAll]]: dotAll, [[IgnoreCase]]: ignoreCase, [[Multiline]]: multiline }.
esid: sec-compileatom
features: [regexp-modifiers]
---*/

var re1 = /(?s:.es$)/;
assert(re1.test("\nes"), ". should match newline in modified group");
assert(!re1.test("\nes\nz"), "$ should not match newline in modified group");

var re2 = /(?s:.es$)/m;
assert(re2.test("\nes"), ". should match newline in modified group");
assert(re2.test("\nes\nz"), "$ should match newline in modified group");

var re3 = /(?s-:.es$)/;
assert(re3.test("\nes"), ". should match newline in modified group");
assert(!re3.test("\nes\nz"), "$ should not match newline in modified group");

var re4 = /(?s-:.es$)/m;
assert(re4.test("\nes"), ". should match newline in modified group");
assert(re4.test("\nes\nz"), "$ should match newline in modified group");
102 changes: 102 additions & 0 deletions test/built-ins/RegExp/regexp-modifiers/add-dotAll.js
Original file line number Diff line number Diff line change
@@ -0,0 +1,102 @@
// Copyright 2023 Ron Buckton. All rights reserved.
// This code is governed by the BSD license found in the LICENSE file.

/*---
author: Ron Buckton
description: >
dotAll (`s`) modifier can be added via `(?s:)` or `(?s-:)`.
info: |
Runtime Semantics: CompileAtom
The syntax-directed operation CompileAtom takes arguments direction (forward or backward) and modifiers (a Modifiers Record) and returns a Matcher.
Atom :: `(` `?` RegularExpressionFlags `:` Disjunction `)`
1. Let addModifiers be the source text matched by RegularExpressionFlags.
2. Let removeModifiers be the empty String.
3. Let newModifiers be UpdateModifiers(modifiers, CodePointsToString(addModifiers), removeModifiers).
4. Return CompileSubpattern of Disjunction with arguments direction and newModifiers.
UpdateModifiers ( modifiers, add, remove )
The abstract operation UpdateModifiers takes arguments modifiers (a Modifiers Record), add (a String), and remove (a String) and returns a Modifiers. It performs the following steps when called:
1. Let dotAll be modifiers.[[DotAll]].
2. Let ignoreCase be modifiers.[[IgnoreCase]].
3. Let multiline be modifiers.[[Multiline]].
4. If add contains "s", set dotAll to true.
5. If add contains "i", set ignoreCase to true.
6. If add contains "m", set multiline to true.
7. If remove contains "s", set dotAll to false.
8. If remove contains "i", set ignoreCase to false.
9. If remove contains "m", set multiline to false.
10. Return the Modifiers Record { [[DotAll]]: dotAll, [[IgnoreCase]]: ignoreCase, [[Multiline]]: multiline }.
esid: sec-compileatom
features: [regexp-modifiers]
---*/

var re1 = /(?s:^.$)/;
assert(re1.test("a"), "Pattern character '.' should match non-line terminators in modified group");
assert(re1.test("3"), "Pattern character '.' should match non-line terminators in modified group");
assert(re1.test("π"), "Pattern character '.' should match non-line terminators in modified group");
assert(re1.test("\u2027"), "Pattern character '.' should match non-line terminators in modified group");
assert(re1.test("\u0085"), "Pattern character '.' should match non-line terminators in modified group");
assert(re1.test("\v"), "Pattern character '.' should match mon-line terminators in modified group");
assert(re1.test("\f"), "Pattern character '.' should match mon-line terminators in modified group");
assert(re1.test("\u180E"), "Pattern character '.' should match non-line terminators in modified group");
assert(!re1.test("\u{10300}"), "Supplementary plane not matched by a single .");
assert(re1.test("\n"), "Pattern character '.' should match line terminators in modified group");
assert(re1.test("\r"), "Pattern character '.' should match line terminators in modified group");
assert(re1.test("\u2028"), "Pattern character '.' should match line terminators in modified group");
assert(re1.test("\u2029"), "Pattern character '.' should match line terminators in modified group");
assert(re1.test("\uD800"), "Pattern character '.' should match non-line terminators in modified group");
assert(re1.test("\uDFFF"), "Pattern character '.' should match non-line terminators in modified group");

var re2 = new RegExp("(?s:^.$)");
assert(re2.test("a"), "Pattern character '.' should match non-line terminators in modified group");
assert(re2.test("3"), "Pattern character '.' should match non-line terminators in modified group");
assert(re2.test("π"), "Pattern character '.' should match non-line terminators in modified group");
assert(re2.test("\u2027"), "Pattern character '.' should match non-line terminators in modified group");
assert(re2.test("\u0085"), "Pattern character '.' should match non-line terminators in modified group");
assert(re2.test("\v"), "Pattern character '.' should match mon-line terminators in modified group");
assert(re2.test("\f"), "Pattern character '.' should match mon-line terminators in modified group");
assert(re2.test("\u180E"), "Pattern character '.' should match non-line terminators in modified group");
assert(!re2.test("\u{10300}"), "Supplementary plane not matched by a single .");
assert(re2.test("\n"), "Pattern character '.' should match line terminators in modified group");
assert(re2.test("\r"), "Pattern character '.' should match line terminators in modified group");
assert(re2.test("\u2028"), "Pattern character '.' should match line terminators in modified group");
assert(re2.test("\u2029"), "Pattern character '.' should match line terminators in modified group");
assert(re2.test("\uD800"), "Pattern character '.' should match non-line terminators in modified group");
assert(re2.test("\uDFFF"), "Pattern character '.' should match non-line terminators in modified group");

var re3 = /(?s-:^.$)/;
assert(re3.test("a"), "Pattern character '.' should match non-line terminators in modified group");
assert(re3.test("3"), "Pattern character '.' should match non-line terminators in modified group");
assert(re3.test("π"), "Pattern character '.' should match non-line terminators in modified group");
assert(re3.test("\u2027"), "Pattern character '.' should match non-line terminators in modified group");
assert(re3.test("\u0085"), "Pattern character '.' should match non-line terminators in modified group");
assert(re3.test("\v"), "Pattern character '.' should match mon-line terminators in modified group");
assert(re3.test("\f"), "Pattern character '.' should match mon-line terminators in modified group");
assert(re3.test("\u180E"), "Pattern character '.' should match non-line terminators in modified group");
assert(!re3.test("\u{10300}"), "Supplementary plane not matched by a single .");
assert(re3.test("\n"), "Pattern character '.' should match line terminators in modified group");
assert(re3.test("\r"), "Pattern character '.' should match line terminators in modified group");
assert(re3.test("\u2028"), "Pattern character '.' should match line terminators in modified group");
assert(re3.test("\u2029"), "Pattern character '.' should match line terminators in modified group");
assert(re3.test("\uD800"), "Pattern character '.' should match non-line terminators in modified group");
assert(re3.test("\uDFFF"), "Pattern character '.' should match non-line terminators in modified group");

var re4 = new RegExp("(?s-:^.$)");
assert(re4.test("a"), "Pattern character '.' should match non-line terminators in modified group");
assert(re4.test("3"), "Pattern character '.' should match non-line terminators in modified group");
assert(re4.test("π"), "Pattern character '.' should match non-line terminators in modified group");
assert(re4.test("\u2027"), "Pattern character '.' should match non-line terminators in modified group");
assert(re4.test("\u0085"), "Pattern character '.' should match non-line terminators in modified group");
assert(re4.test("\v"), "Pattern character '.' should match mon-line terminators in modified group");
assert(re4.test("\f"), "Pattern character '.' should match mon-line terminators in modified group");
assert(re4.test("\u180E"), "Pattern character '.' should match non-line terminators in modified group");
assert(!re4.test("\u{10300}"), "Supplementary plane not matched by a single .");
assert(re4.test("\n"), "Pattern character '.' should match line terminators in modified group");
assert(re4.test("\r"), "Pattern character '.' should match line terminators in modified group");
assert(re4.test("\u2028"), "Pattern character '.' should match line terminators in modified group");
assert(re4.test("\u2029"), "Pattern character '.' should match line terminators in modified group");
assert(re4.test("\uD800"), "Pattern character '.' should match non-line terminators in modified group");
assert(re4.test("\uDFFF"), "Pattern character '.' should match non-line terminators in modified group");
Original file line number Diff line number Diff line change
@@ -0,0 +1,52 @@
// Copyright 2023 Ron Buckton. All rights reserved.
// This code is governed by the BSD license found in the LICENSE file.

/*---
author: Ron Buckton
description: >
Adding ignoreCase (`i`) modifier in group affects backreferences in group.
info: |
Runtime Semantics: CompileAtom
The syntax-directed operation CompileAtom takes arguments direction (forward or backward) and modifiers (a Modifiers Record) and returns a Matcher.
Atom :: `(` `?` RegularExpressionFlags `:` Disjunction `)`
1. Let addModifiers be the source text matched by RegularExpressionFlags.
2. Let removeModifiers be the empty String.
3. Let newModifiers be UpdateModifiers(modifiers, CodePointsToString(addModifiers), removeModifiers).
4. Return CompileSubpattern of Disjunction with arguments direction and newModifiers.
Atom :: `(` `?` RegularExpressionFlags `-` RegularExpressionFlags `:` Disjunction `)`
1. Let addModifiers be the source text matched by the first RegularExpressionFlags.
2. Let removeModifiers be the source text matched by the second RegularExpressionFlags.
3. Let newModifiers be UpdateModifiers(modifiers, CodePointsToString(addModifiers), CodePointsToString(removeModifiers)).
4. Return CompileSubpattern of Disjunction with arguments direction and newModifiers.
UpdateModifiers ( modifiers, add, remove )
The abstract operation UpdateModifiers takes arguments modifiers (a Modifiers Record), add (a String), and remove (a String) and returns a Modifiers. It performs the following steps when called:
1. Let dotAll be modifiers.[[DotAll]].
2. Let ignoreCase be modifiers.[[IgnoreCase]].
3. Let multiline be modifiers.[[Multiline]].
4. If add contains "s", set dotAll to true.
5. If add contains "i", set ignoreCase to true.
6. If add contains "m", set multiline to true.
7. If remove contains "s", set dotAll to false.
8. If remove contains "i", set ignoreCase to false.
9. If remove contains "m", set multiline to false.
10. Return the Modifiers Record { [[DotAll]]: dotAll, [[IgnoreCase]]: ignoreCase, [[Multiline]]: multiline }.
esid: sec-compileatom
features: [regexp-modifiers]
---*/

var re1 = /(a)(?i:\1)/;
assert(!re1.test("AA"), "a should not match first A");
assert(!re1.test("Aa"), "a should not match A");
assert(re1.test("aa"), "a matches first a, so \\1 should match second a");
assert(re1.test("aA"), "a matches a, so \\1 should match A (ignores case)");

var re2 = /(a)(?i-:\1)/;
assert(!re2.test("AA"), "a should not match first A");
assert(!re2.test("Aa"), "a should not match A");
assert(re2.test("aa"), "a matches first a, so \\1 should match second a");
assert(re2.test("aA"), "a matches a, so \\1 should match A (ignores case)");
Loading

0 comments on commit 47b1f5e

Please sign in to comment.