perf(lexer): remove branches from unicode handling by overlookmotel · Pull Request #15328 · oxc-project/oxc

overlookmotel · 2025-11-05T14:24:30Z

#12768 split `next_char`, `next_2_chars`, and `peek_char` into separate functions for the hot and cold paths.

That was a good change, but it had one side effect: because the Unicode branch is now in a separate function which isn't inlined, the compiler loses knowledge of the context, namely that `Source` isn't at EOF and that (in 2 of the 3 methods) the next character is known not to be ASCII.

Add unchecked assertions to inform the compiler of these known facts, so it can remove 2 branches when calling `chars.next().unwrap()`.

This code is on a cold path, so it will likely not make a noticeable difference in files which don't contain many Unicode chars (like our benchmark fixtures), but why not?
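
To illustrate the idea in the description, here is a minimal sketch of a hot/cold split in which the cold function restates what the caller already knows. It is not the code from `source.rs`: the free functions, their signatures, and the use of `std::hint::assert_unchecked` (rather than oxc's own `assert_unchecked!` macro) are assumptions made for the example.

```rust
use std::hint::assert_unchecked;

/// Hot path (hypothetical): handle ASCII inline, defer everything else
/// to the cold function.
#[inline]
pub fn next_char(rest: &str) -> Option<char> {
    // Returns `None` at EOF.
    let byte = *rest.as_bytes().first()?;
    if byte.is_ascii() {
        Some(byte as char)
    } else {
        // SAFETY: `rest` is non-empty and its first byte is not ASCII
        // (both checked above).
        Some(unsafe { next_unicode_char(rest) })
    }
}

/// Cold path (hypothetical). Because it is not inlined, the optimizer
/// cannot see the checks made by the caller, so they are restated here.
///
/// # Safety
/// `rest` must be non-empty and must start with a non-ASCII character.
#[cold]
#[inline(never)]
unsafe fn next_unicode_char(rest: &str) -> char {
    // SAFETY: both facts are guaranteed by the caller.
    unsafe {
        assert_unchecked(!rest.is_empty());
        assert_unchecked(!rest.as_bytes()[0].is_ascii());
    }
    // With the facts above, the compiler is free to drop the EOF check
    // inside `next()` / `unwrap()` and the ASCII fast path in UTF-8 decoding.
    rest.chars().next().unwrap()
}
```

Whether the branches are actually removed depends on the compiler version and optimization level; the assertions only make the removal possible.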

codspeed-hq · 2025-11-05T14:33:38Z

CodSpeed Performance Report

Merging #15328 will not alter performance

Comparing 11-01-perf_lexer_remove_branches_from_unicode_handling (f39e645) with main (70bf817) [1]

Summary

✅ 37 untouched

[1] No successful run was found on main (6c09c1f) during the generation of this report, so 70bf817 was used instead as the comparison base. There might be some changes unrelated to this pull request in this report.

Copilot

Pull Request Overview

This PR refactors Unicode character handling in the lexer's Source module to improve performance by providing better hints to the compiler through assert_unchecked!. The changes replace unwrap_unchecked() calls with unwrap() paired with assert_unchecked! to communicate invariants more explicitly.

Replaced unsafe unwrap_unchecked() with safe unwrap() after informing the compiler of invariants via assert_unchecked!
Refactored Unicode character handling functions to remove the byte parameter and make them fully responsible for handling Unicode
Restructured control flow to use if-else expressions instead of early returns
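
The first point in the overview can be pictured with a small before/after pair. This is only a sketch under assumptions: the function names are hypothetical, and `std::hint::assert_unchecked` stands in for the `assert_unchecked!` macro mentioned above.

```rust
use std::hint::assert_unchecked;

/// "Before" style (hypothetical): skip the `None` check directly.
///
/// # Safety
/// `rest` must not be empty.
pub unsafe fn first_char_before(rest: &str) -> char {
    // SAFETY: caller guarantees `rest` is not empty, so `next()` returns `Some`.
    unsafe { rest.chars().next().unwrap_unchecked() }
}

/// "After" style (hypothetical): state the invariant once, then call the
/// safe `unwrap()` and let the optimizer remove the check.
///
/// # Safety
/// `rest` must not be empty.
pub unsafe fn first_char_after(rest: &str) -> char {
    // SAFETY: caller guarantees `rest` is not empty.
    unsafe { assert_unchecked(!rest.is_empty()) };
    rest.chars().next().unwrap()
}
```

Both functions carry the same safety contract. The second form keeps the `unsafe` surface to a single stated fact, and that fact can also inform other checks the compiler would otherwise keep.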

crates/oxc_parser/src/lexer/source.rs

overlookmotel · 2025-11-05T15:25:38Z

Merge activity

Nov 5, 3:25 PM UTC: The merge label '0-merge' was detected. This PR will be added to the Graphite merge queue once it meets the requirements.
Nov 5, 3:25 PM UTC: overlookmotel added this pull request to the Graphite merge queue.
Nov 5, 3:32 PM UTC: Merged by the Graphite merge queue.

github-actions bot added the A-parser (Area - Parser) and C-performance (Category - Solution not expected to change functional behavior, only performance) labels Nov 5, 2025

overlookmotel marked this pull request as ready for review November 5, 2025 14:54

Copilot AI review requested due to automatic review settings November 5, 2025 14:54

Copilot AI reviewed Nov 5, 2025

crates/oxc_parser/src/lexer/source.rs: 3 review threads, all resolved

overlookmotel changed the base branch from main to graphite-base/15328 November 5, 2025 15:03

overlookmotel force-pushed the 11-01-perf_lexer_remove_branches_from_unicode_handling branch from a17b0e4 to 05102f0 Compare November 5, 2025 15:03

overlookmotel changed the base branch from graphite-base/15328 to 11-05-test_transformer_update_transformer_conformance_snapshots November 5, 2025 15:04

overlookmotel mentioned this pull request Nov 5, 2025

test(transformer): update transformer conformance snapshots #15330

Merged

overlookmotel force-pushed the 11-01-perf_lexer_remove_branches_from_unicode_handling branch from 05102f0 to cd66ab0 Compare November 5, 2025 15:05

graphite-app bot changed the base branch from 11-05-test_transformer_update_transformer_conformance_snapshots to graphite-base/15328 November 5, 2025 15:06

graphite-app bot force-pushed the 11-01-perf_lexer_remove_branches_from_unicode_handling branch from cd66ab0 to e3986b3 Compare November 5, 2025 15:12

graphite-app bot force-pushed the graphite-base/15328 branch from 0093db6 to 6c09c1f Compare November 5, 2025 15:12

graphite-app bot changed the base branch from graphite-base/15328 to main November 5, 2025 15:12

graphite-app bot force-pushed the 11-01-perf_lexer_remove_branches_from_unicode_handling branch from e3986b3 to f39e645 Compare November 5, 2025 15:13

overlookmotel self-assigned this Nov 5, 2025

overlookmotel added the 0-merge (Merge with Graphite Merge Queue) label Nov 5, 2025

graphite-app bot force-pushed the 11-01-perf_lexer_remove_branches_from_unicode_handling branch from f39e645 to 5f08c69 Compare November 5, 2025 15:26

graphite-app bot merged commit 5f08c69 into main Nov 5, 2025
21 checks passed

graphite-app bot deleted the 11-01-perf_lexer_remove_branches_from_unicode_handling branch November 5, 2025 15:32

graphite-app bot removed the 0-merge (Merge with Graphite Merge Queue) label Nov 5, 2025

Boshen mentioned this pull request Nov 11, 2025

release(crates): oxc v0.97.0 #15582

Merged
