perf(ast): reduce size of `Comment` to 16 bytes by camchenry · Pull Request #11062 · oxc-project/oxc

camchenry · 2025-05-15T14:41:56Z

I noticed that most of Comment consisted of enums which were only 1 byte in size, but only used a few bits in each byte. There are few enough fields that we can actually store all of them in a single u16 bit flag, which reduces the size of Comment from 24 bytes to 16 bytes.

camchenry · 2025-05-15T14:42:13Z

perf(ast): reduce size of Comment to 16 bytes #11062 👈 (View in Graphite)
main

How to use the Graphite Merge Queue

Add either label to this PR to merge it via the merge queue:

0-merge - adds this PR to the back of the merge queue
hotfix - for urgent hot fixes, skip the queue and merge this PR next

You must have a Graphite account in order to use the merge queue. Sign up using this link.

_{An organization admin has enabled the Graphite Merge Queue in this repository.} _{Please do not merge from GitHub as this will restart CI on PRs being processed by the merge queue.}

This stack of pull requests is managed by Graphite. Learn more about stacking.

codspeed-hq · 2025-05-15T14:50:09Z

CodSpeed Instrumentation Performance Report

Merging #11062 will not alter performance

_{Comparing 05-15-perf_ast_reduce_size_of_comment_to_16_bytes (b9e51e2) with main (eef93b4)}

Summary

✅ 36 untouched benchmarks

camchenry · 2025-05-15T14:50:24Z

@overlookmotel I don't know if this conflicts too much with #11056, but I had started working on this a bit last night before I saw your PR. I think this should still be valid since this both removes a lot of the padding and also reduces the overall size of this struct. Not sure if it'll affect perf at all though, I'm just trying some things at this point.

I'm also expecting that this will break conformance, until I fix the ESTree serialization.

crates/oxc_ast/src/ast/comment.rs

overlookmotel

We only need to lose 1 byte to get this type down to 16 bytes.

So rather than doing so much with the CommentFlags bitflag set, how about just combining preceded_by_newline and followed_by_newline into 1 byte?

That's simple enough that you could do it with an enum, and avoid complication of the bitflags! macro.

enum CommentNewlines {
    None,
    Leading,
    Trailing,
    LeadingAndTrailing,
}

Generated assembly is sometimes more efficient if a struct has no padding, so using all the available bytes can actually perform better than aggressively packing everything into the minimum number of bits. e.g. #11046 got a small perf gain from using more bits.

The other advantage is that preceded_by_newline and followed_by_newline are skipped in ESTree AST, so that avoids writing a custom ESTree serializer, which would be necessary with your current approach. I'm keen to avoid custom serializers as much as we can - they're a pain, and if the type changes later on, they have to be kept in sync.

camchenry · 2025-05-15T15:57:16Z

Generated assembly is sometimes more efficient if a struct has no padding, so using all the available bytes can actually perform better than aggressively packing everything into the minimum number of bits. e.g. #11046 got a small perf gain from using more bits.

Interesting! Seems worth a try.

The other advantage is that preceded_by_newline and followed_by_newline are skipped in ESTree AST, so that avoids writing a custom ESTree serializer, which would be necessary with your current approach. I'm keen to avoid custom serializers as much as we can - they're a pain, and if the type changes later on, they have to be kept in sync.

yeah I was just starting to figure out the serializer part. since we only need to lose 1 byte here, I'm leaning towards dropping the bitflags and keeping the enums, but combining them like you suggested. should hopefully make it a little easier to maintain too.

overlookmotel · 2025-05-15T16:11:51Z

I really hate the bitflags! macro! Half the time it doesn't do what we really need anyway.

Ideally we need an ergonomic way to pack enums together. e.g.:

pack_enums! {
    pub enum Foo {
        A,
        B,
        C,
        D,
    }

    pub enum Bar {
        E,
        F,
    }

    pub struct FooAndBar {
        pub foo: Foo,
        pub bar: Bar,
    }
}

pack_enums! would convert FooAndBar to something like:

pub struct FooAndBar(FooAndBarInner);

enum FooAndBarInner {
    A_and_E,
    B_and_E,
    C_and_E,
    D_and_E,
    A_and_F,
    B_and_F,
    C_and_F,
    D_and_F,
}

impl FooAndBar {
    pub fn foo(&self) -> Foo {
        match self.0 {
            Self::A_and_E | Self::A_and_F => Foo::A,
            Self::B_and_E | Self::B_and_F => Foo::B,
            Self::C_and_E | Self::B_and_F => Foo::C,
            Self::D_and_E | Self::B_and_F => Foo::D,
        }
    }

    pub fn bar(&self) -> Bar {
        match self.0 {
            Self::A_and_E | Self::B_and_E | Self::C_and_E | Self::D_and_E => Bar::E,
            Self::A_and_F | Self::B_and_F | Self::C_and_F | Self::D_and_F => Bar::F,
        }
    }
}

FooAndBar is now 1 byte, and has a niche so Option<FooAndBar> is 1 byte too.

I've not located a crate which does this, sadly.

camchenry · 2025-05-15T17:53:44Z

Looks like this gets us about the same result in codspeed (+1% in some lexer benchmarks). Running locally, I see a ~1% perf improvement in the parser (on my M1 Air laptop) (numbers are reversed, because I ran in the opposite branch order):

overlookmotel

Great!

Let's merge this.

However, my idea of using an enum instead of bitflags! for CommentNewlines was misjudged. I hadn't taken into account the complexity of the setters for the 2 flags. bitflags! would be simpler, and probably more performant - compiler is surprisingly bad at optimizing operations on fieldless enums.

Here's a comparison between the enum-based implementations and code like what bitflags! would produce: https://godbolt.org/z/qT817zo18

The most important ones to compare are the getters and the "real world usage" methods e.g. set_followed_by_newline_true.

So, although I do hate bitflags! it's still better than what I came up with!

I don't know if you have time/inclination to try making that change in a follow-up PR? Really sorry I sent you on a wild goose-chase with my bitflag-hating. You were on the right track to start with, and I led you astray...

overlookmotel · 2025-05-15T19:26:05Z

Pushed a commit to remove the change to the codegen for raw transfer. That change was I think left over from earlier version where used a u16 for flags.

overlookmotel · 2025-05-15T19:26:21Z

Merge activity

May 15, 3:26 PM EDT: The merge label '0-merge' was detected. This PR will be added to the Graphite merge queue once it meets the requirements.
May 15, 3:30 PM EDT: overlookmotel added this pull request to the Graphite merge queue.
May 15, 3:37 PM EDT: Merged by the Graphite merge queue.

I noticed that most of `Comment` consisted of enums which were only 1 byte in size, but only used a few bits in each byte. There are few enough fields that we can actually store all of them in a single `u16` bit flag, which reduces the size of `Comment` from 24 bytes to 16 bytes.

github-actions bot added A-linter Area - Linter A-parser Area - Parser A-ast Area - AST A-codegen Area - Code Generation A-ast-tools Area - AST tools A-formatter Area - Formatter labels May 15, 2025

github-actions bot added the C-performance Category - Solution not expected to change functional behavior, only performance label May 15, 2025

camchenry force-pushed the 05-15-perf_ast_reduce_size_of_comment_to_16_bytes branch 2 times, most recently from 205d72e to c6dafb6 Compare May 15, 2025 14:49

overlookmotel reviewed May 15, 2025

View reviewed changes

crates/oxc_ast/src/ast/comment.rs Outdated Show resolved Hide resolved

overlookmotel reviewed May 15, 2025

View reviewed changes

crates/oxc_ast/src/ast/comment.rs Outdated Show resolved Hide resolved

overlookmotel reviewed May 15, 2025

View reviewed changes

camchenry force-pushed the 05-15-perf_ast_reduce_size_of_comment_to_16_bytes branch from c6dafb6 to 40baa4a Compare May 15, 2025 17:32

camchenry marked this pull request as ready for review May 15, 2025 17:54

camchenry requested a review from overlookmotel May 15, 2025 17:54

overlookmotel approved these changes May 15, 2025

View reviewed changes

overlookmotel added the 0-merge Merge with Graphite Merge Queue label May 15, 2025

graphite-app bot force-pushed the 05-15-perf_ast_reduce_size_of_comment_to_16_bytes branch from 1e0a280 to b9e51e2 Compare May 15, 2025 19:31

graphite-app bot merged commit b9e51e2 into main May 15, 2025
26 checks passed

graphite-app bot deleted the 05-15-perf_ast_reduce_size_of_comment_to_16_bytes branch May 15, 2025 19:37

graphite-app bot removed the 0-merge Merge with Graphite Merge Queue label May 15, 2025

overlookmotel mentioned this pull request May 17, 2025

perf(ast): use bitflags for storing comment flags #11090

Closed

oxc-bot mentioned this pull request May 20, 2025

release(crates): v0.71.0 #11182

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

perf(ast): reduce size of `Comment` to 16 bytes#11062

perf(ast): reduce size of `Comment` to 16 bytes#11062
graphite-app[bot] merged 1 commit intomainfrom
05-15-perf_ast_reduce_size_of_comment_to_16_bytes

camchenry commented May 15, 2025 •

edited

Loading

Uh oh!

camchenry commented May 15, 2025

Uh oh!

codspeed-hq bot commented May 15, 2025 •

edited

Loading

Uh oh!

camchenry commented May 15, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

overlookmotel left a comment •

edited

Loading

Uh oh!

camchenry commented May 15, 2025

Uh oh!

overlookmotel commented May 15, 2025 •

edited

Loading

Uh oh!

camchenry commented May 15, 2025

Uh oh!

overlookmotel left a comment •

edited

Loading

Uh oh!

overlookmotel commented May 15, 2025

Uh oh!

overlookmotel commented May 15, 2025 •

edited by graphite-app bot

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

camchenry commented May 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

camchenry commented May 15, 2025

How to use the Graphite Merge Queue

Uh oh!

codspeed-hq bot commented May 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

CodSpeed Instrumentation Performance Report

Merging #11062 will not alter performance

Summary

Uh oh!

camchenry commented May 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

overlookmotel left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

camchenry commented May 15, 2025

Uh oh!

overlookmotel commented May 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

camchenry commented May 15, 2025

Uh oh!

overlookmotel left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

overlookmotel commented May 15, 2025

Uh oh!

overlookmotel commented May 15, 2025 • edited by graphite-app bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Merge activity

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

camchenry commented May 15, 2025 •

edited

Loading

codspeed-hq bot commented May 15, 2025 •

edited

Loading

camchenry commented May 15, 2025 •

edited

Loading

overlookmotel left a comment •

edited

Loading

overlookmotel commented May 15, 2025 •

edited

Loading

overlookmotel left a comment •

edited

Loading

overlookmotel commented May 15, 2025 •

edited by graphite-app bot

Loading