feat: add benchmarks for json parser #9107

Weijun-H · 2026-01-07T17:13:22Z

Which issue does this PR close?

Closes #NNN.

Rationale for this change

Add targeted JSON reader benchmarks to track performance for wide objects, hex-encoded binary inputs, and projection workloads.

What changes are included in this PR?

Add arrow-json/benches/wide_object.rs for wide-object decode/serialize benchmarks.
Add arrow-json/benches/binary_hex.rs for hex string decoding into Binary/FixedSizeBinary/BinaryView.
Add arrow-json/benches/wide_projection.rs for full vs projected schema decoding.
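For readers who want a feel for the inputs such benchmarks consume, here is a minimal, standard-library-only sketch of generating newline-delimited JSON for the wide-object and hex-binary cases. The function names, the `f0`, `f1`, ... field naming, and the `data` key are assumptions made for illustration; they are not taken from the benchmark files in this PR.

```rust
// Hypothetical input generators; names and layout are illustrative only.

/// Build `rows` newline-delimited JSON objects, each with `fields` integer
/// columns named f0, f1, ... (assumed naming).
fn wide_object_json(rows: usize, fields: usize) -> String {
    let mut out = String::new();
    for row in 0..rows {
        out.push('{');
        for field in 0..fields {
            if field > 0 {
                out.push(',');
            }
            // Vary the value by row and field so the data is not constant.
            out.push_str(&format!("\"f{}\":{}", field, row * fields + field));
        }
        out.push_str("}\n");
    }
    out
}

/// Lowercase hex encoding of a byte slice, two characters per byte.
fn hex_encode(bytes: &[u8]) -> String {
    bytes.iter().map(|b| format!("{:02x}", b)).collect()
}

/// Build `rows` objects with a single hex-string field of `width` bytes
/// (the "data" key is an assumption).
fn binary_hex_json(rows: usize, width: usize) -> String {
    let mut out = String::new();
    for row in 0..rows {
        let bytes: Vec<u8> = (0..width).map(|i| ((row + i) % 256) as u8).collect();
        out.push_str(&format!("{{\"data\":\"{}\"}}\n", hex_encode(&bytes)));
    }
    out
}

fn main() {
    print!("{}", wide_object_json(2, 3));
    print!("{}", binary_hex_json(2, 4));
}
```

A projection workload would presumably feed the same wide rows to a reader whose schema names only a subset of the fields.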

Are these changes tested?

No

Are there any user-facing changes?

No

alamb · 2026-01-07T17:36:14Z

arrow-json/benches/wide_object.rs

+    c.bench_function("decode_wide_object_i64_json", |b| {
+        b.iter(|| {
+            let mut decoder = ReaderBuilder::new(schema.clone())
+                .with_batch_size(1024)


I recommend using a batch size of 4k or 8k to better mirror production.

With that change, I would also recommend using something like 128k input rows.
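One way to read the two suggestions together: with a larger batch size, more input rows are needed for a run to still span several batches. The sketch below works through that arithmetic. It assumes "4k", "8k", and "128k" mean 4096, 8192, and 131072, and that every batch except possibly the last is filled to the batch size; `batches_needed` is a hypothetical helper, not part of arrow-json.

```rust
/// Number of batches needed to hold `rows` rows at `batch_size` rows per
/// batch (ceiling division). Hypothetical helper for illustration.
fn batches_needed(rows: usize, batch_size: usize) -> usize {
    assert!(batch_size > 0, "batch size must be non-zero");
    (rows + batch_size - 1) / batch_size
}

fn main() {
    let rows = 128 * 1024; // assumed meaning of "128k input rows"
    for batch_size in [1024, 4096, 8192] {
        println!(
            "{} rows at batch size {} -> {} batches",
            rows,
            batch_size,
            batches_needed(rows, batch_size)
        );
    }
}
```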

alamb

Thanks @Weijun-H

Any chance you are willing to make them all a single benchmark (called json-reader.rs)? If not no worries, I can consolidate them as a follow on PR

alamb · 2026-01-08T17:06:04Z

Thank you @Weijun-H

Weijun-H mentioned this pull request Jan 7, 2026

perf: improve field indexing in JSON StructArrayDecoder (1.7x speed up) #9086

Merged

github-actions bot added the arrow Changes to the arrow crate label Jan 7, 2026

Weijun-H marked this pull request as draft January 7, 2026 17:30

Weijun-H changed the title ~~feat: add benchmark for decoding and serializing wide JSON objects~~ feat: add benchmark for json parse Jan 7, 2026

alamb reviewed Jan 7, 2026

alamb approved these changes Jan 7, 2026

Weijun-H marked this pull request as ready for review January 7, 2026 17:42

alamb changed the title ~~feat: add benchmark for json parse~~ feat: add benchmarks for json parser Jan 7, 2026

alamb approved these changes Jan 7, 2026

Weijun-H added 7 commits January 8, 2026 10:28

feat: add benchmark for decoding and serializing wide JSON objects

338e4fc

feat: add benchmarks for decoding and serializing wide JSON objects

4986283

feat: add benchmark for wide JSON projection decoding

886bc40

feat: update benchmarks for wide JSON decoding and projection with increased row and batch sizes

9df6f25

chore

ac1f197

feat: refactor benchmarks by removing obsolete wide_object and wide_projection files and adding json-reader benchmark

f61a9a0

chore: fmt

fbf50de

Weijun-H force-pushed the json-wide-tabe-bench branch from b0e2f1d to fbf50de Compare January 8, 2026 08:28

alamb merged commit 73bbfee into apache:main Jan 8, 2026
24 checks passed
