Skip to content

Conversation

@zhuqi-lucas
Copy link
Contributor

Which issue does this PR close?

See comments:
#7873 (comment)

We need to Add StringViewArray gc benchmark with not null cases before we optimizing the performance for this case.

cc @Dandandan

Rationale for this change

Add StringViewArray gc benchmark with not null cases

What changes are included in this PR?

Add StringViewArray gc benchmark with not null cases

Are these changes tested?

Yes

Are there any user-facing changes?

No

@github-actions github-actions bot added the arrow Changes to the arrow crate label Jul 7, 2025
Copy link
Contributor

@alamb alamb left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Makes sense to me -- thank you @zhuqi-lucas 🙏

});
});

let array = gen_view_array_without_nulls(100_000);
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

100_000 seems like an overly large array to be checking (though I see the other benchmarks also use that number)

Given we recommend batch sizes more ike 8k, maybe we should change the benchmark (in a follow on PR) to use 10K or 8K rows instead

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you @alamb for good suggestion, i also addressed this in latest PR.

Copy link
Contributor

@alamb alamb left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you @zhuqi-lucas

@alamb alamb merged commit 38a7a1a into apache:main Jul 8, 2025
26 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

arrow Changes to the arrow crate

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants