-
Notifications
You must be signed in to change notification settings - Fork 89
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Enh]: Better benchmarking routine #805
Comments
Hey, I installed Codspeed for benchmarking on pydata/sparse. I'm transforming benchmarks from asv to codspeed "as we speak".
Codpeed tests the opened PR against main, if that was the question. 🤔 |
Hey Dea, thanks for the input. That's what happens when I open issues in a rush. Let me try to clarify some points and ideas. My understanding is that one can mark some test for benchmarking, and I am wondering what could these test be.
One option is to run TPCH queries with the subset of the data we have in
Yes that is exactly my point: PR (branch) vs main, so I am getting the process right 👌
I have never used it so far, I am happy to give it a spin, but expect to be pinged for help 🙈 |
It doesn't say in the documentation. I guess one could do it "somehow" with the CI, but I don't think that it's an out-of-the-box option. You can select if the report is sent all the time to the PR, or only if there is a failure/improvement... but that's all they mention.
I truly doubt that you'll ever need help from me 😁 .. but sure! |
Commenting to discuss the idea: as plotly is understandably concerned about performances, maybe we could use the script they shared to assess if we have a performance drop |
We would like to learn about your use case. For example, if this feature is needed to adopt Narwhals in an open source project, could you please enter the link to it below?
No response
Please describe the purpose of the new feature or describe the problem to solve.
There are some features that can require extra attention or are worth benchmarking to understand if worth implementing. For example I am thinking of #500 and #743.
Suggest a solution if possible.
I checked how other libraries do that, specifically pydantic. They use codspeed which seems to have a free tier for public repos.
Question is: what to benchmark?! Would TPCH queries in main vs branch be a reasonable test?
If you have tried alternatives, please describe them below.
Currently very manual effort on kaggle
Additional information that may help us understand your needs.
No response
The text was updated successfully, but these errors were encountered: