-
Notifications
You must be signed in to change notification settings - Fork 373
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
arrow2
's estimated_size_bytes
performance issues
#1738
Comments
Can't we fix this in |
Yep, I want to investigate this tomorrow after a good night's sleep. |
arrow2
's estimated_size_bytes
is _extremely_ slowarrow2
's estimated_size_bytes
performance issues
Long-winded explanation here, but the TL;DR is that, as far as I can tell, A) there is nothing wrong with how Due to the way At least in the short-term I think the solution stays the same: get these operations out of the fast path, cache the results in |
The problem is so severe that it is pretty much impossible to use it anywhere in the store without everything slowing down to a crawl, which makes implementing statistics really, really painful.
Even for incremental measurements this is way too slow, orders of magnitudes slower than everything else on the write path.
See #1743 for detailed benchmarks.
If we cannot optimize it any further, my proposal is to move the problem upstream: compute byte sizes within the batching system, therefore:
The text was updated successfully, but these errors were encountered: