Decrease size of _tsids created by TsidBuilder#133631
Decrease size of _tsids created by TsidBuilder#133631felixbarny merged 4 commits intoelastic:mainfrom
Conversation
Based on testing, this has marginal impact on storage but decreases the size of the _tsids from 21B-36B to 18B-21B.
|
Pinging @elastic/es-storage-engine (Team:StorageEngine) |
server/src/test/java/org/elasticsearch/cluster/routing/TsidBuilderTests.java
Outdated
Show resolved
Hide resolved
| * <ul> | ||
| * <li> | ||
| * A hash of the dimension field names (4 bytes). | ||
| * A hash of the dimension field names (1 byte). |
There was a problem hiding this comment.
What happens if we skip this? is it worth experimenting with? It'd be nice to have a tsid that's always the same size e.g. 20b.
There was a problem hiding this comment.
Storage is a little worse, but not much. We may be overfitting to a specific dataset, though. We could also have a fixed 4 bytes instead of a variable 1-4 bytes for the values similarity hash.
|
The impact of this on the
I don't understand why the throughput has regressed compared to #132566 (comment). It should be similar. I suspect that the throughput is quite volatile and can't easily be compared run-to-run. But before going with this approach, we should double-check this. The storage size should be stable as the track includes a force-merge step. Full Rally comparison ...Esbench params ... |
|
Do we have results for 16B too? |
|
Yes, they're here: #133706 (comment) |
|
Seems like a good tradeoff, @martijnvg wdyt? |
|
I'm trying to re-run the benchmarks with beefier instances that are closer to the nightly configuration. The variance for the throughput seems higher than expected. Having some issues with esbench atm. |
|
Merging after the latest round of testing: #132566 |
Based on testing, this has marginal impact on storage but decreases the size of the _tsids from 21B-36B to 18B-21B.
Based on testing, this has marginal impact on storage but decreases the size of the _tsids from 21B-36B to 18B-21B.
Based on testing, this has marginal impact on storage but decreases the size of the _tsids from 21B-36B to 18B-21B.
Follow-up from #133344
Part of #132566