Use sparse array for cell position caches #1312

trxcllnt · 2019-01-27T15:08:33Z

This PR swaps out the hash based cell position caches and the underlying binary search with a constant-time sparse array linear layout vector implementation.

The vector stores element sizes in a single dimension. Sizes are stored in blocks with power-of-two length. The vector supports constant-time (index) => position lookup, update, insertion, and removal by shifting the index to find the block index, then a mask (mod) to find the cell index.

Reverse lookup (position) => index is currently linear with respect to the number of blocks/size of each block. Flamegraphs indicate that on a table with 1MM rows and with the default block_size of 128, this amounts to about 0.15-1ms (150-1000 microseconds) per call when scrolling at the end of the list. If avoiding linear scans is an issue on principle, it would be possible to cache and invalidate the block sizes' prefix sum, and implement this lookup as a binary search again. The block size can also be intelligently tuned based on total element count for a classic space/time trade-off.

This change means the following methods are now O(1):

getSizeAndPositionOfCell
getSizeAndPositionOfLastMeasuredCell
getUpdatedOffsetForIndex

getVisibleCellRange is O(n), where n=num_blocks

The biggest practical benefit is that cell dimensions are stable across invalidation. After initial measurement the scrollbars won't jump around, as you can see in this screen capture I uploaded to youtube. The first half is rendering a MultiGrid with [email protected], and the second half is rendering the same MultiGrid with the current PR. You can find the source for this demo here.

The existing test suites (npm test) all pass
For any new features or bug fixes, both positive and negative test cases have been added
For any new features, documentation has been added
For any documentation changes, the text has been proofread and is clear to both experienced users and beginners.
Format your code with prettier (npm run prettier).
Run the Flow typechecks (npm run typecheck).

…me sparse array cache

wuweiweiwu · 2019-02-24T16:05:58Z

@trxcllnt What is the size impact of this dependency?

trxcllnt · 2019-02-25T18:57:37Z

2.2kb minified (w/ closure compiler simple optimizations) + gzipped, though that could be even smaller if cleaned up and converted from ES5 to an ES6 class. The version up on npm is a few years old, and that was converted to ES5 from ActionScript a few years before that.

omerts · 2019-03-06T17:31:39Z

@trxcllnt This PR really caught my attention, and is an interesting concept.. How is getSizeAndPositionOfCell O(1) if there is still a for loop iterating over the cells and calculating their size?

wuweiweiwu

Looks good to me!!

Thanks for doing this!

This reverts commit 7be1258.

trxcllnt added 2 commits January 27, 2019 05:19

swap hash-based cell position cache and binary search for constant-ti…

32f3259

…me sparse array cache

Guard against incomplete cell measurement

8fcc703

trxcllnt force-pushed the perf/use-llv branch from a9c2ae7 to 8fcc703 Compare January 27, 2019 20:48

wuweiweiwu self-assigned this Feb 24, 2019

wuweiweiwu approved these changes May 3, 2019

View reviewed changes

wuweiweiwu merged commit 7be1258 into bvaughn:master May 3, 2019

alxbradley mentioned this pull request May 17, 2019

Infinite loop in v9.21.1 #1375

Closed

wuweiweiwu added a commit that referenced this pull request May 22, 2019

Revert "Use sparse array for cell position caches (#1312)"

917a9a3

This reverts commit 7be1258.

wuweiweiwu mentioned this pull request Jun 4, 2019

Revert "Use sparse array for cell position caches" #1382

Merged

wuweiweiwu added a commit that referenced this pull request Jun 4, 2019

Revert "Use sparse array for cell position caches (#1312)" (#1382)

9df8639

This reverts commit 7be1258.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use sparse array for cell position caches #1312

Use sparse array for cell position caches #1312

trxcllnt commented Jan 27, 2019 •

edited

Loading

wuweiweiwu commented Feb 24, 2019

trxcllnt commented Feb 25, 2019 •

edited

Loading

omerts commented Mar 6, 2019

wuweiweiwu left a comment

Use sparse array for cell position caches #1312

Use sparse array for cell position caches #1312

Conversation

trxcllnt commented Jan 27, 2019 • edited Loading

wuweiweiwu commented Feb 24, 2019

trxcllnt commented Feb 25, 2019 • edited Loading

omerts commented Mar 6, 2019

wuweiweiwu left a comment

Choose a reason for hiding this comment

trxcllnt commented Jan 27, 2019 •

edited

Loading

trxcllnt commented Feb 25, 2019 •

edited

Loading