Avoid allocating spire uint objects during apply agglomerate #6532

fm3 · 2022-10-05T11:19:32Z

When applying an agglomerate mapping, each value of the segmentation data needs to be converted to Long to then look up the agglomerate id in the hdf5 array. Since the jvm does not support unsigned types, this was previously done by wrapping each signed value in a spire unsigned integer object, and then calling toLong.

This PR changes the code to skip using the spire classes and directly calling the to-long method (which performs bitwise and to do the unsigned logic) on each value.

It also moves the filter-zero logic out of the data conversion since this had a performance cost even if the filterZero boolean was false.

It also adds a pre-allocated LongBuffer instead of using array.map(_.toLong), which is slightly faster, I guess the old version did not know the final length of the output array before.

My measurements show that the conversion to Long is about six times faster with this PR, almost halving the time to apply an agglomerate mapping on my test data.

cc @youri-k

URL of deployed dev instance (used for testing):

https://applymappingfunctionaltyping.webknossos.xyz

Steps to test:

View dataset with agglomerate mapping; activate it, should show mapped data
Compute ad-hoc mesh, should show meshes for mapped data
Histogram and find data should still work for different dtypes (e.g. uint8, uint16, float, uint24-rgb)

Updated (unreleased) changelog
Needs datastore update after deployment
Ready for review

jstriebel

Nice, mostly LGTM, just a comment about the optimizations would be nice. I tested the behavior on the deployed dev-instance, and skimmed through the code, not checking everything in detail. Do you think that's fine?

...ossos-datastore/app/com/scalableminds/webknossos/datastore/services/AgglomerateService.scala

…jects-created * 'master' of github.com:scalableminds/webknossos: (337 commits) Fix docs for the annotation download file format (#6546) Added total runtime information to VX reports (#6543) fix VX report for completed + skipped tasks (#6540) Avoid allocating spire uint objects during apply agglomerate (#6532) Explore remote N5 datasets (#6520) Fix MeshChunk byteOffset (Long, not Int) (#6536) update browserslist (#6505) Support new Mesh File (v3) (#6491) makes workflow_yamlContent optional (#6518) Always return 404 for Failures in Zarr Streaming (#6515) Poll wk version to notify during upgrade (#6451) add script which extracts newest changelog and creates GH release for it (#6504) release 22.10.0 (#6500) voxel³ -> voxel (#6501) Allow task type summary to identify task type when creating tasks in bulk (#6486) Fix sql evolution 090 (defer not null constraint) (#6498) SQL schema cleanup (#6492) Fix validation of layer selection when trying to start globalization of floodfills (#6497) Add "shift + w" shortcut to cycle backwards through tools (#6493) Fix filtering for public datasets in dataset table (#6496) ...

fm3 added 2 commits October 5, 2022 10:40

Avoid allocating spire uint objects during apply agglomerate

921c84c

use long buffer, fix find data

c712ef9

fm3 added backend performance labels Oct 5, 2022

fm3 self-assigned this Oct 5, 2022

fm3 added 2 commits October 5, 2022 13:23

remove debugging output

f8facd7

changelog

66c24ba

fm3 marked this pull request as ready for review October 5, 2022 11:35

fm3 requested a review from jstriebel October 5, 2022 11:35

pretty

f2ca632

jstriebel approved these changes Oct 5, 2022

View reviewed changes

...ossos-datastore/app/com/scalableminds/webknossos/datastore/services/AgglomerateService.scala Show resolved Hide resolved

fm3 and others added 2 commits October 6, 2022 15:49

add explaining comment for data conversion

938d524

Merge branch 'master' into apply-mapping-functional-typing

b0ebd6d

fm3 merged commit 9a4fa63 into master Oct 6, 2022

fm3 deleted the apply-mapping-functional-typing branch October 6, 2022 14:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Avoid allocating spire uint objects during apply agglomerate #6532

Avoid allocating spire uint objects during apply agglomerate #6532

fm3 commented Oct 5, 2022 •

edited

Loading

jstriebel left a comment

Avoid allocating spire uint objects during apply agglomerate #6532

Avoid allocating spire uint objects during apply agglomerate #6532

Conversation

fm3 commented Oct 5, 2022 • edited Loading

URL of deployed dev instance (used for testing):

Steps to test:

jstriebel left a comment

Choose a reason for hiding this comment

fm3 commented Oct 5, 2022 •

edited

Loading