Allow to compose datasets without transforms or with bigwarp/NML landmarks #7395

philippotto · 2023-10-17T14:38:50Z

This PR adds the ability to compose a new dataset from existing dataset layers. There are three possibilities to do this:

- combine datasets without any transforms
- combine datasets with a BigWarp CSV that contains landmarks. Transforms are automatically created.
- combine datasets with two WK NMLs that contain landmarks. Transforms are automatically created.

The user is guided through these options in a wizard. Transforms are only generated for a pair of datasets (i.e., it is not yet possible to compose from more than two datasets).
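
For illustration only, here is a minimal sketch of how landmark pairs could be read from a BigWarp CSV before a transform is derived from them. It is not the code from this PR. The column layout (name, active flag, three coordinates for the first dataset, three for the second) is an assumption about the 3D BigWarp export, and `LandmarkPair` and `parseBigWarpCsv` are hypothetical names.

```typescript
// Sketch under assumptions: each row has 8 quoted columns:
// name, active, x/y/z in the first dataset, x/y/z in the second dataset.
type Vec3 = [number, number, number];

interface LandmarkPair {
  name: string;
  active: boolean;
  source: Vec3; // landmark position in the first dataset
  target: Vec3; // corresponding position in the second dataset
}

function parseBigWarpCsv(csv: string): LandmarkPair[] {
  const pairs: LandmarkPair[] = [];
  for (const rawLine of csv.split(/\r?\n/)) {
    const line = rawLine.trim();
    if (line === "") continue; // skip empty lines, e.g. a trailing newline
    // Assumes fields contain no embedded commas, so a plain split is enough.
    const fields = line.split(",").map((f) => f.trim().replace(/^"|"$/g, ""));
    if (fields.length !== 8) {
      throw new Error(`Expected 8 columns, got ${fields.length}: ${line}`);
    }
    const coords = fields.slice(2).map(Number);
    if (coords.some((c) => !Number.isFinite(c))) {
      throw new Error(`Non-numeric coordinate in row: ${line}`);
    }
    pairs.push({
      name: fields[0],
      active: fields[1] === "true",
      source: [coords[0], coords[1], coords[2]],
      target: [coords[3], coords[4], coords[5]],
    });
  }
  return pairs;
}
```

Each resulting pair links one point in the first dataset to its counterpart in the second. How the transform is computed from these pairs is not covered by this sketch.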

Slack discussion

URL of deployed dev instance (used for testing):

https://composedatasets.webknossos.xyz

Steps to test:

use the wizard to test the individual options
no transforms:
- simply pick one or more datasets
- edit the layers afterwards a bit
- create the dataset and view it
for the other two modes:
- download this archive.
- import the datasets by moving the dataset folders into your binaryData folder
- then compose a new dataset each by providing the landmark files
- in the CSV case, you need to pass the dataset names manually (ROI2017_wkw_tiffs_1_wkw and ROI2017_wkw_tiffs_2_wkw)

Issues:

fixes Import of big warp transformations #7269

(Please delete unneeded items, merge only when none are left open)

Updated changelog
Updated documentation if applicable
Removed dev-only changes like prints and application.conf edits
Considered common edge cases
Needs datastore update after deployment

philippotto · 2024-01-03T16:43:39Z

I cannot progress further than the dataset selection. I chose no transforms in the first step and then regardless of which datasets I select, I get the message that the datasets don't exist.

Hmm, this is weird. It works for me. Can you check the network tab and understand why the dataset can't be found? Happy to debug this together in a call 🤙

The dataset selection behaves a little unintuitively: (1) When first clicking into the selection, no dataset names are shown. (2) After typing, results that match the typed string are shown. (3) After deleting the typed string, some but not all datasets are shown (I think all of the datasets that matched a typed string at some point). - (2) is good, but for (1) and (3) maybe a list of all available dataset names could be shown?

I made the component an async one to avoid having to fetch all datasets. Instead, the same search API is used as in the dashboard. To reduce the confusion, I changed two things:

I adapted the placeholder to read: Type to search and select dataset(s)...
I added code that clears the suggestions after selecting an entry. This means that clicking into the field won't show old suggestions. Thus, the user shouldn't assume that these are all available datasets. Instead, they hopefully type again.

It's not perfect, but I can't think of a better way right now without fetching possibly thousands of datasets in this UI. I hope this is okay?
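
As a rough sketch of the behavior described above (not the actual component, which is built on the antd select), the suggestion handling could look like this. `DatasetSuggestionState` and `SearchFn` are hypothetical names, and the search function stands in for the dashboard's search API.

```typescript
// Hypothetical stand-in for the search API: resolves to matching dataset names.
type SearchFn = (query: string) => Promise<string[]>;

class DatasetSuggestionState {
  suggestions: string[] = [];
  selected: string[] = [];
  private latestRequest = 0;
  private search: SearchFn;

  constructor(search: SearchFn) {
    this.search = search;
  }

  // Called whenever the user types; only matching datasets are fetched.
  async onSearch(query: string): Promise<void> {
    const requestId = ++this.latestRequest;
    if (query.trim() === "") {
      this.suggestions = [];
      return;
    }
    const results = await this.search(query);
    // Ignore responses that were overtaken by a newer request or a selection.
    if (requestId === this.latestRequest) {
      this.suggestions = results;
    }
  }

  // Called when the user picks an entry; old suggestions are cleared so that
  // clicking into the field again does not show a stale, partial list.
  onSelect(name: string): void {
    if (!this.selected.includes(name)) {
      this.selected.push(name);
    }
    this.suggestions = [];
    this.latestRequest++; // invalidate any in-flight search
  }
}
```

The request counter is a design choice of this sketch: it keeps a slow response from repopulating the list after the user has already selected an entry.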

fm3

Backend LGTM :)

daniel-wer · 2024-01-09T18:04:06Z

@philippotto Thanks for addressing my feedback! The added format instructions will be very helpful to users I would imagine.

Hmm, this is weird. It works for me. Can you check the network tab and understand why the dataset can't be found? Happy to debug this together in a call 🤙

This turned out to be an SQL issue of the dev deployment. I wrongly assumed I was the first to deploy the branch at the time, but it was deployed earlier and the schemaVersion became incompatible. After a re-deployment, this works well 👍

I added code that clears the suggestions after selecting an entry. This means that clicking into the field won't show old suggestions. Thus, the user shouldn't assume that these are all available datasets. Instead, they hopefully type again.
It's not perfect, but I can't think of a better way right now without fetching possibly thousands of datasets in this UI. I hope this is okay?

This mostly works well and addresses my issue, but a small nuisance now is that typing and then clicking anywhere but on a dataset entry will clear the search box. I'm not sure whether that's important, but if you see a quick fix, feel free.

One optional suggestion would be to use toggles to disable layers in the layer selection screen, because it would allow users to change their mind (going back one screen and next again, resets all values including the dataset name).

@fm3 Feel free to test as well and/or approve if you are happy with this PR :)

philippotto · 2024-01-10T14:25:58Z

This mostly works well and addresses my issue, but a small nuisance now is that typing and then clicking anywhere but on a dataset entry will clear the search box. I'm not sure whether that's important, but if you see a quick fix, feel free.

Hmm, this seems to be the default behavior in the select component from antd (see doc examples). I didn't find a way to avoid this (I tried autoClearSearchValue and onClear). So, there's not much we can do for now, I'm afraid.

One optional suggestion would be to use toggles to disable layers in the layer selection screen, because it would allow users to change their mind (going back one screen and next again, resets all values including the dataset name).

Good idea! However, I'd like to postpone this to another iteration :) We will certainly get more feedback when this feature is available in production and then we can prioritize.

fm3

Works for me :)

I think the first page of the wizard is a bit cumbersome. It contains three explanations, then a radio group with three buttons that mirror those, and then a next button. Instead it could be three clickable cards, where each starts one of the workflows, and which contain the explanations. To my mind that would be a way clearer UI.

However, this does not need to block this PR

fm3 · 2024-01-16T09:23:45Z

@frcroth One more question: I see that the new case class DataLayerId used inside of ComposeRequestLayer has name and owningOrganization. Isn’t this the same as DataSourceId? Should that just be used instead? I think this thing shouldn’t be called DataLayerId as it does not actually identify a layer, but rather a dataset. Maybe another way could be found here

philippotto · 2024-01-16T12:37:44Z

I think the first page of the wizard is a bit cumbersome. It contains three explanations, then a radio group with three buttons that mirror those, and then a next button. Instead it could be three clickable cards, where each starts one of the workflows, and which contain the explanations. To my mind that would be a way clearer UI.

Good point! We don't have off-the-shelf cards with a nice design yet. However, I decided to slim down the UI by directly inlining the radio buttons where the list was before. The description was somewhat redundant anyway (the next pages of the wizard explain what's needed better). I hope this is okay that way.

fm3 · 2024-01-16T12:41:33Z

Definitely better :)

philippotto · 2024-01-16T12:42:24Z

@frcroth One more question: I see that the new case class DataLayerId used inside of ComposeRequestLayer has name and owningOrganization. Isn’t this the same as DataSourceId? Should that just be used instead? I think this thing shouldn’t be called DataLayerId as it does not actually identify a layer, but rather a dataset. Maybe another way could be found here

Should I wait with merging this PR?

fm3 · 2024-01-16T12:44:11Z

Should I wait with merging this PR?

Can be a follow-up. I guess we can change the API later without issue, since it is not used except by our own frontend code

frcroth · 2024-01-17T08:26:07Z

@frcroth One more question: I see that the new case class DataLayerId used inside of ComposeRequestLayer has name and owningOrganization. Isn’t this the same as DataSourceId? Should that just be used instead? I think this thing shouldn’t be called DataLayerId as it does not actually identify a layer, but rather a dataset. Maybe another way could be found here

Discussed here: https://scm.slack.com/archives/C5AKLAV0B/p1700831539308049?thread_ts=1697184159.482769&cid=C5AKLAV0B
Yes, can be a followup

fm3 · 2024-01-17T08:34:32Z

you’re right, I now wrote #7560

implement compose-dataset-view which accepts NMLs from different datasets

83fdff0

philippotto self-assigned this Oct 17, 2023

wording

739ce7e

philippotto assigned frcroth Nov 14, 2023

fm3 mentioned this pull request Nov 15, 2023

Support for high/low res datasets #4026

Closed

philippotto mentioned this pull request Nov 20, 2023

Upload layers to existing datasets #7444

Open

2 tasks

frcroth added 4 commits November 20, 2023 16:50

Add backend for composing datasets

9722b8c

Extract common method

2ce1031

Merge branch 'master' into compose-datasets

24bc264

Use scale and check if directory is writeable

0fa38a4

frcroth reviewed Nov 20, 2023

webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/ComposeService.scala

frcroth and others added 5 commits November 20, 2023 17:46

Validate user access to all included datasets

e7b7972

Merge branch 'master' into compose-datasets

92224e1

integrate new compose route

f131d7a

temporarily disable most ci checks

75471b2

improve loading state

47e4508

philippotto commented Nov 24, 2023

frontend/javascripts/admin/dataset/dataset_add_compose_view.tsx

frcroth and others added 13 commits November 27, 2023 09:31

Do not use datasource id for compose API

49cd2a6

Refresh inbox after composing dataset

bc2d278

Rename id to datasetId

034c1b4

remove sleep and id workaround; also improve formatting of jumping-to-position-toast

a47e04a

better error handling and don't crash if no transformation is necessary (due to empty NMLs)

92c3d6b

Merge branch 'master' of github.com:scalableminds/webknossos into compose-datasets

195908f

implement wizard for dataset composition

94fff83

clean up a bit

3cbc54e

refactor dataset selection component into own module

911ebff

remove unused onNext/onPrev code

19b222a

refactor into separate modules

5721867

iterate on styling etc

43f92e8

tweak intro paragraphs in wizard

1721d64

philippotto added 2 commits January 3, 2024 17:40

explain how trees are matched

cc5550c

Merge branch 'compose-datasets' of github.com:scalableminds/webknossos into compose-datasets

086afed

frcroth added 3 commits January 8, 2024 09:51

Make symlink trait a service

80eced8

Remove check inbox

3766288

Reorganize uploading services

2f5f948

frcroth requested a review from fm3 January 8, 2024 10:27

fm3 reviewed Jan 8, 2024

fm3 and others added 2 commits January 9, 2024 19:34

format

9b6c80f

Merge branch 'master' into compose-datasets

adb4a82

philippotto added 3 commits January 10, 2024 15:27

change default tab back to UPLOAD

e142d5f

allow trees with multiple nodes during composition instead of expecting single-node trees

72733c0

remove unused import

8474006

fm3 approved these changes Jan 11, 2024

Merge branch 'master' into compose-datasets

13b2c9b

inline radio buttons in first step of compose-dataset-wizard

655b4e9

philippotto enabled auto-merge (squash) January 16, 2024 12:45

Merge branch 'master' into compose-datasets

cc2ba26

philippotto merged commit 08a8e7f into master Jan 16, 2024
2 checks passed

philippotto deleted the compose-datasets branch January 16, 2024 13:10

fm3 mentioned this pull request Jan 17, 2024

Clean up DataSourceId, DataLayerId #7560

Closed
