Remove experimental data selector #2290

stephanwlee · 2019-05-30T01:24:12Z

DataSelector was an experimental feature, when using Database backed
backend, allowed improved user experience in choosing data with built-in
notion of experiments.

The team has no plans to make tangible improvements to it and want to
re-consider the data selection story more holistically.

I, personally, am ok with maintaining this code iff it does not incur
maintenance burden but, especially, since we lack tests, that is not the
case today. If we want to revive it, we can always revert the change.

WANT_LGTM=all

wchargin

Oh, wow, okay.

The code change looks good (modulo inline), and the intent is fine with
me, but I feel like we should run this by @nfelt, too—?

tensorboard/components/tf_tensorboard/tf-tensorboard.html

nfelt · 2019-05-31T20:00:58Z

I'm ok with this if you think it's best to avoid the maintenance overhead. I do think it's valuable to be able to have this concrete functionality on hand as a working reference/demo when revisiting this area in the near future (whether or not we revive the literal code itself), but it's true that we can check out and build older revisions even if it's removed at head.

Basically it's up to you.

stephanwlee · 2019-05-31T21:47:13Z

it's true that we can check out and build older revisions even if it's removed at head.

@nfelt yup, TB 1.13.x and probably older version should contain this. Btw, this PR is "WANT_LGTM=all" :)

DataSelector was an experimental feature, when using Database backed backend, allowed improved user experience in choosing data with built-in notion of experiments. The team has no plans to make tangible improvements to it and want to re-consider the data selection story more hollistically.

@wchargin

Summary: This implements a rudimentary experiment selection mechanism for the runs selector and scalars dashboard only, for the purposes of testing data provider implementations. The experiment ID is stored as a query parameter, which isn’t perfect because it precludes caching the big TensorBoard HTML blob across experiments. In the long term, we can consider refactoring to serve the big HTML blob from a static path that’s fetched by the page. The TensorBoard codebase still has some fragments of a previous experimental data selector, which was partially removed in #2290, and so parts of this diff look like changes but are functionally additions. Test Plan: Instrument the multiplexer data provider to log experiment IDs: ```diff diff --git a/tensorboard/backend/event_processing/data_provider.py b/tensorboard/backend/event_processing/data_provider.py index ef602320..4c72d096 100644 --- a/tensorboard/backend/event_processing/data_provider.py +++ b/tensorboard/backend/event_processing/data_provider.py @@ -54,7 +54,7 @@ class MultiplexerDataProvider(provider.DataProvider): return None def list_runs(self, experiment_id): - del experiment_id # ignored for now + logger.warn("Listing runs for experiment %r", experiment_id) return [ provider.Run( run_id=run, # use names as IDs @@ -65,7 +65,7 @@ class MultiplexerDataProvider(provider.DataProvider): ] def list_scalars(self, experiment_id, plugin_name, run_tag_filter=None): - del experiment_id # ignored for now + logger.warn("Listing scalars for experiment %r", experiment_id) run_tag_content = self._multiplexer.PluginRunToTagToContent(plugin_name) result = {} if run_tag_filter is None: @@ -96,6 +96,7 @@ class MultiplexerDataProvider(provider.DataProvider): def read_scalars( self, experiment_id, plugin_name, downsample=None, run_tag_filter=None ): + logger.warn("Reading scalars for experiment %r", experiment_id) # TODO(@wchargin): Downsampling not implemented, as the multiplexer # is already downsampled. We could downsample on top of the existing # sampling, which would be nice for testing. ``` Then launch TensorBoard with `--generic_data=true` and navigate to <http://localhost:6006/?experiment=foo>; verify that all three varieties of server logs are exclusively for the correct experiment ID. Then, remove the query parameter, and ensure that TensorBoard still works normally (with experiment ID the empty string). wchargin-branch: data-experiment

@wchargin

Summary: This implements a rudimentary experiment selection mechanism for the runs selector and scalars dashboard only, for the purposes of testing data provider implementations. The experiment ID is stored as a query parameter, which isn’t perfect because it precludes caching the big TensorBoard HTML blob across experiments. In the long term, we can consider refactoring to serve the big HTML blob from a static path that’s fetched by the page. The TensorBoard codebase still has some fragments of a previous experimental data selector, which was partially removed in #2290, and so parts of this diff look like changes but are functionally additions. Test Plan: Instrument the multiplexer data provider to log experiment IDs: ```diff diff --git a/tensorboard/backend/event_processing/data_provider.py b/tensorboard/backend/event_processing/data_provider.py index ef602320..4c72d096 100644 --- a/tensorboard/backend/event_processing/data_provider.py +++ b/tensorboard/backend/event_processing/data_provider.py @@ -54,7 +54,7 @@ class MultiplexerDataProvider(provider.DataProvider): return None def list_runs(self, experiment_id): - del experiment_id # ignored for now + logger.warn("Listing runs for experiment %r", experiment_id) return [ provider.Run( run_id=run, # use names as IDs @@ -65,7 +65,7 @@ class MultiplexerDataProvider(provider.DataProvider): ] def list_scalars(self, experiment_id, plugin_name, run_tag_filter=None): - del experiment_id # ignored for now + logger.warn("Listing scalars for experiment %r", experiment_id) run_tag_content = self._multiplexer.PluginRunToTagToContent(plugin_name) result = {} if run_tag_filter is None: @@ -96,6 +96,7 @@ class MultiplexerDataProvider(provider.DataProvider): def read_scalars( self, experiment_id, plugin_name, downsample=None, run_tag_filter=None ): + logger.warn("Reading scalars for experiment %r", experiment_id) # TODO(@wchargin): Downsampling not implemented, as the multiplexer # is already downsampled. We could downsample on top of the existing # sampling, which would be nice for testing. ``` Then launch TensorBoard with `--generic_data=true` and navigate to <http://localhost:6006/?experiment=foo>; verify that all three varieties of server logs are exclusively for the correct experiment ID. Then, remove the query parameter, and ensure that TensorBoard still works normally (with experiment ID the empty string). wchargin-branch: data-experiment

wchargin reviewed May 30, 2019

View reviewed changes

tensorboard/components/tf_tensorboard/tf-tensorboard.html Show resolved Hide resolved

stephanwlee requested a review from nfelt May 30, 2019 03:34

wchargin approved these changes May 31, 2019

View reviewed changes

stephanwlee force-pushed the gc branch 2 times, most recently from de15b88 to 805e934 Compare June 7, 2019 15:07

stephanwlee requested review from manivaradarajan and removed request for nfelt June 7, 2019 15:12

stephanwlee added 4 commits June 10, 2019 09:06

Missed few spots

7c43048

Wow, I wrote the tests

7123024

Minor style mistake

aed9140

stephanwlee force-pushed the gc branch from f252192 to aed9140 Compare June 10, 2019 16:07

manivaradarajan approved these changes Jun 11, 2019

View reviewed changes

stephanwlee merged commit b764306 into tensorflow:master Jun 11, 2019

stephanwlee deleted the gc branch June 11, 2019 04:03

wchargin mentioned this pull request Aug 19, 2019

data: plumb experiment ID through runs and scalars #2580

Merged

wchargin mentioned this pull request Jul 14, 2020

Scalars: Multiplex fetch (one tag, many runs). #3835

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove experimental data selector #2290

Remove experimental data selector #2290

stephanwlee commented May 30, 2019 •

edited

Loading

wchargin left a comment

nfelt commented May 31, 2019

stephanwlee commented May 31, 2019 •

edited

Loading

Remove experimental data selector #2290

Remove experimental data selector #2290

Conversation

stephanwlee commented May 30, 2019 • edited Loading

wchargin left a comment

Choose a reason for hiding this comment

nfelt commented May 31, 2019

stephanwlee commented May 31, 2019 • edited Loading

stephanwlee commented May 30, 2019 •

edited

Loading

stephanwlee commented May 31, 2019 •

edited

Loading