Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

data store: wipe workflow data on reload. #1479

Merged
merged 3 commits into from
Oct 19, 2023

Conversation

oliver-sanders
Copy link
Member

@oliver-sanders oliver-sanders commented Sep 15, 2023

The UI side of cylc/cylc-flow#5715

In combination with cylc/cylc-flow#5715, this closes #1480

  • When a workflow is reloaded or restarted (node restart infers reload), there may be objects left behind in the store which have been wiped out in the scheduler by the configuration change.
  • To handle this, we must wipe all objects stored on the workflow (e.g. TaskProxies, TaskDefs, Jobs, etc), and rebuild from scratch.
  • Note, we don't need to wipe the data on the Workflow object itself (e.g. port, stateTotals, etc), these things will be refreshed by this or subsequent deltas.

Check List

  • I have read CONTRIBUTING.md and added my name as a Code Contributor.
  • Contains logically grouped changes (else tidy your branch by rebase).
  • Does not contain off-topic changes (use other PRs for other changes).
  • Applied any dependency changes to both setup.cfg (and conda-environment.yml if present).
  • Tests are included (or explain why tests are not needed).
  • CHANGES.md entry included if this is a change that can affect users
  • Cylc-Doc pull request opened if required at cylc/cylc-doc/pull/XXXX.
  • If this is a bug fix, PR should be raised against the relevant ?.?.x branch.

@oliver-sanders oliver-sanders added the bug Something isn't working label Sep 15, 2023
@oliver-sanders oliver-sanders added this to the 2.2.0 milestone Sep 15, 2023
@oliver-sanders oliver-sanders self-assigned this Sep 15, 2023
Copy link
Member

@MetRonnie MetRonnie left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Something is going wrong here when you open the graph view at the same time as the tree view. Both stop updating.

@oliver-sanders oliver-sanders marked this pull request as draft October 10, 2023 10:01
Comment on lines +41 to +44
store.commit('workflows/REMOVE_CHILDREN', (deltas.updated.workflow.id))
}
if (deltas.added?.workflow?.reloaded) {
store.commit('workflows/REMOVE_CHILDREN', (deltas.added.workflow.id))
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

To test, try jamming in console.log statements or whatever here to ensure that the REMOVE_CHILDREN store method is being called.

@@ -25,3 +25,61 @@ E.G. the Tree *view* displays Cylc cycles/tasks/jobs in a collapsible hierarchy.
It uses the Tree *component* which provides the generic tree logic e.g.
indentation, expand/collapse, etc. This Tree *component* is also used by
the Workflows *view*.


## Subscriptions
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Added some details here as the rules for subscription writing are getting a little complex.

See also #862 for context, and a possible way to simplify things for view writers.

@oliver-sanders
Copy link
Member Author

Started this again from scratch:

  • I've moved the subscriptions out of the common module into the views to which they pertain.
  • The idea of the subscription merging interface is that each view only requests the fields it needs so code reuse is not desirable here.
  • I've been through the subscriptions and pulled out a number of fields which weren't required by the views which were requesting them.
  • I've also added a couple of things in which the views weren't requesting but should have been.
  • The tree and table now have different subscriptions containing only the things they need rather than the superset of the two.
  • I've added the reloaded field for each of the workflow query views (i.e. the views which get their data from the "workflow" subscription). The other subscriptions don't need this field:
    • Application subscription is used for Workflows, Dashboard & WorkflowsTable, since these only request information about the workflow, not about tasks, etc, they don't need to be wiped on reload.
    • Log subscriptions are reload agnostic.
    • The analysis view uses queries not subscriptions so doesn't require this logic.

The tree and graph seem to be playing together nicely.

@oliver-sanders oliver-sanders marked this pull request as ready for review October 11, 2023 08:29
Copy link
Member

@MetRonnie MetRonnie left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Seems to work well. Spotted a couple of minor issues after a reload which may or may not be specific to this branch

  1. Succeeded tasks without jobs (I have a feeling we came across this before)

    image

  2. The Graph View arrow placement went a bit errant for a moment and I got this error and couldn't pan or zoom anymore (though the graph kept updating afterwards)

    image

@oliver-sanders
Copy link
Member Author

Succeeded tasks without jobs (I have a feeling we came across this before)

I haven't seen that one in testing, though I have seen cylc/cylc-flow#5215

If you're able to reproduce, could you check whether the job info was sent through in the deltas?

@oliver-sanders
Copy link
Member Author

oliver-sanders commented Oct 16, 2023

I can't reproduce either reported issue?

Though the graph issue you showed wouldn't be entirely surprising, as I don't imagine svgPanZoom would appreciate a document with no dimensions.

I have now seen the absence of graph arrows following a firstParent null error though which is useful.

@oliver-sanders
Copy link
Member Author

Turns out that repeatably reloading the workflow with this branch is a really good way of triggering the firstParent bug.

Tried testing with #1513 merged in and everything is looking good after 25 reloads:

Screenshot from 2023-10-16 16-58-34

diff --git a/src/services/treeCallback.js b/src/services/treeCallback.js
index 570c3ac4..1ab9308c 100644
--- a/src/services/treeCallback.js
+++ b/src/services/treeCallback.js
@@ -38,9 +38,11 @@ class CylcTreeCallback extends DeltasCallback {
     // do this, we can end up with nodes in the store which aren't meant to be
     // there and won't get pruned.
     if (deltas.updated?.workflow?.reloaded) {
+      console.warn('REMOVE_CHILDREN')
       store.commit('workflows/REMOVE_CHILDREN', (deltas.updated.workflow.id))
     }
     if (deltas.added?.workflow?.reloaded) {
+      console.warn('REMOVE_CHILDREN')
       store.commit('workflows/REMOVE_CHILDREN', (deltas.added.workflow.id))
     }
   }

* Each view must register a subscription requesting the
  data the view requires.
* Currently some of these queries are colocated and have not
  been kept in sync with the requirements of the views.
* This commit moves the queries into the views they are used by.
* Remove fields from the query which are not used by the view.
* Fill in missing field requests.
* When a workflow is reloaded or restarted (node restart infers reload),
  there may be objects left behind in the store which have been wiped
  out in the scheduler by the configuration change.
* To handle this, we must wipe all objects stored on the workflow
  (e.g. TaskProxies, TaskDefs, Jobs, etc), and rebuild from scratch.
* Note, we don't need to wipe the data on the Workflow object itself
  (e.g. port, stateTotals, etc), these things will be refreshed by
  this or subsequent deltas.
@oliver-sanders
Copy link
Member Author

@MetRonnie

  • Applied suggestions
  • Removed delta fragments from all queries

Copy link
Member

@MetRonnie MetRonnie left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I've tested out again, and the only problem I can see is the jobs disappear on reload (includes running and succeeded jobs) (seems to be at least latestJob goes missing?). This does not happen on master / #1513 presumably because the data was not wiped.

Update: turns out this job wiping was only happening in simulation mode, which is forgiveable

Copy link
Member

@hjoliver hjoliver left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, tested as working 👍

@hjoliver hjoliver merged commit ccba831 into cylc:master Oct 19, 2023
@oliver-sanders oliver-sanders deleted the reloaded-deltas branch October 23, 2023 08:51
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging this pull request may close these issues.

data store: tasks left behind in store on reload/restart
3 participants