feat: Model merging with delta objects #4177

byronxu99 · 2022-09-29T17:16:59Z

Add a new VW::model_delta object that internally keeps a VW::workspace
Define operators + and - such that deltas are created by subtracting two workspaces, and can be added to update a workspace
Add a merge_delta function that combines multiple deltas into a single delta
Refactor model merging to be implemented via delta merging
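
The operator semantics listed above can be sketched with a toy model. This is a minimal illustration, not the PR's implementation: `toy_workspace` and `toy_delta` are hypothetical stand-ins for `VW::workspace` and `VW::model_delta`, and a model is reduced to a dense weight vector.

```cpp
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// Hypothetical stand-in for VW::workspace: a model is just a weight vector here.
struct toy_workspace
{
  std::vector<float> weights;
};

// Hypothetical stand-in for VW::model_delta: it owns a workspace whose weights
// are interpreted as a difference between two models.
class toy_delta
{
public:
  explicit toy_delta(toy_workspace diff) : _diff(std::move(diff)) {}
  const toy_workspace& as_workspace() const { return _diff; }

private:
  toy_workspace _diff;
};

// workspace - workspace -> delta
toy_delta operator-(const toy_workspace& updated, const toy_workspace& base)
{
  assert(updated.weights.size() == base.weights.size());
  toy_workspace diff;
  diff.weights.resize(base.weights.size());
  for (std::size_t i = 0; i < base.weights.size(); ++i)
  {
    diff.weights[i] = updated.weights[i] - base.weights[i];
  }
  return toy_delta(std::move(diff));
}

// workspace + delta -> updated workspace
toy_workspace operator+(const toy_workspace& base, const toy_delta& delta)
{
  const std::vector<float>& d = delta.as_workspace().weights;
  assert(base.weights.size() == d.size());
  toy_workspace result = base;
  for (std::size_t i = 0; i < d.size(); ++i) { result.weights[i] += d[i]; }
  return result;
}
```

In this sketch, `(trained - base)` yields a delta, and `base + delta` reproduces `trained`; neither operator accepts two deltas.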

olgavrou

It looks like model_delta is just a wrapper around a workspace, used to communicate that the workspace holds a delta.

I'm wondering if the +/- operators should be operators of the workspace class. That would make for a more intuitive API for those operators, and it would also mean we avoid wrapping workspaces into deltas in order to merge them.

If we need to indicate that a workspace is a diff, we could add a bool field for that, or maybe model_delta could be derived from workspace.

Thoughts?

byronxu99 · 2022-10-03T16:35:58Z

Yes, as it stands, model_delta is simply a wrapper around VW::workspace that makes types explicit and specifies ownership. I think this provides a greater level of safety than the alternative of having an is_delta flag inside of workspace.
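
To illustrate the safety argument: with a distinct wrapper type, passing a plain workspace where a delta is expected fails at compile time, whereas an is_delta flag could only be checked at run time. The sketch below uses hypothetical toy types (not VW's actual classes) and a small detection trait to show which `+` combinations compile. It assumes C++17 for `std::void_t`.

```cpp
#include <cassert>
#include <cstddef>
#include <type_traits>
#include <utility>
#include <vector>

// Hypothetical toy types; a model is reduced to a weight vector.
struct toy_workspace
{
  std::vector<float> weights;
};

// Distinct type: a delta can never be mistaken for a full model.
struct toy_delta
{
  toy_workspace diff;
};

// Only "workspace + delta" is defined.
toy_workspace operator+(toy_workspace base, const toy_delta& delta)
{
  assert(base.weights.size() == delta.diff.weights.size());
  for (std::size_t i = 0; i < base.weights.size(); ++i)
  {
    base.weights[i] += delta.diff.weights[i];
  }
  return base;
}

// Detection trait: true when the expression `A + B` is well-formed.
template <typename A, typename B, typename = void>
struct can_add : std::false_type
{
};

template <typename A, typename B>
struct can_add<A, B, std::void_t<decltype(std::declval<A>() + std::declval<B>())>> : std::true_type
{
};

static_assert(can_add<toy_workspace, toy_delta>::value, "workspace + delta is allowed");
static_assert(!can_add<toy_workspace, toy_workspace>::value, "workspace + workspace is rejected");
static_assert(!can_add<toy_delta, toy_delta>::value, "delta + delta is rejected");
```

With a boolean flag on a single class, all three combinations would compile and misuse would surface only at run time, if at all.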

The - operator already operates on the workspace class, but returns a model_delta. The + operator I've defined as an "update a workspace with new changes" operation, and thus requires a workspace and a model_delta.

I specifically do not want + to be a "merge" operation, that is, to work on two workspaces or two deltas. The reason is that it won't work for merging GD with save_resume, which must have access to the set of all workspaces being merged at once, instead of just two of them at a time. Otherwise, we could have implemented the merge function in terms of +, -, and a "multiply by scale factor" operation, which I would find more intuitive.
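
For models where a merge is a plain average, the alternative mentioned above might look roughly like the following sketch: a delta-plus-delta operator and a scalar multiply are enough to build the merge by folding over pairs. All names here are hypothetical, and this is the design the PR deliberately does not adopt, because a pairwise fold only ever sees two operands at a time.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical toy delta: just a vector of weight differences.
struct toy_delta
{
  std::vector<float> diff;
};

// Pairwise "merge" operator (the rejected design): delta + delta.
toy_delta operator+(toy_delta lhs, const toy_delta& rhs)
{
  assert(lhs.diff.size() == rhs.diff.size());
  for (std::size_t i = 0; i < lhs.diff.size(); ++i) { lhs.diff[i] += rhs.diff[i]; }
  return lhs;
}

// "Multiply by scale factor" operation.
toy_delta operator*(float scale, toy_delta d)
{
  for (float& w : d.diff) { w *= scale; }
  return d;
}

// Averaging merge expressed only through + and scaling.
toy_delta average_by_folding(const std::vector<toy_delta>& deltas)
{
  assert(!deltas.empty());
  toy_delta sum = deltas.front();
  for (std::size_t k = 1; k < deltas.size(); ++k) { sum = sum + deltas[k]; }
  return (1.0f / static_cast<float>(deltas.size())) * sum;
}
```

This works when each step needs only the running sum. A merge rule that depends on the whole set of models at once cannot be expressed this way, which is the constraint described above for GD with save_resume.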

The case of "wrapping workspaces into deltas in order to merge them" isn't as bad as it seems at first. In federated learning, we would be directly receiving deltas to pass to the merge function - having to merge workspaces via deltas would be the less common use case. It also simplifies the merge logic inside each reduction, because there's no longer a requirement to check for a base workspace and subtract it if it exists.
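
A rough sketch of the federated flow described here: the server receives deltas from clients, merges all of them in a single call, and applies the result to its own model. The types and functions below are hypothetical, and weighting each delta by an `example_count` field is an assumption made for illustration, not a statement of how VW's reductions actually merge.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical toy types, not VW's classes.
struct toy_workspace
{
  std::vector<float> weights;
};

struct toy_delta
{
  std::vector<float> diff;
  float example_count;  // assumed per-client weight, for illustration only
};

// N-way merge: the normalizer is computed over the full set of deltas,
// so the function needs to see all of them at once.
toy_delta toy_merge(const std::vector<const toy_delta*>& deltas)
{
  assert(!deltas.empty());
  float total = 0.0f;
  for (const toy_delta* d : deltas) { total += d->example_count; }
  assert(total > 0.0f);

  toy_delta merged{std::vector<float>(deltas.front()->diff.size(), 0.0f), total};
  for (const toy_delta* d : deltas)
  {
    assert(d->diff.size() == merged.diff.size());
    const float scale = d->example_count / total;
    for (std::size_t i = 0; i < merged.diff.size(); ++i) { merged.diff[i] += scale * d->diff[i]; }
  }
  return merged;
}

// Update the server model in place with the merged delta.
void apply_delta(toy_workspace& server, const toy_delta& merged)
{
  assert(server.weights.size() == merged.diff.size());
  for (std::size_t i = 0; i < server.weights.size(); ++i) { server.weights[i] += merged.diff[i]; }
}
```

Because the inputs are already deltas, the merge itself never has to look for a base model and subtract it.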

byronxu99 added 9 commits September 21, 2022 16:10

Add boolean flag is_delta to merge functions

a194fc0

Initial attempt at model addition and subtraction

550ed50

Add model delta merge test

8cd5f1e

Remove is_delta flag + use class method instead of static_cast for delta conversion

04ecec9

Implement model merging in terms of delta merging

205785c

Add some tests

bd3bc47

Use size_t instead of uint

ec73bd7

Add weights check to merge test

f0a51fe

Merge branch 'master' of https://github.com/VowpalWabbit/vowpal_wabbit into model_delta_merging

32f54d2

byronxu99 marked this pull request as ready for review September 30, 2022 15:05

byronxu99 added 2 commits September 30, 2022 11:09

Clang format

a29397d

Fix bug where workspace was passed by value instead of by reference

9dff702

olgavrou reviewed Oct 3, 2022

jackgerrits approved these changes Oct 3, 2022

byronxu99 added 3 commits October 3, 2022 15:38

Merge branch 'master' into model_delta_merging

111a186

Merge branch 'master' into model_delta_merging

b7de81c

Merge branch 'master' into model_delta_merging

53b6d25

byronxu99 merged commit c2d0f6c into VowpalWabbit:master Oct 6, 2022

byronxu99 deleted the model_delta_merging branch October 6, 2022 15:08
