feat: Privacy Preserving Learning #3334

manavsinghal157 · 2021-09-20T17:09:53Z

Part of the Empirical Analysis of Privacy Preserving Learning Project.

This PR introduces a command line argument that enables aggregated learning: when the model is saved, only the features that have been seen by a minimum threshold of users are kept, which upholds the privacy of individual users.

Methodology:

For each feature, a 32-bit vector is defined. (vowpalwabbit/array_parameters.h and vowpalwabbit/array_parameters_dense.h)
We calculate a 5-bit hash of the tag of the example. (vowpalwabbit/parser.cc)
For each feature weight updated by a non-zero value, we use the 5-bit hash to look up a bit in the 32-bit vector and set it to 1. (vowpalwabbit/gd_predict.h -> (vowpalwabbit/array_parameters.h and vowpalwabbit/array_parameters_dense.h))
When saving the weights into a file, we calculate the number of bits set to 1 for a feature. If it is greater than the threshold, the weights for that feature are saved. (vowpalwabbit/gd.cc->(vowpalwabbit/array_parameters.h and vowpalwabbit/array_parameters_dense.h))

(The default value of the threshold is 10)
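The methodology above can be sketched in a few lines of C++. This is only an illustration under stated assumptions, not the code in this PR: `tag_slot`, `ActivationTracker`, `on_update` and `is_activated` are hypothetical names, `std::hash` stands in for the tag hash computed in vowpalwabbit/parser.cc, and a hash map stands in for the per-weight storage in the parameter classes.

```cpp
#include <bitset>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <functional>
#include <string>
#include <unordered_map>

// Hypothetical names throughout; this is not the VW implementation.
constexpr std::size_t kBitsetSize = 32;  // one 32-bit vector per feature

// Reduce an example tag to a 5-bit slot (0..31). std::hash is a stand-in
// for whatever hash the parser actually applies to the tag.
inline std::size_t tag_slot(const std::string& tag)
{
  return std::hash<std::string>{}(tag) & (kBitsetSize - 1);
}

class ActivationTracker
{
public:
  // Record that the example occupying `slot` changed this feature's weight.
  // Updates by zero are ignored, as in the description.
  void on_update(std::uint64_t feature_index, std::size_t slot, float delta)
  {
    assert(slot < kBitsetSize);
    if (delta != 0.f) { _seen[feature_index].set(slot); }
  }

  // A feature is saved only if the number of set bits exceeds `threshold`.
  bool is_activated(std::uint64_t feature_index, std::size_t threshold) const
  {
    auto it = _seen.find(feature_index);
    return it != _seen.end() && it->second.count() > threshold;
  }

private:
  std::unordered_map<std::uint64_t, std::bitset<kBitsetSize>> _seen;
};
```

Because tags are hashed into 32 slots, two different tags can land on the same bit, so the bit count is a lower bound on the number of distinct tags and can never exceed 32.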

This PR includes:

Command line argument to activate privacy preservation and set the threshold. (vowpalwabbit/parse_args.cc)
Runtests to test the desired output on a small dataset. (test/core.vwtest.json)
Unit-tests for checking output when threshold is reached for a feature and when it is not. (test/unit_test/weights_test.cc)
Benchmarks to measure the time taken for learning with privacy preservation enabled. (test/benchmarks/standalone/benchmark_text_input.cc)

Implementation details:

--privacy_activation : To activate the feature
--privacy_activation_threshold arg (=10) : To set the threshold

Future Work:

Implement the feature for save_resume.
Work on aggregations in the online setting.

Wiki page for this feature: https://github.com/VowpalWabbit/vowpal_wabbit/wiki/Privacy-Preserving-Learning

manavsinghal157 added 30 commits June 21, 2021 23:31

Patch for Privacy Preserving Learning

7a99406

Changes in response to PR comments

1c14739

Made variables private in parameters and defined unset_tag in gd.h

f53b820

Added unit test for feature activation, cleaned parameters for consis…

477ea18

…tency and removed is_activated in gd.cc to pass checks

Added offset and is_activated()

3523a98

Added _weight_mask

161dcd5

Merge pull request #1 from manavsinghal157/RLOS-21--Privacy-Preservin…

882b05a

…g-Learning Patch for Privacy Preserving Learning

Interactions included

7b9dd99

Change to original

6842ab5

Reverting to original

3e916be

Added set_tag support for ftrl.cc

09000c8

Merge pull request #2 from manavsinghal157/RLOS-21--Privacy-Preservin…

0738cc2

…g-Learning Patch_for_privacy_preserving_learning #2

Added command line argument for privacy activation

6ed31d0

Merge pull request #3 from manavsinghal157/RLOS-21--Privacy-Preservin…

536668e

…g-Learning Command line argument for privacy preserving learning

Calculating tag_hash in parser.cc

7b1571a

Added support in example.cc and made the bitset_size a variable

2ebce73

Cleaning

1b10a67

RunTests 2 tests addition

39322c9

Resolving reviewer comments

93b7d7b

Merge pull request #4 from manavsinghal157/RLOS-21--Privacy-Preservin…

723a97b

…g-Learning Calculating tag_hash in parser.cc && RunTests

Benchmark for Privacy_Activation

60e0201

Fetching upstream

de58630

Undoing Benchmarks

0496c38

Merge pull request #5 from manavsinghal157/RLOS-21--Privacy-Preservin…

53c5d50

…g-Learning Fetching Upstream

Merge branch 'VowpalWabbit:master' into master

0aa16a8

Benchmarks for Privacy Preserving Learning

27ae138

Merge pull request #6 from manavsinghal157/RLOS-21--Privacy-Preservin…

f615a9b

…g-Learning Benchmarks for Privacy Preserving Learning

Merge branch 'VowpalWabbit:master' into master

c21a572

Merge branch 'VowpalWabbit:master' into master

072971f

Update gd.cc

e81446c

Removed extra } line 802

manavsinghal157 and others added 7 commits September 20, 2021 23:52

Merge branch 'master' into RLOS_Privacy_Bracket_Operator

a88f0bc

Update corrupt_weights_gd_mf.stderr

6d79a7e

Merge branch 'master' into RLOS_Privacy_Bracket_Operator

3d06497

merge from master

505bb2a

fix tests

820d4d9

formatting

1876bba

reserve correctly

d03bda0

olgavrou closed this Nov 23, 2021

olgavrou reopened this Nov 29, 2021

olgavrou added 7 commits November 29, 2021 11:36

Merge branch 'master' into RLOS_Privacy_Bracket_Operator

854ba74

shared ptr for activation bitset

4045e2f

add compile time flag

d599109

cleanup

624a4c1

add ci for privacy activation

9e8868b

permissions

e9e0fbe

Merge branch 'master' into RLOS_Privacy_Bracket_Operator

30816b5

jackgerrits reviewed Nov 29, 2021

vowpalwabbit/parser.cc (resolved)

jackgerrits reviewed Nov 29, 2021

vowpalwabbit/parser.cc (resolved)

olgavrou added 3 commits November 29, 2021 13:39

add more ifdefs

e553241

skip spanning tree tests

a79fc6f

remove comment

e9553f1

olgavrou mentioned this pull request Nov 29, 2021

feat: Privacy Preserving Learning #3485

Closed

olgavrou changed the title from "[wip] please ignore, running benchmarks" to "feat: Privacy Preserving Learning" Nov 29, 2021

olgavrou marked this pull request as ready for review November 29, 2021 21:58

olgavrou added this to the VW 9.0 milestone Nov 29, 2021

olgavrou added 3 commits November 29, 2021 14:16

add input/output label to stderr

5aba3f2

missing ifdef in benchmarks

47d14cd

Merge branch 'master' into RLOS_Privacy_Bracket_Operator

c15bfea

jackgerrits approved these changes Nov 30, 2021

olgavrou merged commit f0e16ad into VowpalWabbit:master Nov 30, 2021
