Conversation

@dan-zheng dan-zheng commented Aug 17, 2019

Canonicalize JVPs/VJPs to return maximally abstracted linear map functions
with `@in_guaranteed` parameters and an `@out` result.

This is a necessary step towards re-enabling LoadableByAddress:
the linear map type is no longer computed from the original function type.

Summary of changes:

  • SILGen
    • Extra reabstraction logic for JVPs/VJPs and differentials/pullbacks to
      handle maximally indirect linear maps.
    • Related forum discussion.
  • Differentiation transform
    • Use only tangent/adjoint buffers in differential/pullback generation.

See TF-11 for more info regarding LoadableByAddress.
See TF-625 for more info regarding maximally abstracted linear maps.
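
As a sketch of the convention (the function names and types here are illustrative, not taken from the patch): for an original function of lowered type `(Float) -> Float`, a maximally abstracted VJP and pullback would pass every parameter `@in_guaranteed` and return every result `@out`, even though `Float` is loadable:

```sil
// Hypothetical sketch of the maximally indirect convention.
// Pullback: all parameters @in_guaranteed, all results @out.
sil @pullback : $@convention(thin) (@in_guaranteed Float) -> @out Float

// VJP: returns the original result indirectly, plus the pullback closure.
sil @vjp : $@convention(thin) (@in_guaranteed Float)
  -> (@out Float, @owned @callee_guaranteed (@in_guaranteed Float) -> @out Float)
```

The intent is that this type no longer depends on whether the original parameters and results are loadable, so LoadableByAddress rewrites cannot invalidate it.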


Initial patch by @rxwei; the original commits were lost because resolving conflicts got too complicated, sorry.

@dan-zheng dan-zheng added the tensorflow This is for "tensorflow" branch PRs. label Aug 17, 2019
@dan-zheng dan-zheng force-pushed the ad-all-indirect branch 3 times, most recently from a690449 to bf6b528 Compare September 21, 2019 03:03
- Change differential generation to use only tangent buffers.
- Change pullback visitors to use only adjoint buffers.
- Change adjoint of active value propagation to use only adjoint buffers.
- Mark all tangent/adjoint value helpers as `[[deprecated]]`.
  - They are not deleted because helpers may become useful after
    SIL opaque values are introduced.
Will re-add in a follow-up for separation of concerns.
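
The buffer-only approach in the commits above can be sketched as follows (the SIL is illustrative; register names and types are hypothetical):

```sil
// Hypothetical sketch: the pullback keeps one adjoint buffer per active
// value and accumulates contributions in place, instead of materializing
// AdjointValue objects.
%adj_x = alloc_stack $Float               // adjoint buffer for active value %x
copy_addr %seed to [init] %adj_x : $*Float
// ...later contributions are accumulated into %adj_x in place...
dealloc_stack %adj_x : $*Float
```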
@dan-zheng dan-zheng force-pushed the ad-all-indirect branch 2 times, most recently from 9dbb1d6 to 361ae93 Compare September 21, 2019 03:32
@dan-zheng

@swift-ci Please test tensorflow

@dan-zheng

@swift-ci Please test tensorflow

@dan-zheng

It would be good to test the performance impact of this patch. Using adjoint buffers instead of adjoint values may incur a performance penalty.

Remove unused `AdjointValue` utilities for materialization and addition.
`AdjointValue` itself is preserved to implement symbolic zero buffer optimization.
Address review feedback.
Standardize variable naming.
`tuple_extract` is no longer testable after Differentiation has been moved
before OwnershipModelEliminator. Old reproducers for `tuple_extract` now
generate `destructure_tuple`.

Deleting untestable code is prudent.
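
For reference, the shape of the difference in SIL (a sketch with hypothetical values):

```sil
// Non-ownership form, produced after OwnershipModelEliminator has run:
%elt = tuple_extract %tup : $(Float, Float), 0

// Ownership-correct form, seen now that Differentiation runs earlier:
(%fst, %snd) = destructure_tuple %tup : $(Float, Float)
```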
Avoid unnecessary local tangent struct allocation.
Generate `struct_element_addr` into existing struct tangent buffer.
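
A sketch of the intended pattern (struct and field names are hypothetical): project directly into the existing tangent buffer rather than allocating a fresh local tangent struct:

```sil
// Project the field's tangent address out of the existing tangent buffer,
// avoiding an alloc_stack + copy of a local tangent struct.
%x_adr = struct_element_addr %tan_buf : $*Point.TangentVector,
                                        #Point.TangentVector.x
```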
dan-zheng and others added 4 commits September 25, 2019 18:25
There are no known tests for `tuple_extract` visitors.
Re-adding `tuple_extract` visitors may be necessary when differentiation
supports inout parameters.

This Gist shows cases where SILGen produces `struct_extract`:
https://gist.github.com/dan-zheng/1343673d2d4d20d403306283b42d522b
Some cleanup will be moved to a separate patch.
@dan-zheng

A less invasive approach for re-enabling LoadableByAddress was found: #27923

Maximal indirection will not be pursued further: the differentiation transform should not be forced to generate maximally indirect code to work around issues caused by LoadableByAddress.

@dan-zheng dan-zheng closed this Dec 3, 2019