Support arbitrary `X` in `data` #100

wiseodd · 2024-04-02T18:18:47Z

Closes #86

@f-dangel @runame please give feedback and answer the question below.

Assumption:

data: Union[Iterable[Tuple[Tensor, Tensor]], Iterable[Tuple[UserDict, Tensor]], Iterable[Tuple[dict, Tensor]]]

and there is an additional parameter in _base._LinearOperator:

batch_size_fn: Optional[Callable[[Any], int]] = None

where it must be non-None whenever X is not a torch.Tensor.

This also fits well with Huggingface, although one must replace HF's default dataloader to outputs (data, data['labels']) instead of just data. Let me know if this should be considered.

Code for testing the functionality:

https://gist.github.com/wiseodd/426061afae24199446e60bfabc00e26e
I use laplace-torch there (two birds one stone), so just remove it if you don't want to install it. If you want to test laplace-torch, install via

pip install git+https://github.com/aleximmer/Laplace.git@mc-subset2

wiseodd · 2024-04-02T21:41:58Z

Just to link this to aleximmer/Laplace#144

coveralls · 2024-04-05T19:25:53Z

Pull Request Test Coverage Report for Build 8791845160

Warning: This coverage report may be inaccurate.

This pull request's base commit is no longer the HEAD commit of its target branch. This means it includes changes from outside the original pull request, including, potentially, unrelated coverage changes.

For more information on this, see Tracking coverage changes with pull request builds.
To avoid this issue with future PRs, see these Recommended CI Configurations.
For a quick fix, rebase this PR at GitHub. Your next report should be accurate.

Details

71 of 82 (86.59%) changed or added relevant lines in 8 files are covered.
26 unchanged lines in 3 files lost coverage.
Overall coverage increased (+0.2%) to 88.59%

Changes Missing Coverage	Covered Lines	Changed/Added Lines	%
curvlinops/fisher.py	1	2	50.0%
curvlinops/examples/functorch.py	17	21	80.95%
curvlinops/jacobian.py	36	42	85.71%

Files with Coverage Reduction	New Missed Lines	%
curvlinops/fisher.py	1	28.57%
curvlinops/examples/functorch.py	1	88.3%
curvlinops/kfac.py	24	92.08%

Totals
Change from base Build 8757107974:	0.2%
Covered Lines:	1219
Relevant Lines:	1376

💛 - Coveralls

wiseodd · 2024-04-06T17:27:26Z

@f-dangel @runame I added (i) an example on the usage with HuggingFace transformers in the docs, and (ii) unit tests.

Please review.

wiseodd · 2024-04-07T14:41:49Z

Here's an example use case.

https://curvlinops--100.org.readthedocs.build/en/100/basic_usage/example_huggingface.html

The load for the users is minimal and the choice of UserDict is compatible with HF. Note that the users can do whatever they want in terms of their X's since the only requirements are they are dict/UserDict and they handle their "preprocessing" like .to(device) inside the forward function of the model.

f-dangel

Did a first pass and added some minor suggestions, mainly missing documentation.
One main suggestion is to avoid having to duplicate the test functions, simply by appending the test cases using a dict to CASES_NO_DEVICE and instead make the cases fixture return the batch_size_fn.

curvlinops/_base.py

docs/examples/basic_usage/example_huggingface.py

test/cases.py

runame

While for many settings the model/data loader still has to be adjusted, this seems like a nice usability improvement!

See my comments and formatting and linting have to be fixed as well.

curvlinops/_base.py

curvlinops/fisher.py

curvlinops/kfac.py

test/test_fisher.py

test/test_ggn.py

test/test_gradient_moments.py

test/test_hessian.py

test/test_kfac.py

runame · 2024-04-10T13:28:59Z

One more thing, the type hints for the data also have to be modified in all files that don't require other changes, e.g. ggn.py.

wiseodd · 2024-04-18T21:48:00Z

I'm finally done.

examples/functorch.py is now aware of dict-like inputs
Update all the test cases to accommodate batch_size_fn

@f-dangel @runame ready for another check.

_ _ (_) ___ ___ _ __| |_ | |/ _/ / _ \/ '__ _/ | |\__ \/\_\/| | | |_ |_|\___/\___/\_/ \_/ isort your imports, so you don't have to. VERSION 5.13.2 Nothing to do: no files or paths have have been passed in! Try one of the following: `isort .` - sort all Python files, starting from the current directory, recursively. `isort . --interactive` - Do the same, but ask before making any changes. `isort . --check --diff` - Check to see if imports are correctly sorted within this project. `isort --help` - In-depth information about isort's available command-line options. Visit https://pycqa.github.io/isort/ for complete information about how to use isort.

f-dangel

Mostly nits. Please make sure you try running make test on a compute infrastructure with access to a GPU to make sure there are no device-related issues as GH actions can only check on CPU (I'll do it anyways before merging).

curvlinops/_base.py

curvlinops/examples/functorch.py

curvlinops/fisher.py

curvlinops/jacobian.py

test/test_jacobian.py

test/test_submatrix_on_curvatures.py

wiseodd · 2024-04-19T19:50:40Z

@f-dangel I'm done. Currently running the test on a GPU-enabled env. It takes so long but you can continue reviewing. PEP8 and other linters' issues should also be resolved.

wiseodd · 2024-04-19T20:20:38Z

Alright, confirmed that all tests passed on GPU!

f-dangel · 2024-04-22T21:26:20Z

@runame I'm done with my second pass, could you take a quick second look and merge if everything looks good?

runame

LGTM!

wiseodd added 2 commits April 2, 2024 14:00

[ADD] Generalize assumption about X

2d234e3

Update typehint

33f8dcf

wiseodd added a commit to aleximmer/Laplace that referenced this pull request Apr 2, 2024

[WIP] Support for <f-dangel/curvlinops#100>

af641cd

wiseodd added 2 commits April 6, 2024 09:45

[REF] Add example how to use the UserDict input for HuggingFace models

bdc660d

Add tests for UserDict and dict Xs

cb1e518

wiseodd marked this pull request as ready for review April 6, 2024 17:26

Add Huggingface transformers and datasets to the docs deps

4570160

f-dangel reviewed Apr 9, 2024

View reviewed changes

runame added the enhancement New feature or request label Apr 10, 2024

runame self-requested a review April 10, 2024 11:46

runame reviewed Apr 10, 2024

View reviewed changes

wiseodd added 3 commits April 12, 2024 07:09

Address comments in _base.py

e89509d

Resolves comments in fisher.py and kfac.py

c3bd66d

Update docstrings and example in documentation

3ec309f

wiseodd mentioned this pull request Apr 18, 2024

Bringing laplace-torch to foundation-model era aleximmer/Laplace#144

Merged

wiseodd added 4 commits April 18, 2024 15:46

Update functorch.py to accomodate dict-like inputs

0e43308

Make *JacobianLinearOperator aware of dict-like inputs

18dc107

Update tests

7d2bb9c

Remove unused test case

4bd43ef

wiseodd added 3 commits April 18, 2024 18:05

Add to(device) in ModelWithDictInput test model

64c52ef

Fix import error in unit test

a2e0fff

f-dangel reviewed Apr 18, 2024

View reviewed changes

Address @f-dangel's comments

16e178d

wiseodd added 3 commits April 19, 2024 11:35

Better explanation in ignoring KFAC reduce & expand cases

8b4d1bd

Merge with main

7805e2d

Fix y device in _base.py

442efae

Merge branch 'main' into arbitrary-inputs

8d50917

Resolves flake8 warnings

5363df7

runame approved these changes Apr 23, 2024

View reviewed changes

f-dangel merged commit 30c77b4 into f-dangel:main Apr 24, 2024
7 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support arbitrary `X` in `data` #100

Support arbitrary `X` in `data` #100

wiseodd commented Apr 2, 2024

wiseodd commented Apr 2, 2024

coveralls commented Apr 5, 2024 •

edited

Loading

wiseodd commented Apr 6, 2024

wiseodd commented Apr 7, 2024

f-dangel left a comment

runame left a comment

runame commented Apr 10, 2024

wiseodd commented Apr 18, 2024

f-dangel left a comment

wiseodd commented Apr 19, 2024

wiseodd commented Apr 19, 2024

f-dangel commented Apr 22, 2024

runame left a comment

Support arbitrary X in data #100

Support arbitrary X in data #100

Conversation

wiseodd commented Apr 2, 2024

wiseodd commented Apr 2, 2024

coveralls commented Apr 5, 2024 • edited Loading

Pull Request Test Coverage Report for Build 8791845160

Warning: This coverage report may be inaccurate.

Details

💛 - Coveralls

wiseodd commented Apr 6, 2024

wiseodd commented Apr 7, 2024

f-dangel left a comment

Choose a reason for hiding this comment

runame left a comment

Choose a reason for hiding this comment

runame commented Apr 10, 2024

wiseodd commented Apr 18, 2024

f-dangel left a comment

Choose a reason for hiding this comment

wiseodd commented Apr 19, 2024

wiseodd commented Apr 19, 2024

f-dangel commented Apr 22, 2024

runame left a comment

Choose a reason for hiding this comment

Support arbitrary `X` in `data` #100

Support arbitrary `X` in `data` #100

coveralls commented Apr 5, 2024 •

edited

Loading