Param info util: dict and string summary #288

rolandgvc · 2020-05-27T14:52:20Z

This is a proposal for two parameter summary utils, one that returns the information as a dict and one as a string.

Marvin182

Very cool. Here is my old internal version (I had it for TF and changed it to support TF and JAX):
https://gist.github.com/Marvin182/4c87f14b01aa1bf481d312e36e32332e
Yours look much nicer though.

flax/nn/utils.py

mohitreddy1996 · 2020-05-27T16:55:27Z

This is awesome!

I have a small request. Would it be possible to refactor out logic which return the number/count of parameters (something similar to @Marvin182 's implemented method 'count_parameters' in https://gist.github.com/Marvin182/4c87f14b01aa1bf481d312e36e32332e).

This would be helpful in testing model definition in examples. I currently have TODOs in #287 and #289

rolandgvc · 2020-05-27T17:03:11Z

This is awesome!

I have a small request. Would it be possible to refactor out logic which return the number/count of parameters (something similar to @Marvin182 's implemented method 'count_parameters' in https://gist.github.com/Marvin182/4c87f14b01aa1bf481d312e36e32332e).

This would be helpful in testing model definition in examples. I currently have TODOs in #287 and #289

Thanks! Avital wanted that as a HOWTO, which I have here #277. I'll consult with him :)

codecov-commenter · 2020-05-27T18:06:47Z

Codecov Report

Merging #288 into master will decrease coverage by 1.75%.
The diff coverage is 10.52%.

@@            Coverage Diff             @@
##           master     #288      +/-   ##
==========================================
- Coverage   79.39%   77.63%   -1.76%     
==========================================
  Files          34       34              
  Lines        2252     2312      +60     
==========================================
+ Hits         1788     1795       +7     
- Misses        464      517      +53

Impacted Files	Coverage Δ
flax/nn/utils.py	`51.40% <10.52%> (-46.60%)`	⬇️
flax/metrics/tensorboard.py	`91.07% <0.00%> (-3.27%)`	⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 3bc5289...e41e180. Read the comment docs.

flax/nn/utils.py

…, sorting parameters by layer index, support for dynamic sizing and nested dicts

avital · 2020-06-09T12:23:57Z

flax/nn/utils.py

+
+def _name_idx(name: str):
+  """Returns the layer index of the parameter name."""
+  index = name[name.find('_') + 1 : name.find('/')]


cc @levskaya @jheek weird stuff that happens when we autoincrement layer names as strings -- "everything becomes part of the API"

I think this is bad style. New layers might not follow this convention and the param info shouldn't rely on it. I actually have good experience with just sorting alphabetical by name.

@rolandgvc - I wouldn't try to sort parameters by the "layer order", in general the "order" of layers is a partial order given that arbitrary dataflow can happen in complex modules. Also this ordering information is lost for manually named layers.

@avital - the alternative to names is an opaque tree-structure-based serialization (which we used in trax) and it is a complete nightmare to work with when debugging. Pragmatically, I am happy to suffer Hyrum's law if it enables vastly more pleasant debugging. I want a human-navigable "at rest" representation of models.

I believe we can get "the best of both worlds" with easy
introspection while allowing people to look through ordered lists when
those are semantically meaningful.

In the meanwhile I propose we add a strong TODO comment here saying that
we should use this as a use-case for the ongoing API rewrite considerations.

This could also be resolved by having the params as OderedDicts instead so they can keep the order of the applied modules by default.

Let's just add a comment for now and I think we can merge this.

avital · 2020-06-11T07:32:21Z

@levskaya I believe we can get "the best of both worlds" with easy introspection while allowing people to look through ordered lists when those are semantically meaningful.

…

On Wed, Jun 10, 2020 at 7:25 AM Anselm Levskaya ***@***.***> wrote: ***@***.**** commented on this pull request. ------------------------------ In flax/nn/utils.py <#288 (comment)>: > + +def flatten_dict(input_dict: Dict[str, Any], prefix: str = "") -> Dict[str, Any]: + """Flattens the keys of a nested dictionary.""" + output_dict = {} + for key, value in input_dict.items(): + nested_key = "{}/{}".format(prefix, key) if prefix else key + if isinstance(value, dict): + output_dict.update(flatten_dict(value, prefix=nested_key)) + else: + output_dict[nested_key] = value + return output_dict + + +def _name_idx(name: str): + """Returns the layer index of the parameter name.""" + index = name[name.find('_') + 1 : name.find('/')] @rolandgvc <https://github.com/rolandgvc> - I wouldn't try to sort parameters by the "layer order", in general the "order" of layers is a partial order given that arbitrary dataflow can happen in complex modules. Also this ordering information is lost for manually named layers. @avital <https://github.com/avital> - the alternative to names is an opaque tree-structure-based serialization (which we used in trax) and it is a complete nightmare to work with when debugging. Pragmatically, I am happy to suffer Hyrum's law if it enables vastly more pleasant debugging. I want a human-navigable "at rest" representation of models. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#288 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAAJFUSIZKXMTGIYSMB5A4LRV4KLFANCNFSM4NMGXH4Q> .

avital · 2021-02-13T13:05:26Z

See CLU (Common Loop Utils) for another implementation: https://github.com/google/CommonLoopUtils/blob/master/clu/parameter_overview.py

cgarciae · 2022-01-18T23:15:42Z

Hey! If adding a dependency is not too much of a burden, maybe we could leverage the rich.tables, they provide a lot of format options, colors, and other stuff like properly handing multi-line rows.

Here is an example of rich in action:

avital · 2022-01-20T15:58:30Z

@cgarciae that looks really good. But I wonder if we can/should separate the needs: Part 1 is a function that give a Flax module (and variables and inputs?) generates some simple dict output that describes the module heirarchy. Then part 2 is a small piece of code that reads this dict and calls into rich.tables. Then we could replace part 2 with any other renderer. WDYT? We could even put the "renderer" part in a separate pip package to remove the dependency of flax on it.

It's also worth comparing and contrasting with https://dm-haiku.readthedocs.io/en/latest/notebooks/visualization.html

cgarciae · 2022-01-24T17:36:08Z

I think splitting the functionality makes a lot of sense, the current proposed implementation in the PR more or less does this via the separation between param_info and show_param_info. I think we could expand the current idea with the following:

[optional] Rename param_info to module_info, this would take in a Module, variables, sample inputs, and would use capture_intermediates to add input/output information about the layers to the dict structure.
Create a render_info function that could render the structure, internally it would use rich.tables.
show_{module,param}_info could have a render_fn that defaults to render_info, it would simply call {module,param}_info and subsequently call render_fn. Users could provide their own render_fn.

Later on render_info could be extracted if deemed generally useful.

avital · 2022-02-03T15:15:32Z

Closing in favor of #1844

added model_summary() in nn.utils

2757377

googlebot added the cla: yes label May 27, 2020

Marvin182 requested changes May 27, 2020

View reviewed changes

rolandgvc added 2 commits May 27, 2020 18:58

namedtuple and other minor changes (from suggestions)

bd982d5

minor variable change

9e6abb1

avital reviewed May 28, 2020

View reviewed changes

flax/nn/utils.py Outdated Show resolved Hide resolved

avital reviewed May 28, 2020

View reviewed changes

flax/nn/utils.py Outdated Show resolved Hide resolved

avital reviewed May 28, 2020

View reviewed changes

flax/nn/utils.py Outdated Show resolved Hide resolved

avital reviewed May 28, 2020

View reviewed changes

flax/nn/utils.py Outdated Show resolved Hide resolved

marcvanzee assigned avital May 29, 2020

rolandgvc added 3 commits May 30, 2020 17:26

added support for nested dicts, return params_info as a dict

d70fd07

added utils for getting param info as dict or as string summarization…

b5bd4e9

…, sorting parameters by layer index, support for dynamic sizing and nested dicts

minor change

e41e180

rolandgvc changed the title ~~model_summary util~~ Param info util: dict and string summary Jun 4, 2020

rolandgvc mentioned this pull request Jun 4, 2020

HOWTO: Params info #311

Closed

avital reviewed Jun 9, 2020

View reviewed changes

avital added this to the Design notes and patterns milestone Dec 12, 2020

avital modified the milestones: Design notes, Patterns/HOWTOs Dec 29, 2020

avital removed their assignment Feb 13, 2021

cgarciae mentioned this pull request Feb 1, 2022

Module tabulation #1844

Closed

5 tasks

avital closed this Feb 3, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Param info util: dict and string summary #288

Param info util: dict and string summary #288

rolandgvc commented May 27, 2020 •

edited

Loading

Marvin182 left a comment

mohitreddy1996 commented May 27, 2020

rolandgvc commented May 27, 2020

codecov-commenter commented May 27, 2020 •

edited

Loading

avital Jun 9, 2020

Marvin182 Jun 9, 2020

levskaya Jun 10, 2020 •

edited

Loading

avital Jun 12, 2020

rolandgvc Jun 12, 2020

avital Aug 18, 2020

avital commented Jun 11, 2020 via email

avital commented Feb 13, 2021

cgarciae commented Jan 18, 2022

avital commented Jan 20, 2022

cgarciae commented Jan 24, 2022

avital commented Feb 3, 2022

Param info util: dict and string summary #288

Param info util: dict and string summary #288

Conversation

rolandgvc commented May 27, 2020 • edited Loading

Marvin182 left a comment

Choose a reason for hiding this comment

mohitreddy1996 commented May 27, 2020

rolandgvc commented May 27, 2020

codecov-commenter commented May 27, 2020 • edited Loading

Codecov Report

avital Jun 9, 2020

Choose a reason for hiding this comment

Marvin182 Jun 9, 2020

Choose a reason for hiding this comment

levskaya Jun 10, 2020 • edited Loading

Choose a reason for hiding this comment

avital Jun 12, 2020

Choose a reason for hiding this comment

rolandgvc Jun 12, 2020

Choose a reason for hiding this comment

avital Aug 18, 2020

Choose a reason for hiding this comment

avital commented Jun 11, 2020 via email

avital commented Feb 13, 2021

cgarciae commented Jan 18, 2022

avital commented Jan 20, 2022

cgarciae commented Jan 24, 2022

avital commented Feb 3, 2022

rolandgvc commented May 27, 2020 •

edited

Loading

codecov-commenter commented May 27, 2020 •

edited

Loading

levskaya Jun 10, 2020 •

edited

Loading