-
Notifications
You must be signed in to change notification settings - Fork 23
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Model specific custom_objects for CremaModel #27
Comments
It's a good question. Really this all comes down to limitations of keras (and, to some extent, pickle/json). My basic position is that we should use custom objects sparingly, and try to keep them as general as possible to maximize reuse -- squeezelayer is a pretty good example of that I think, so would autopool. For that reason, I prefer coding them directly into the layers module, but i'm open to counter-arguments. Did you have something specific in mind? |
I guess I'm not sure what the idea behind Crema is. Is it supposed to be scaffolding that other people import and build models on top of? Or is it meant to have all of the models self-contained and integrated into the master branch? If it's the latter, then I understand why you'd have things written into the For For
Honestly, I'm not sure why |
Primarily the latter.
I suppose we could put some helpers in to make the monkeypatching less problematic, but then i'd worry that some models wouldn't be able to import without run-time patching in advance, and that seems bad. |
That's why I think having them pickled would be the best way. Then whoever designed the model has to make sure that any custom objects are importable/available before calling It also means that if a model isn't being used, then dependencies can be considered optional which can make things lighter weight |
In
CremaModel._instantiate
, you use layers defined inlayers.py
as available custom objects when loading a model. Should we make it easier to add arbitrary custom objects?Two quick ideas that I had:
we could use one or the other, or both. I'm partial to the second one tho because it doesn't require subclassing.
The text was updated successfully, but these errors were encountered: