
Deep learning #1025

Closed
wants to merge 4 commits

Conversation

adriangb (Contributor)

@adriangb commented Feb 24, 2020

What does this PR do?

Attempting to revive #895 while expanding upon it a little to allow for more general integration of Keras models into TPOT.

The main focus is basic sequential models (tensorflow.keras.models.Sequential).

This PR proposes a method of wrapping them to allow parametric generation of deep learning models in TPOT. Example classes are created for a classifier, a regressor and a transformer (the latter being the original intent of #895).

This is a very rough proof of concept, but I believe that the versatile functionality of tensorflow.keras will allow for fancy stuff like callbacks to be integrated into TPOT's logging, etc.
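To make the idea concrete, here is a minimal sketch of the kind of wrapping described above. Only the DeepLearningClassifier and KerasClassifier names appear in the diff; the build function, its parameters, and the defaults below are illustrative assumptions, not the actual code in deep_learning.py.

```python
# Illustrative sketch only: parameter names and defaults are assumptions,
# not the implementation in deep_learning.py.
from tensorflow.keras.layers import Dense
from tensorflow.keras.models import Sequential
from tensorflow.keras.wrappers.scikit_learn import KerasClassifier


def _build_model(layer_sizes=(100,), n_features=20, n_classes=2):
    """Build a fully connected Sequential model from a tuple of layer sizes."""
    model = Sequential()
    model.add(Dense(layer_sizes[0], activation="relu", input_shape=(n_features,)))
    for size in layer_sizes[1:]:
        model.add(Dense(size, activation="relu"))
    model.add(Dense(n_classes, activation="softmax"))
    model.compile(loss="categorical_crossentropy", optimizer="adam")
    return model


class DeepLearningClassifier(KerasClassifier):
    """scikit-learn style classifier backed by a parametrically built Sequential model."""

    def __init__(self, layer_sizes=(100,), n_features=20, n_classes=2,
                 epochs=10, batch_size=32, verbose=0):
        # KerasClassifier routes these keyword arguments either to the build
        # function or to model.fit(), depending on their names.
        super().__init__(build_fn=_build_model, layer_sizes=layer_sizes,
                         n_features=n_features, n_classes=n_classes,
                         epochs=epochs, batch_size=batch_size, verbose=verbose)
```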

Where should the reviewer start?

Please start by looking over the general implementation and API in deep_learning.py.
I realize that extensive documentation is needed; I will write it at some point, but I'm hoping to get some feedback before I spend a bunch of time on that.

How should this PR be tested?

I wrote some basic tests; more are needed.

I also think I found a bug in tensorflow.keras.utils.generic_utils.has_arg which would be a blocker for this. I submitted a PR (tensorflow/tensorflow#37004) on the tensorflow repo, we'll see where that goes. If tests on this PR fail, that's probably why.

I also ran some basic tests comparing the performance of these parametrically generated estimators against some basic Keras tutorials I found; it's about the same, if not better.

What are the relevant issues?

#895 , #809

PS: @chappers, I made a new PR because it's a new branch on my fork. I squashed your commits into a single commit for now; I will make sure you are credited as a co-author if this gets merged.

@weixuanfu (Contributor) left a comment

Thank you for submitting this PR. I like this feature, and I think it should be merged into TPOT if the performance in computational time and accuracy/score is competitive with other scikit-learn estimators in the default TPOT configuration (e.g. vs. RandomForestEstimator). Please provide more details about the performance test (with/without GPU) and also the search space of hyperparameters. Regarding the callbacks functionality from tensorflow.keras, it can be integrated later.
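For reference, a hypothetical sketch of how such a search space might be exposed through a TPOT config dictionary; the registration path and parameter grid below are assumptions for illustration, not the configuration actually proposed in this PR.

```python
from tpot import TPOTClassifier

deep_learning_config = {
    # Hypothetical registration path for the class proposed in this PR.
    "tpot.builtins.DeepLearningClassifier": {
        "layer_sizes": [(20,), (50, 20), (100, 50, 20)],
        "epochs": [10, 50, 100],
        "batch_size": [16, 32, 64],
    },
}

# A custom config_dict like this restricts the search to the new estimator;
# merging it into TPOT's default classifier configuration would instead let it
# compete directly against the default scikit-learn estimators.
tpot = TPOTClassifier(generations=5, population_size=20, config_dict=deep_learning_config)
```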

@@ -0,0 +1,38 @@
import nose
from sklearn.datasets import make_classification, make_regression
from sklearn.neural_network import MLPClassifier, MLPRegressor
@weixuanfu (Contributor)

Please remove this line, since these two imports do not seem to be used in the script.

@weixuanfu (Contributor)

Please also fix the failing unit tests in this test script.

)
return model

class DeepLearningClassifier(KerasClassifier):
@weixuanfu (Contributor)

Based on the error in AppVeyor, it failed to import tpot because tensorflow is not installed in the environment. Maybe use a HAS_TENSORFLOW flag to determine whether those new classes should be created.
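A minimal sketch of the guard suggested here; HAS_TENSORFLOW is the flag name from the comment above, while everything else is an assumption about how deep_learning.py might apply it.

```python
# Guard the optional tensorflow dependency so that importing tpot still works
# when tensorflow is not installed.
try:
    from tensorflow.keras.wrappers.scikit_learn import KerasClassifier
    HAS_TENSORFLOW = True
except ImportError:
    HAS_TENSORFLOW = False

if HAS_TENSORFLOW:
    class DeepLearningClassifier(KerasClassifier):
        # ... parametric model construction as proposed in this PR ...
        pass
```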

layer_sizes = [20, 100, 50, 20, 60, 100]
X, y = make_classification(random_state=1)

def check(X, X_transformed, embedding_layer_size):
@weixuanfu (Contributor)

This function is duplicated; please keep only one copy.
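For what it's worth, a guess at what the single retained helper might assert, based only on the signature shown in the quoted test; the real checks may differ.

```python
def check(X, X_transformed, embedding_layer_size):
    """Verify the transformer maps each sample to a vector of the embedding size."""
    assert X_transformed.shape == (X.shape[0], embedding_layer_size)
```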

@adriangb (Contributor, Author) commented Feb 24, 2020

Great, thank you for taking a look and pointing some things out. I'll do the following three things as next steps:

  1. Fix CI issues
  2. Fix comments made above
  3. Compare performance and accuracy. The primary comparison will be to sklearn.neural_network.MLPClassifier. I will compare accuracy to other models, but I fully expect this to be a lot slower than classic classifiers. I will probably use this scikit-learn doc as a baseline for comparison.

Quick update: I'm realizing that in order to do a 'fair' comparison, I'm going to need to tune all of the default parameters for these networks to be as similar as possible to MLPClassifier's. From initial testing with this dataset, these networks can meet or exceed the accuracy of MLPClassifier, but the runtime is considerably worse. This is all using tensorflow in CPU-only mode on WSL. I'm sure using the GPU version on bare metal would already make this at least as fast as MLPClassifier, but I think tuning those parameters as I mentioned should allow the CPU version to be about as fast.
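As a rough sketch of the comparison being set up here (the DeepLearningClassifier below is the hypothetical wrapper sketched earlier in this thread, and the hyperparameters are illustrative, not the values actually benchmarked):

```python
from time import perf_counter

from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier

X, y = make_classification(n_samples=1000, n_features=20, random_state=1)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=1)

estimators = {
    "MLPClassifier": MLPClassifier(hidden_layer_sizes=(100,), max_iter=200, random_state=1),
    # Hypothetical wrapper from the earlier sketch; epochs chosen to roughly mirror max_iter.
    "DeepLearningClassifier": DeepLearningClassifier(layer_sizes=(100,), epochs=200),
}

for name, est in estimators.items():
    start = perf_counter()
    est.fit(X_train, y_train)
    score = est.score(X_test, y_test)
    print(f"{name}: accuracy={score:.3f}, fit+score time={perf_counter() - start:.1f}s")
```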

@adriangb (Contributor, Author)

Another update. Although I was able to hack this together to make a DeepLearningRegressor class with the exact same API as MLPClassifier, there are some bugs, and it uses somewhat ugly mixin inheritance patterns. Looking around the tf.keras source, I found this PR, which I think would make things a lot cleaner and easier on the TPOT side: tensorflow/tensorflow#32533

Hence, I think I am going to focus efforts on reviving that PR and will pause work on this one until then.

@adriangb closed this Feb 25, 2020
@adriangb (Contributor, Author)

Another comment. Testing performance, I found that for small datasets on my laptop's CPU, tensorflow/keras was considerably slower than sklearn.neural_network. I think that running on any decent GPU, distributed, or maybe even a workstation CPU, tensorflow would be much faster, but I really don't have the ability to test that.

@adriangb (Contributor, Author) commented Sep 4, 2020
