support recurrent with no states. #1113

Beronx86 · 2016-06-08T06:29:31Z

The recurrent wrapper does not support loop with no states. But this kind of loop may be useful. So I modified the codes.

Fixes #1112

dwf · 2016-06-08T06:31:42Z

I'll let someone more familiar with recurrent do the review but I know they will ask for you to add a test. 😄

Beronx86 · 2016-06-08T06:34:26Z

Ok, I'll write the test case.

rizar · 2016-06-11T20:17:01Z

blocks/bricks/recurrent/base.py

+                # Ensure that all initial states are available.
+                initial_states = brick.initial_states(batch_size, as_dict=True,
+                                                      *args, **kwargs)
+                for state_name in application.states:


It seems like the code starting from this line can be moved out of the if clause, and the else part is not really necessary. Right now we pay a high price of having an additional level of indentation for this new feature, and it would be great to keep the complexity of the code down.

I suggest to add the line before

else: initial_states = OrderedDict()

Beronx86 · 2016-06-12T13:13:55Z

In the original code, the recurrent method would require initial_states function whether there is recurrent states or not. If the recurrent states is left empty, an error would occur at the time of visiting initial_states function.

Beronx86 · 2016-06-12T14:03:38Z

You may produce the error with the following code. The error occurs when the class does not contain a recurrent method named apply

import numpy
import theano
from numpy.testing import assert_allclose
from theano import tensor

from blocks.bricks import Brick
from blocks.bricks.recurrent import BaseRecurrent, recurrent
# from recurrent import recurrent


class RecurrentWrapperNoStatesClass(BaseRecurrent):
    def __init__(self, dim, **kwargs):
        super(RecurrentWrapperNoStatesClass, self).__init__(**kwargs)
        self.dim = dim

    def get_dim(self, name):
        if name in ['inputs', 'outputs', 'outputs_2']:
            return self.dim
        if name == 'mask':
            return 0
        return super(RecurrentWrapperNoStatesClass, self).get_dim(name)

    @recurrent(sequences=['inputs', 'mask'], states=[],
               outputs=['outputs', 'outputs_2'], contexts=[])
    def apply2(self, inputs, mask=None):
        outputs = inputs * 10
        outputs_2 = tensor.sqr(inputs)
        if mask:
            outputs *= mask
            outputs_2 *= mask
        return outputs, outputs_2


if __name__ == '__main__':
    recurrent_examples = RecurrentWrapperNoStatesClass(
        dim=11, name='test_example')

    X = tensor.tensor3('X')
    out, out_2 = recurrent_examples.apply2(inputs=X, mask=None)

    x_val = numpy.random.uniform(size=(5, 1, 1))
    x_val = numpy.asarray(x_val, dtype=theano.config.floatX)

    out_eval = out.eval({X: x_val})
    out_2_eval = out_2.eval({X: x_val})

    assert_allclose(x_val * 10, out_eval)
    assert_allclose(numpy.square(x_val), out_2_eval)

dmitriy-serdyuk · 2016-06-13T15:13:09Z

blocks/bricks/recurrent/base.py

+                                    state_name, brick.name))
+                states_given = dict_subset(kwargs, application.states)
+            else:
+                states_given = {}


If I remember right, it should be an OrderedDict.

Since states_given in the else clause is never used, it does not matter whether it is a OrderedDict, dict or None.

Beronx86 · 2016-06-14T04:51:45Z

blocks/graph/__init__.py

@@ -104,7 +104,15 @@ def auxiliary_variables(self):
    @property
    def scan_variables(self):
        """Variables of Scan ops."""
-        return list(chain(*[g.variables for g in self._scan_graphs]))


This code supposed that no recurrent class is nested. #1115

rizar · 2016-06-14T05:04:49Z

blocks/graph/__init__.py

@@ -104,7 +104,15 @@ def auxiliary_variables(self):
    @property
    def scan_variables(self):
        """Variables of Scan ops."""
-        return list(chain(*[g.variables for g in self._scan_graphs]))
+        # BFS
+        scan_graphs = self._scan_graphs


You probably want to copy scan_graphs here, like e.g. scan_graphs = list(self._scan_graphs).

…bles

dmitriy-serdyuk · 2016-06-14T17:49:35Z

blocks/bricks/recurrent/base.py

@@ -46,6 +46,9 @@ def initial_states(self, batch_size, *args, **kwargs):
            The keyword arguments of the application call.

        """
+        if not hasattr(self, 'apply') or not self.apply.states:
+            return
+


Can you explain how it works? I cannot immediately see it.

when some subclass call the default initial_states function in the BaseRecurrent class. This line would check whether it is necessary to return the initial states. If the subclass does not have an apply method or its apply method does not contain states, the initial_states would not return anything.
This line would make it to support recurrent class with no apply function or with no states.

Why do you want to have a class without apply? It's a mistake if a user forgot to define apply and the best is to crash soon.

In a case if apply.states is empty, initial_states would return an empty list before this change, why is it wrong?

If this line is added, the above code, which contains a recurrent brick with no apply method, would run well.
But, you are right about the apply method. The Brick subclass should follow some design rules. The problem is no code checks whether there is an apply method in a Brick subclass at present.

@Beronx86 , checking apply.states in BaseRecurrent.initial_states is not a solution. There are quite a few places in Blocks-dependent code where initial_states method is overloaded. Instead, like in your previous solution, initial_states should not be called if application does not have states. Can you please revert back to the previous version of your fix?

@rizar I think this check could be carried out in Brick.__init__ method. So we can make sure all Brick subclasses contain apply methods. I reverted back the changes in BaseRecurrent.

…brick

…ld not pass this case.

rizar · 2016-06-21T04:26:43Z

I don't understand, now you have removed your fix, and it is again not supported to have no states property. Why not just implemented like you did in the first place, but with more gentle changes to the code as I suggested?

support recurrent with no states.

2b45c97

dwf added the needs tests label Jun 8, 2016

recurrent wrapper without states test case

2fdba23

dwf removed the needs tests label Jun 8, 2016

rizar reviewed Jun 11, 2016
View reviewed changes

dmitriy-serdyuk reviewed Jun 13, 2016
View reviewed changes

recurrent wrapper without states test case

69b1846

Beronx86 reviewed Jun 14, 2016
View reviewed changes

Beronx86 mentioned this pull request Jun 14, 2016

Bug in nested recurrent model #1115

Open

rizar reviewed Jun 14, 2016
View reviewed changes

make recurrent with no states easier. test case for nested scan_varia…

0d49671

…bles

dmitriy-serdyuk reviewed Jun 14, 2016
View reviewed changes

Beronx86 added 3 commits June 15, 2016 14:49

a test case for nested recurrent model. assert there is only one top_…

ba6f6ae

…brick

revert back the change in BaseRecurrent.initial_states

c5b015f

remove recurrent_with_no_states test case, since the above change wou…

0de2dc7

…ld not pass this case.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

support recurrent with no states. #1113

support recurrent with no states. #1113

Beronx86 commented Jun 8, 2016 •

edited by dwf

Loading

dwf commented Jun 8, 2016

Beronx86 commented Jun 8, 2016

rizar Jun 11, 2016

dmitriy-serdyuk Jun 13, 2016

Beronx86 commented Jun 12, 2016 •

edited

Loading

Beronx86 commented Jun 12, 2016

dmitriy-serdyuk Jun 13, 2016

Beronx86 Jun 14, 2016

Beronx86 Jun 14, 2016

rizar Jun 14, 2016

dmitriy-serdyuk Jun 14, 2016

Beronx86 Jun 15, 2016

dmitriy-serdyuk Jun 15, 2016

Beronx86 Jun 16, 2016 •

edited

Loading

rizar Jun 19, 2016

Beronx86 Jun 20, 2016 •

edited

Loading

rizar commented Jun 21, 2016

support recurrent with no states. #1113

Are you sure you want to change the base?

support recurrent with no states. #1113

Conversation

Beronx86 commented Jun 8, 2016 • edited by dwf Loading

dwf commented Jun 8, 2016

Beronx86 commented Jun 8, 2016

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Beronx86 commented Jun 12, 2016 • edited Loading

Beronx86 commented Jun 12, 2016

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Beronx86 Jun 16, 2016 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Beronx86 Jun 20, 2016 • edited Loading

Choose a reason for hiding this comment

rizar commented Jun 21, 2016

Beronx86 commented Jun 8, 2016 •

edited by dwf

Loading

Beronx86 commented Jun 12, 2016 •

edited

Loading

Beronx86 Jun 16, 2016 •

edited

Loading

Beronx86 Jun 20, 2016 •

edited

Loading