stateful RNNs #243
Conversation
Force-pushed 79bbafc to af11059
def __setstate__(self, state):
    # restore instance attributes
    self.__dict__.update(state)
    # TODO: old models do not have the online attribute, thus create it
Isn't this an explanation, not a "TODO"?
see below...
Looks nice overall :)
# TODO: old models do not have the online attribute, thus create it
# remove this initialisation code after updating the models
if not hasattr(self, 'online'):
    self.online = None
Why is it initialised with `None`, and not `False`?
The reason is that not all old models are "offline models"; some are "online models". Thus this is a TODO and `None` is used as the default. After we update at least the online models, we can set it to `False`; after updating all models (which is unlikely to happen in the near future) this can be removed completely. Updated the TODO accordingly.
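The backwards-compatibility pattern under discussion can be sketched in isolation like this (the class name `Processor` and the `name` attribute are hypothetical; only the `__setstate__` hook and the `online` attribute come from the diff above):

```python
import pickle

class Processor:
    def __init__(self):
        self.online = False

    def __setstate__(self, state):
        # restore instance attributes
        self.__dict__.update(state)
        # old pickles do not contain the `online` attribute; create it
        if not hasattr(self, 'online'):
            self.online = None

# simulate an "old" pickled instance that predates the attribute
old = Processor.__new__(Processor)
old.name = 'old-model'
restored = pickle.loads(pickle.dumps(old))
print(restored.online)  # None, the backwards-compatible default
```

Instances pickled after the attribute was introduced pass through `__setstate__` unchanged, since `hasattr(self, 'online')` is then true.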
return self.activate(*args)
return self.activate(*args, **kwargs)

def __getstate__(self):
IMHO, this shouldn't be in the `Layer` class. Every subclass of `Layer` should be responsible for ensuring that its state is not pickled. This does not lead to much code duplication: right now, only `RecurrentLayer` and `LSTMLayer` are stateful, and I think for `LSTMLayer` you'll have to overwrite `__getstate__` anyway, because you don't want to pickle the previous state and the cell state.
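The per-subclass approach the reviewer suggests could look roughly like this (a sketch under assumed names; `Layer`, `RecurrentLayer` and `_prev` appear in the PR, the constructor signature is illustrative):

```python
import pickle
import numpy as np

class Layer:
    """Generic network layer; knows nothing about pickled state."""
    def activate(self, data):
        raise NotImplementedError('must be implemented by subclass.')

class RecurrentLayer(Layer):
    def __init__(self, init):
        self.init = init    # initial hidden state, part of the model
        self._prev = init   # runtime hidden state, must not be pickled

    def __getstate__(self):
        state = self.__dict__.copy()
        state.pop('_prev', None)    # filter out the runtime state
        return state

    def __setstate__(self, state):
        self.__dict__.update(state)
        self._prev = self.init      # recreate the runtime state

layer = RecurrentLayer(np.zeros(3))
layer._prev = np.ones(3)  # pretend some data was processed
restored = pickle.loads(pickle.dumps(layer))
print(np.array_equal(restored._prev, restored.init))  # True
```

A stateful subclass then owns both halves of the contract: dropping its runtime state on pickling and re-initialising it on unpickling.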
@@ -44,16 +51,24 @@ def activate(self, data):
    """
    raise NotImplementedError('must be implemented by subclass.')

@staticmethod
Not sure this should be a static method: sure, it does not change the object state in the `Layer` class, but its functionality is intended to change the object state.
Yes, this was a normal method before, changed it because some checker criticised it. Will change it back.
"""
# reset previous time step to initial value
self._prev = self.init if init is None else init
`self._prev = init or self.init`?
👍
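One hedged caveat on this shorthand (my aside, not raised in the review): for NumPy array states, `init or self.init` is not equivalent to the explicit `None` check, because multi-element arrays have no defined truth value:

```python
import numpy as np

default = np.zeros(3)

def reset_explicit(init):
    # explicit check: works for any array, including all-zero ones
    return default if init is None else init

def reset_shorthand(init):
    # `or` evaluates bool(init), which raises for multi-element arrays
    return init or default

print(reset_explicit(np.ones(3)))  # [1. 1. 1.]
try:
    reset_shorthand(np.ones(3))
except ValueError:
    print('ambiguous truth value')
```

The shorthand is only safe if `init` is guaranteed to be `None` or a scalar.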
# add non-pickled attributes needed for stateful processing
self._prev = self.init
self._state = self.cell_init
Shouldn't `LSTMLayer` also have a `__getstate__` for filtering out `self._prev` and `self._state` for pickling?
@@ -413,7 +572,7 @@ def activate(self, data, reset_gate, prev):
    return self.activation_fn(out)
This functionality looks very similar to what the `Gate` class already provides. Do we really need a separate class for the GRU cell?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Unfortunately yes, since it is only similar...
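For context, a rough sketch of why the two are only similar (this uses one common GRU formulation; the function names and the exact placement of the reset gate are assumptions, not necessarily what this project implements):

```python
import numpy as np

def sigmoid(x):
    return 1. / (1. + np.exp(-x))

def gate(data, prev, weights, recurrent_weights, bias):
    # a generic gate: sigmoid over input and recurrent contributions
    return sigmoid(np.dot(data, weights) +
                   np.dot(prev, recurrent_weights) + bias)

def gru_cell(data, reset_gate, prev, weights, recurrent_weights, bias):
    # the GRU candidate state: the previous state is scaled by the
    # reset gate *before* the recurrent weights are applied, and tanh
    # replaces sigmoid; structurally close to a gate, but not identical
    return np.tanh(np.dot(data, weights) +
                   np.dot(reset_gate * prev, recurrent_weights) + bias)
```

The extra argument and the gating of `prev` inside the nonlinearity are what keep the cell from being expressed as a plain `Gate`.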
Force-pushed af11059 to d528b7e
Thanks, force pushed the requested changes.
self.init = init
# keep the state of the layer
self._prev = self.init
Missing a `__getstate__` here, because it's no longer in `Layer`. (I still think it's good to have it here instead of there.)
Force-pushed 51314d5 to 1ce9cf6
added initialisation of hidden states to layers; fixes #230; renamed GRU parameters to be consistent with all other layers
Force-pushed 1ce9cf6 to a92614d
This is the latest attempt to solve #230 and #234. It also partly addresses #185, at least the parts relevant to NNs.
Supersedes #233 and #235.