[WIP] added initial states to RNN and LSTM layers #233

Closed · wants to merge 1 commit
Conversation

@fdlm (Contributor) commented Dec 2, 2016

This is a first implementation of initial states for RNN and LSTM layers. Needs some testing. Addresses #230.
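
For context, a minimal usage sketch of what this change is meant to enable; the module path, the constructor signature and the `hid_init` keyword are assumptions based on the diff below, not a final API:

```python
import numpy as np
from madmom.ml.nn.layers import RecurrentLayer

num_inputs, num_hiddens = 4, 3
layer = RecurrentLayer(
    weights=np.random.randn(num_inputs, num_hiddens),
    bias=np.zeros(num_hiddens),
    recurrent_weights=np.random.randn(num_hiddens, num_hiddens),
    activation_fn=np.tanh,
    # learned initial state of the hidden units instead of implicit zeros
    hid_init=np.random.randn(num_hiddens),
)
out = layer.activate(np.random.randn(10, num_inputs))
```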

@superbock (Collaborator) left a comment

Although the proposed changes will work as they are now, I think we should address this together with #185, since we need to store the previous outputs and states anyway. These could then be initialised accordingly.

@@ -98,12 +98,16 @@ class RecurrentLayer(FeedForwardLayer):
        Recurrent weights.
    activation_fn : numpy ufunc
        Activation function.
    hid_init : numpy array, shape (), optional
        Initial state of hidden units.
@superbock (Collaborator)

I'm not really happy with the name hid_init; I'd prefer simply init, or -- if that is too generic -- hidden_init.

@@ -125,6 +129,9 @@ def activate(self, data):
        return super(RecurrentLayer, self).activate(data)
    # weight input and add bias
    out = np.dot(data, self.weights) + self.bias
    # if we have a pre-initialised hidden state, add it
    if self.hid_init is not None:
        out[0] += np.dot(self.hid_init, self.recurrent_weights)
@superbock (Collaborator)

I think the logic here should be refactored.

Since we have to keep the last output anyway for streaming mode, we might just save out as self.out to be able to access it in the next step. The initialisation of self.out could then be done in __init__ with the given (learned) value.

The logic in the for loop below has to be changed to always access self.out -- or, in case of block-wise processing, the last item thereof.
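
To make this concrete, here is a minimal, self-contained sketch of that idea (not the actual madmom classes; the init argument and the reset() helper are assumptions) where self.out carries the hidden state across steps and blocks:

```python
import numpy as np


class RecurrentLayerSketch(object):
    """Minimal sketch of a recurrent layer that keeps its last output."""

    def __init__(self, weights, bias, recurrent_weights,
                 activation_fn=np.tanh, init=None):
        self.weights = weights
        self.bias = bias
        self.recurrent_weights = recurrent_weights
        self.activation_fn = activation_fn
        # the (possibly learned) initial hidden state; used as the
        # "previous output" before the very first step
        self.init = np.zeros(len(bias)) if init is None else init
        self.out = self.init

    def activate(self, data):
        # weight the input and add the bias
        out = np.dot(data, self.weights) + self.bias
        # loop over the time steps, always feeding back self.out, so that
        # block-wise (streaming) processing continues where the previous
        # block stopped
        for i in range(len(data)):
            out[i] = self.activation_fn(
                out[i] + np.dot(self.out, self.recurrent_weights))
            self.out = out[i]
        return out

    def reset(self):
        # restore the initial hidden state, e.g. before a new sequence
        self.out = self.init
```

With such a layer, block-wise processing of a long sequence is just repeated calls to activate(), and the learned initial value only influences the first block after a reset().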

    self.input_gate = input_gate
    self.forget_gate = forget_gate
    self.cell = cell
    self.output_gate = output_gate
    self.activation_fn = activation_fn
    self.out_init = out_init
    self.state_init = state_init
@superbock (Collaborator)

As above, I'm not really happy with the names here.
Also, for streaming mode, the output of the previous step must be accessible anyway, e.g. as self.out.

For the state_init I am thinking of moving it to the cell as init and storing the cell's state there -- this is basically what it is. This needs some changes in the activate() method, but it is the cleaner way to solve this (IMO).
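
A rough sketch of that idea (simplified, self-contained classes, not madmom's actual Gate/Cell API; in particular the activate() signatures are assumptions): the cell owns init and state, while the layer only keeps the previous output:

```python
import numpy as np


class CellSketch(object):
    """Simplified cell/gate: the cell keeps its own (initial) state."""

    def __init__(self, weights, bias, recurrent_weights,
                 activation_fn=np.tanh, init=None):
        self.weights = weights
        self.bias = bias
        self.recurrent_weights = recurrent_weights
        self.activation_fn = activation_fn
        # the (possibly learned) initial cell state lives with the cell
        self.init = np.zeros(len(bias)) if init is None else init
        self.state = self.init

    def activate(self, data, prev_out):
        # weight the input, add the bias and the weighted previous output
        return self.activation_fn(np.dot(data, self.weights) + self.bias +
                                  np.dot(prev_out, self.recurrent_weights))


class LSTMLayerSketch(object):
    """Simplified LSTM layer: keeps only the previous output itself."""

    def __init__(self, input_gate, forget_gate, cell, output_gate,
                 activation_fn=np.tanh, init=None):
        # gates would normally be CellSketch instances with a sigmoid
        # activation_fn; the cell uses tanh
        self.input_gate = input_gate
        self.forget_gate = forget_gate
        self.cell = cell
        self.output_gate = output_gate
        self.activation_fn = activation_fn
        # previous output, initialised with the (possibly learned) value
        self.init = np.zeros(len(cell.bias)) if init is None else init
        self.out = self.init

    def activate(self, data):
        out = np.zeros((len(data), len(self.out)))
        for i, data_t in enumerate(data):
            ig = self.input_gate.activate(data_t, self.out)
            fg = self.forget_gate.activate(data_t, self.out)
            # the cell state is stored on the cell, not on the layer
            self.cell.state = (fg * self.cell.state +
                               ig * self.cell.activate(data_t, self.out))
            og = self.output_gate.activate(data_t, self.out)
            self.out = og * self.activation_fn(self.cell.state)
            out[i] = self.out
        return out
```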

@fdlm (Contributor, Author) commented Dec 3, 2016

Since some of your proposed changes require more refactoring, I guess it would really be better to do this together with #185. It will be less work in total.

@superbock (Collaborator)

Yes, I will work on this next week, since we need NNs in #185 anyway.

@superbock (Collaborator)

Please see #235 for my first draft. If that PR addresses all your points, we can close this PR.

@superbock (Collaborator)

Closing, since #243 is merged

superbock closed this on Jan 20, 2017
superbock deleted the rnn_hid_init branch on Jan 20, 2017