Forget function #220

jsuit · 2016-04-23T03:54:33Z

I have some questions about forget.

What exactly does forget do?
When should you call it?
2.a When should you call it when you are using a LSTM?
If you call forget, should you also call recycle?

Here's a test case:
I have a tensor T of size n (batchsize) x 50, each entry represents the indx to lookup a word. Let this represent a batch of size n with 50 timesteps. If I have a LSTM in my model, do (or should) I call forget after I give my model T. I noticed in the language model example, forget was never called if you were using a LSTM.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Forget function #220

Forget function #220

jsuit commented Apr 23, 2016

Forget function #220

Forget function #220

Comments

jsuit commented Apr 23, 2016