Teacher forcing for an LSTM network

Is there a way to implement teacher forcing for an LSTM network in MATLAB? Hopefully there is an option buried somewhere.

Answers (2)

Hi Philip,
This example shows how teacher forcing can be implemented with LSTMs in MATLAB: Sequence-to-Sequence Translation Using Attention.

1 Comment

Hi David,
Thank you for your help, but this is an attention model. I just need teacher forcing for a standard LSTM model.


David Willingham on 24 May 2022
Edited: David Willingham on 24 May 2022
Please see the attached example, trainLSTM_seq2seq. Is this what you were looking for, i.e. an example for a standard LSTM model?
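The attached file isn't reproduced here, but as a minimal sketch of how teacher forcing can be wired into a custom training loop for a plain LSTM (all names - TTrain, numClasses, numHiddenUnits and the hyperparameters - are placeholders, not taken from the attachment): at every time step the LSTM input is the one-hot ground truth from the previous step, never the network's own prediction.

numHiddenUnits = 128;
layers = [
    sequenceInputLayer(numClasses)
    lstmLayer(numHiddenUnits,'OutputMode','sequence')
    fullyConnectedLayer(numClasses)
    softmaxLayer];
net = dlnetwork(layerGraph(layers));

numEpochs = 70;
learnRate = 0.005;
trailingAvg = [];
trailingAvgSq = [];

for epoch = 1:numEpochs
    % Teacher forcing: shift the one-hot target sequence (C-by-B-by-T) by one
    % step so the input at time t is the true token at time t-1.
    X = dlarray(TTrain(:,:,1:end-1),'CBT');
    T = dlarray(TTrain(:,:,2:end),'CBT');

    [loss,gradients] = dlfeval(@modelLoss,net,X,T);
    [net,trailingAvg,trailingAvgSq] = adamupdate(net,gradients, ...
        trailingAvg,trailingAvgSq,epoch,learnRate);
end

function [loss,gradients] = modelLoss(net,X,T)
    % Forward pass on the teacher-forced inputs and cross-entropy loss
    % against the one-step-ahead targets.
    Y = forward(net,X);
    loss = crossentropy(Y,T);
    gradients = dlgradient(loss,net.Learnables);
end

At inference time the same dlnetwork is run closed-loop, feeding each prediction back in as the next input, which is where teacher-forced training is supposed to pay off.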

5 Comments

Hi David,
Thank you for your help. If you look at the example for a plain vanilla LSTM network: once the network is defined, the training options are specified:
maxEpochs = 70;
miniBatchSize = 27;
options = trainingOptions('adam', ...
    'ExecutionEnvironment','cpu', ...
    'MaxEpochs',maxEpochs, ...
    'MiniBatchSize',miniBatchSize, ...
    'GradientThreshold',1, ...
    'Verbose',false, ...
    'Plots','training-progress');
and training begins:
net = trainNetwork(XTrain,YTrain,layers,options);
In your example, it looks as though one has to yank all the complexity out of the trainNetwork function and write a training loop manually, which is not ideal. Is it not possible for MathWorks to include a training option that allows teacher forcing? I am trying to avoid writing low-level ad hoc code as much as possible.
Hi Philip,
The above example uses the custom training loop functionality in Deep Learning Toolbox. We currently don't have support for teacher forcing as an option in trainNetwork.
Hi David - I am looking at this in more detail. Would open-loop forecasting (https://uk.mathworks.com/help/deeplearning/ug/time-series-forecasting-using-deep-learning.html) not be the same as teacher forcing? Could we not use this approach for training instead?
If your problem is a time-series problem, then it possibly could be. Have you tried adapting this example to meet your use case? If so, is it getting the result you expect?
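As a rough sketch of that idea for a single-channel autoregressive series (data, layers, options and numSteps are placeholder names, following the structure of the linked example): training on one-step-shifted sequences means the true previous value is always the network's input during training - the open-loop / teacher-forcing condition - and predictions are only fed back closed-loop at forecast time.

% Training: the input at time t is the observed value at time t and the
% target is the observed value at time t+1, so the ground truth always
% drives the input.
XTrain = data(1:end-1);
YTrain = data(2:end);
net = trainNetwork(XTrain,YTrain,layers,options);

% Forecasting (closed loop): feed each prediction back in as the next input.
net = resetState(net);
[net,lastPred] = predictAndUpdateState(net,XTrain);
forecast = zeros(1,numSteps);
for t = 1:numSteps
    [net,lastPred] = predictAndUpdateState(net,lastPred(end));
    forecast(t) = lastPred;
end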
Hi David,
Thank you for your email. I just implemented the basic code without teacher forcing yesterday, and during training it gives 30-40% accuracy. The author seems to suggest that the model should perform much better. There are two things missing from my model currently:
1) I have not embedded the tokens (page 27), and
2) no teacher forcing.
The problem with 1) is that these tokens are categorical, so after embedding (using the embed function) I don't know how to retrieve the original tokens from the embedded data. I also presume that the data has to be discretized, but there is no mention of this in the thesis, so I am not sure what is going on, to be honest. Perhaps you can shed some light on this (see the sketch after this comment).
2) I was going to try to use what you sent to implement teacher forcing, but I thought the open-loop approach is a much neater way to implement the solution in this case. I am also dubious about teacher forcing - I suspect that all it's going to do is overfit the data.
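On the embedding point, a small sketch, assuming the tokens live in a categorical array (tokens, scores and embeddingWeights are placeholder names). The embed function consumes positive integer indices, so the usual pattern is to keep the category list as a lookup table and map the network's predicted class indices back through it; the embedding itself never has to be inverted, and since the tokens are categorical they are already discrete values.

vocab = categories(tokens);   % index -> token lookup table, kept for decoding
idx   = double(tokens);       % token -> positive integer index; these indices
                              % are what the embed function (or an embedding
                              % layer) consumes

% ... pass idx through the embedding and the LSTM as usual ...

% The network output is a softmax distribution over the vocabulary, so tokens
% are recovered from the most likely class index, not from the embedded vectors.
[~,predIdx] = max(scores,[],1);
predictedTokens = categorical(vocab(predIdx));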

