Reinforcement Learning Toolbox - When does algorithm train?

Question

Hans-Joachim Steinort on 17 Sep 2019

https://in.mathworks.com/matlabcentral/answers/480728-reinforcement-learning-toolbox-when-does-algorithm-train

Commented: Hans-Joachim Steinort on 26 Sep 2019

Accepted Answer: Emmanouil Tzorakoleftherakis

I am currently using the RL-Toolbox with a DQN-Agent built into a long-running process-simulation.

The maximum step count is currently 8000 steps per episode.

Unfortunately, the documentation seems a little ambiguous to me, so here is my question:

Does the train function of the RL-Toolbox train the agent at the end of an episode, or during the episode once the step count exceeds the minibatch size (as in the baseline algorithms)?

Thank you in advance.

Answer 1

Emmanouil Tzorakoleftherakis on 25 Sep 2019

https://in.mathworks.com/matlabcentral/answers/480728-reinforcement-learning-toolbox-when-does-algorithm-train#answer_393529

The implementation is based on the algorithm listed here.

The weights are updated at each time step, not only at the end of an episode.
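To illustrate what "updated at each time step" means in practice, here is a minimal Python sketch of a generic DQN-style loop. It is not the toolbox's implementation: the toy environment, the tabular stand-in for the Q-network, and the names `run_episode`, `minibatch_size` and `buffer_capacity` are all assumptions made for this example. It only shows the pattern where the update sits inside the step loop and starts as soon as the experience buffer holds one minibatch.

```python
import random
from collections import deque


def run_episode(max_steps, minibatch_size, buffer_capacity=10_000, seed=0):
    """Run one toy episode and return the number of update steps performed."""
    rng = random.Random(seed)
    replay = deque(maxlen=buffer_capacity)  # experience buffer
    q = {}                                  # tabular stand-in for the Q-network
    alpha, gamma, epsilon = 0.1, 0.99, 0.1  # illustrative hyperparameters
    n_states, n_actions = 5, 2              # hypothetical toy problem size
    updates = 0
    state = 0

    for _ in range(max_steps):
        # Epsilon-greedy action selection.
        if rng.random() < epsilon:
            action = rng.randrange(n_actions)
        else:
            action = max(range(n_actions), key=lambda a: q.get((state, a), 0.0))

        # Hypothetical dynamics: walk around a ring, reward on reaching state 0.
        next_state = (state + (1 if action == 1 else -1)) % n_states
        reward = 1.0 if next_state == 0 else 0.0
        replay.append((state, action, reward, next_state))

        # The update happens here, inside the step loop, once a full
        # minibatch can be sampled. Nothing is deferred to the episode end.
        if len(replay) >= minibatch_size:
            for s, a, r, s2 in rng.sample(list(replay), minibatch_size):
                target = r + gamma * max(q.get((s2, b), 0.0) for b in range(n_actions))
                old = q.get((s, a), 0.0)
                q[(s, a)] = old + alpha * (target - old)
            updates += 1

        state = next_state

    return updates


if __name__ == "__main__":
    print(run_episode(max_steps=100, minibatch_size=32))
```

In this sketch an episode of 100 steps with a minibatch size of 32 performs 69 updates: none for the first 31 steps while the buffer fills, then one per step. The real agent may differ in details such as warm-up and target-network handling, so treat this as a picture of the timing only.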

1 Comment

Hans-Joachim Steinort on 26 Sep 2019

"For each training time step" - that was the line I was looking for (though looking into the source code led me to the same conclusion).

After double-checking the baseline algorithms, I found that they do it the same way.

Thank you for your time!
