Do MBPO agents not support recurrent neural networks for the environment model, the base off-policy agent, or both?

1 view (last 30 days)

Kundan Panta on 5 May 2024

0
Link

Direct link to this question

https://in.mathworks.com/matlabcentral/answers/2115186-do-mbpo-agents-not-support-recurrent-neural-networks-for-the-environment-model-the-base-off-policy

Commented: Kundan Panta on 7 May 2024

Since TD3, SAC, etc. agents support using recurrent layers by themselves, would using these recurrent base agents still not work with MBPO?

Could this limit be circumvented by using a custom training loop for the environment model and for the base agents?

2 Comments
Show NoneHide None

Naren Raman on 6 May 2024

Thank you for your question. No, MBPO agents do not support recurrent networks for now as mentioned in the documentation. The custom training loop provides more flexibility. Yes, you should be able to use the custom training loop to create a custom MBPO agent with recurrent neural networks.

Kundan Panta on 7 May 2024

Thank you for your timely response. To confirm that recurrent networks are not supported for the base agents, in addition to the environment model, I tried combining the "Create MBPO Agent" and "Create TD3 Agent with Recurrent Neural Networks" examples and it indeed threw an error.

Answers (0)

Products

Reinforcement Learning Toolbox

Release

R2024a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Do MBPO agents not support recurrent neural networks for the environment model, the base off-policy agent, or both?

2 Comments
Show NoneHide None

Answers (0)

See Also

Categories

Tags

Products

Release

Community Treasure Hunt

Do MBPO agents not support recurrent neural networks for the environment model, the base off-policy agent, or both?

2 Comments Show NoneHide None

Answers (0)

See Also

Categories

Tags

Products

Release

Community Treasure Hunt

2 Comments
Show NoneHide None