Bipedal walking robot TD3 training example bad convergence

Tech Logg Ding on 6 Apr 2021

https://in.mathworks.com/matlabcentral/answers/793687-bipedal-walking-robot-td3-training-example-bad-convergence

Edited: Tech Logg Ding on 6 Apr 2021

Hi all,

I ran the bipedal walking robot training example myself, and it converged to a suboptimal solution. I trained the TD3 agent and used the GPU to host my actor and critic networks.

The final simulation shows that the robot learnt to fall at the start of the simulation. Why does my training produce significantly different results from the example? Did hosting the networks on the GPU cause this?

Here's the training plot. Note that the maximum reward was only 35, compared with the 250 shown in the example.

Thank you :)


Answers (0)

Products

Reinforcement Learning Toolbox

Release

R2021a
