Trial software

How GAE calculates in Reinforement Learning Toolbox(PPO)?

2 views (last 30 days)

Show older comments

TigerSee on 14 Feb 2021

0
Link

Direct link to this question

https://in.mathworks.com/matlabcentral/answers/745072-how-gae-calculates-in-reinforement-learning-toolbox-ppo

Answered: Emmanouil Tzorakoleftherakis on 16 Feb 2021

Accepted Answer: Emmanouil Tzorakoleftherakis

A difference between help center and reference[3] about TD error.

Why

in Generalized Advantage Estimator?

https://ww2.mathworks.cn/help/reinforcement-learning/ug/ppo-agents.html

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Sign in to answer this question.

Accepted Answer

Emmanouil Tzorakoleftherakis on 16 Feb 2021

0
Link

Direct link to this answer

https://in.mathworks.com/matlabcentral/answers/745072-how-gae-calculates-in-reinforement-learning-toolbox-ppo#answer_624942

Hello,

Thank you for catching this typo - it should be Gt = Dt+V. I have let the documentation team know.

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

More Answers (0)

Sign in to answer this question.

Categories

Computational Finance Financial Toolbox

Find more on Financial Toolbox in Help Center and File Exchange

Tags

Products

Reinforcement Learning Toolbox

Release

R2020a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Trial software