Hi, I could not find out what the difference between "NextObs" and "LoggedSignals" is in the step function. In all the example scripts, both are returned by the step function:

[NextObs,Reward,IsDone,LoggedSignals] = myStepFunction(Action,LoggedSignals)

"LoggedSignals" is obviously used for the next step, but what is "NextObs" used for? Thanks!

Reinforcement learning: "NextObs" vs. "LoggedState" in step function

Anne Tscheliessnig on 28 Jul 2020

Taking the example of the Pendulum on https://de.mathworks.com/help/reinforcement-learning/ug/create-custom-reinforcement-learning-environment-in-matlab.html :

myStepFunction returns NextObs, Reward, IsDone and LoggedSignals, and takes Action and LoggedSignals as inputs:

function [NextObs,Reward,IsDone,LoggedSignals] = myStepFunction(Action,LoggedSignals)

So I would need LoggedSignals for the next step, and NextObs is what the agent uses? In that case I could not leave LoggedSignals empty, could I?

This example is very confusing to me, because near the end of myResetFunction it actually states:

NextObs = LoggedSignals.State;

Emmanouil Tzorakoleftherakis on 28 Jul 2020

Oh you were looking at creating custom environments with functions - I was looking at creating environments with classes by running e.g.

rlCreateEnvTemplate('myenv')

where LoggedSignals is not that important since you can use class variables to store the states.

I suspect the reason you need both LoggedSignals and NextObs is to provide a unified way of using custom environments regardless of how you create them. NextObs is probably what the agent uses when interacting with the environment, whereas LoggedSignals is a way to save intermediate values if you don't use classes to create your custom environment.
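
To make the distinction concrete, here is a minimal sketch, assuming a toy point-mass system rather than the MathWorks example. Only the step and reset signatures come from the thread; the dynamics, the sample time Ts, the reward, and the stand-in "agent" are hypothetical. It is meant to show that LoggedSignals can carry the full internal state from one call to the next, while NextObs is the (possibly smaller) part handed to the agent.

```matlab
% Sketch only: a toy 1-D point mass, not the MathWorks example.
% Everything except the step/reset signatures is a hypothetical illustration.

% Manual rollout showing where each output goes.
[obs, logged] = myResetFunction();
for k = 1:5
    % Stand-in for the agent: it decides using the observation only.
    action = -sign(obs(1));
    % LoggedSignals from the previous call is fed back in unchanged.
    [obs, reward, isDone, logged] = myStepFunction(action, logged);
    fprintf('step %d: obs = %.3f, reward = %.3f\n', k, obs, reward);
    if isDone
        break
    end
end

function [InitialObservation, LoggedSignals] = myResetFunction()
    % Full internal state: [position; velocity].
    LoggedSignals.State = [1; 0];
    % In this sketch the agent only sees the position.
    InitialObservation = LoggedSignals.State(1);
end

function [NextObs, Reward, IsDone, LoggedSignals] = myStepFunction(Action, LoggedSignals)
    Ts = 0.1;                        % sample time (assumed value)
    x = LoggedSignals.State(1);      % position stored by the previous call
    v = LoggedSignals.State(2);      % velocity stored by the previous call
    v = v + Ts * Action;             % the action acts as an acceleration
    x = x + Ts * v;
    LoggedSignals.State = [x; v];    % saved for the next call
    NextObs = x;                     % what the agent receives
    Reward = -x^2;                   % penalize distance from the origin
    IsDone = abs(x) > 2;             % end the episode if the mass drifts too far
end
```

In the cited example the observation happens to equal the full state, which is why the two look redundant there; in a sketch like this one, where the observation is only part of the state, the step function could not work with an empty LoggedSignals.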

lfyx on 1 Nov 2021

Hello, may I ask whether the "sim" function can output LoggedSignals to the workspace? A lot of information about the simulated actions and observations is saved in LoggedSignals. However, the output of "sim" is the experience structure.

Maha Mosalam on 22 Nov 2021

Hi, what exactly is the role of the IsDone flag? Should it be true or false, or something else?
