Live Monitoring of Critic Predictions in the RL Toolbox

walli on 17 Aug 2020

Direct link to this question

https://in.mathworks.com/matlabcentral/answers/580884-live-monitoring-of-critic-predictions-in-the-rl-toolbox

Edited: walli on 17 Aug 2020

Is it possible to monitor the Q-value predictions of a critic-based RL agent in the RL Toolbox? For example, a multi-output DQN agent has to call its internal deep NN at every step to evaluate all possible discrete actions for the current state sample. Somewhere internally, then, there must be a Q-value prediction for every available discrete action, and these predictions are compared to find the optimal action.
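
To make the mechanism concrete, here is a minimal, toolbox-independent sketch in Python of what is described above: a multi-output critic returns one Q-value per discrete action, and the greedy action is the index of the largest one. Everything in it is an assumption made for illustration. `toy_q_network`, its weights, and `greedy_action_with_q_values` are hypothetical names and do not correspond to any RL Toolbox function. The point is only that the full Q-value vector exists at every step and could be logged alongside the chosen action.

```python
from typing import Callable, List, Sequence, Tuple


def greedy_action_with_q_values(
    q_network: Callable[[Sequence[float]], Sequence[float]],
    state: Sequence[float],
) -> Tuple[int, List[float]]:
    """Evaluate a multi-output critic once and pick the greedy action.

    Returns the index of the action with the largest Q-value together
    with the full Q-value vector, so the caller can log or plot it.
    """
    q_values = list(q_network(state))  # one Q-value per discrete action
    best_action = max(range(len(q_values)), key=q_values.__getitem__)
    return best_action, q_values


def toy_q_network(state: Sequence[float]) -> List[float]:
    """Hypothetical stand-in for the critic: a linear map with 3 actions."""
    weights = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]  # illustrative values only
    return [sum(w * s for w, s in zip(row, state)) for row in weights]


# Record the Q-value vector at every step, which is the kind of
# monitoring asked about here.
q_log = []
for state in ([1.0, 2.0], [3.0, 0.5]):
    action, q_values = greedy_action_with_q_values(toy_q_network, state)
    q_log.append((state, q_values, action))
```

In this sketch `q_log` ends up holding one `(state, q_values, action)` tuple per step. What I am looking for is the equivalent hook in the toolbox itself.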

However, having spent some time with the 2020a documentation, I was not able to find a way to access these internal Q-value predictions at each time step. In particular, it would be nice if the Simulink-based agent block could provide these predictions for further processing and monitoring during the training and deployment phases.

Does anybody have a useful hint on how to retrieve the Q-value estimates during learning?

Answers (0)

Products

Reinforcement Learning Toolbox