Community Profile

photo

DAMODARAN B.K


Last seen: 2 years ago Active since 2021

Followers: 0   Following: 0

Statistics

  • Explorer

View badges

Feeds

View by

Question


Episode Q0 increases exponentially
Can anyone explain why episode Q0 in RL increases exponentially after convergence of reward to a suboptimal policy?

3 years ago | 1 answer | 0

1

answer