Fig. 2
From: Towards reinforcement learning for vulnerability analysis in power-economic systems

Learning curve of the 15 agents over 200,000 training steps, showing the average return and its standard deviation, each computed over 10 episodes