Fig. 7
From: Peer-to-peer energy trading optimization in energy communities using multi-agent deep reinforcement learning
Mean reward for each agent per training episode: (a) DDPG and (b) TD3