Fig. 1
From: A stochastic deep reinforcement learning agent for grid-friendly electric vehicle charging management
The deep neural network architecture of the actor (left) and critic (right). Parameter heads a and b refer to the two branches of the actor network that return the beta policy parameters a and b.
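
To make the two-headed actor concrete, the following is a minimal sketch in plain Python of a network with a shared trunk and two output branches that return the beta policy parameters a and b. Everything beyond that two-branch structure is an assumption for illustration and is not taken from the article: the class name `BetaActor`, the single tanh hidden layer, the layer sizes, the random initialisation, and the `1 + softplus` transform used to keep both parameters above 1.

```python
import math
import random


def softplus(x):
    # Numerically stable softplus: log(1 + exp(x))
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def dense(inputs, weights, biases):
    # One fully connected layer; `weights` holds one row per output unit
    return [sum(w * x for w, x in zip(row, inputs)) + b
            for row, b in zip(weights, biases)]


def init_layer(n_in, n_out, rng):
    # Small random weights and zero biases (illustrative initialisation)
    weights = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out


class BetaActor:
    """Hypothetical actor: shared trunk followed by parameter heads a and b."""

    def __init__(self, state_dim, hidden_dim=8, seed=0):
        self.rng = random.Random(seed)
        self.trunk = init_layer(state_dim, hidden_dim, self.rng)
        self.head_a = init_layer(hidden_dim, 1, self.rng)
        self.head_b = init_layer(hidden_dim, 1, self.rng)

    def parameters(self, state):
        # Shared hidden representation feeding both heads
        hidden = [math.tanh(h) for h in dense(state, *self.trunk)]
        # Assumption: 1 + softplus keeps a and b above 1 (unimodal beta)
        a = 1.0 + softplus(dense(hidden, *self.head_a)[0])
        b = 1.0 + softplus(dense(hidden, *self.head_b)[0])
        return a, b

    def act(self, state):
        # Sample an action in [0, 1] from Beta(a, b)
        a, b = self.parameters(state)
        return self.rng.betavariate(a, b)


actor = BetaActor(state_dim=3)
a, b = actor.parameters([0.1, 0.5, -0.2])
action = actor.act([0.1, 0.5, -0.2])
```

Because a beta distribution has support on [0, 1], the sampled action is bounded by construction and can be rescaled to a physical range afterwards, for example a charging power between zero and a maximum rate.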