References | Temporal resolution | Objective | Stochastic policy | Voltage violation | Method |
---|---|---|---|---|---|
Chang et al. (2019) | 30 mins | Cost, expected SOC at the end | No | No | Q-learning |
Wan et al. (2019) | 1 h | Cost, incl. battery degradation | No | No | DQN |
Ding et al. (2020) | 1 h | DSO profits | Yes | Yes | DDPG |
Dorokhova et al. (2021) | 1 h | PV self consumption | No | Yes | DDQN, DDPG, PDQN |