Leite, Gabriel Matos Cardoso, Marcelino, Carolina Gil, Jiménez-Fernández, Silvia, Wanner, Elizabeth Fialho, Salcedo-Sanz, Sancho and Pedreira, Carlos Eduardo (2026). A cross-entropy based direct policy search algorithm for multi-objective energy storage control. Neural Computing and Applications, 38 ,
Abstract
Effective control of Energy Storage Systems (ESS) is crucial for the secure and profitable operation of microgrids. In this context, ESSs are essential for enhancing the overall grid resilience, balancing supply, and mitigating voltage and frequency variations. This paper presents a novel neuroevolutionary method, coupling a modified version of the Multi-Objective Evolutionary Policy Search (MEPS) algorithm with the Cross-Entropy method, aimed at optimizing an ESS control problem. The modified MEPS, named Cascade-MEPS, employs a cascade weights mutation operator to refine policies by focusing on the most recent hidden node, ensuring localized and non-disruptive adjustments. The resulting algorithm, referred to as cross-entropy Cascade-MEPS (CE-CMEPS), utilizes the cross-entropy method as a depth initialization strategy, conducting an initial exploration of the weights space to initialize the population prior to Cascade-MEPS execution. Experimental validation on a newly proposed multi-objective ESS control problem demonstrates the efficacy of CE-CMEPS, showcasing performance improvements and reduced variation compared to standalone MEPS. Our results show that CE-CMEPS is an effective ESS discharge controller and a sustainable multi-objective reinforcement learning solution.
| Publication DOI: | https://doi.org/10.1007/s00521-025-11785-3 |
|---|---|
| Divisions: | College of Engineering & Physical Sciences > School of Computer Science and Digital Technologies > Applied AI & Robotics College of Engineering & Physical Sciences > Aston Centre for Artifical Intelligence Research and Application College of Engineering & Physical Sciences > School of Computer Science and Digital Technologies College of Engineering & Physical Sciences Aston University (General) |
| Additional Information: | © The authors 2026. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use,sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to theoriginal author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. |
| Uncontrolled Keywords: | Direct policy search (DPS),Reinforcement learning,Energy management,Multi-objective (MO) control,Neuroevolution,Neural networks architecture |
| Publication ISSN: | 1433-3058 |
| Last Modified: | 18 Feb 2026 08:07 |
| Date Deposited: | 17 Feb 2026 15:48 |
| Full Text Link: | |
| Related URLs: |
https://link.sp ... 521-025-11785-3
(Publisher URL) |
PURE Output Type: | Article |
| Published Date: | 2026-02-13 |
| Published Online Date: | 2026-02-13 |
| Accepted Date: | 2025-10-09 |
| Authors: |
Leite, Gabriel Matos Cardoso
Marcelino, Carolina Gil Jiménez-Fernández, Silvia Wanner, Elizabeth Fialho (
0000-0001-6450-3043)
Salcedo-Sanz, Sancho Pedreira, Carlos Eduardo |
0000-0001-6450-3043