Projects per year
Abstract
Reinforcement learning (RL) has demonstrated its advantages in the intelligent control of many vehicle systems. However, controlling the exploration-to-exploitation (E2E) ratio of RL for the best performance in real-world operations is a great challenge. To obtain the optimal E2E ratio for managing the energy flow of a plug-in hybrid electric vehicle (PHEV) in real-world driving, this paper proposes an ensemble learning scheme based on two independent Q-learning agents working back-to-back competitively. At each sampling time, these agents generate two candidate control actions based on state observation and their control policies. Three decay functions, including the widely-used exponential decay and two new decay methods i.e., reciprocal function-base decay and step-based decay, are introduced to formulate one-dimensional look-up tables for E2E control. Then the PHEV control action will be selected from the candidate actions by a learning automata module(LAM) developed in this paper. The combinations of the three decay methods and three ensemble strategies with maximum-based, random-based, and weighted-based methods are investigated with the considerations of their energy efficiency, real-time performance, and robustness. Experiments are carried out on software-in-loop and hardware-in-the-loop platforms to demonstrate the energy-saving potentials. By implementing the ensemble learning scheme based on the weighted-based ensemble method in the control of the studied PHEV, up to 1.04% of energy can be saved under the predefined real-world driving cycles compared to the conventional Q-learning scheme.
| Original language | English |
|---|---|
| Pages (from-to) | 2479-2489 |
| Number of pages | 11 |
| Journal | IEEE Transactions on Intelligent Vehicles |
| Volume | 10 |
| Issue number | 4 |
| Early online date | 18 Mar 2024 |
| DOIs | |
| Publication status | Published - 27 Aug 2025 |
Bibliographical note
Publisher Copyright:© 2016 IEEE.
Keywords
- Energy management
- ensemble learning
- exploration-to-exploitation ratio
- hybrid electric vehicle
- reinforcement learning
ASJC Scopus subject areas
- Automotive Engineering
- Control and Optimization
- Artificial Intelligence
Fingerprint
Dive into the research topics of 'Optimal Energy Management of Plug-In Hybrid Electric Vehicles Through Ensemble Reinforcement Learning With Exploration-to-Exploitation Ratio Control'. Together they form a unique fingerprint.Projects
- 1 Finished
-
Hybrid Electric Push-Back Tractor (HEIP-BT)
Xu, H. (Principal Investigator) & Olatunbosun, R. (Co-Investigator)
1/07/15 → 30/09/17
Project: Other Government Departments