Abstract
There is a trend to implement end-to-end learning-based control for electric vehicles; however, how to ensure the optimality and safety of the learning-based control methods is a great challenge. This article proposes a novel ensemble Q approach incorporating in-target minimization and update-to-data (UTD) mechanisms to control a plug-in hybrid vehicle that has an end-to-end control architecture. By employing these mechanisms, the prevalent issue of Q-function overestimation bias in Q-learning can be addressed with a reduction from 1.18 to 5.2 × 10-4. Meanwhile, the proposed method can also improve sample efficiency, outperforming conventional Q-learning methods in terms of learning stability and performance. The advantages of the proposed method in terms of energy efficiency and learning stability were validated through experiments on software-in-the-loop (SiL) and hardware-in-the-loop (HiL) platforms. The results show that up to 10.09% and 15.59% energy savings can be achieved through the comparison with conventional Q-learning and double Q approaches.
| Original language | English |
|---|---|
| Pages (from-to) | 11570-11581 |
| Number of pages | 12 |
| Journal | IEEE Transactions on Transportation Electrification |
| Volume | 11 |
| Issue number | 5 |
| Early online date | 13 Jun 2025 |
| DOIs | |
| Publication status | Published - Oct 2025 |
Bibliographical note
Publisher Copyright:© 2015 IEEE.
Keywords
- End-to-end learning
- energy management
- ensemble learning
- hybrid electric vehicles
- in-target minimization
ASJC Scopus subject areas
- Automotive Engineering
- Transportation
- Energy Engineering and Power Technology
- Electrical and Electronic Engineering