1. Introduction
The escalating global energy demand and environmental challenges posed by climate change have intensified the search for clean energy solutions and efficient storage technologies. Lithium-ion batteries (LIBs), with their broad operating temperature range, extended cycle life, and high energy density, have emerged as a cornerstone of the green energy transition. While these batteries power diverse applications from consumer electronics to spacecraft, their role in electric vehicles (EVs) is particularly crucial. Industry and academic researchers have made significant strides in enhancing EV safety, performance, and range. The continuous advancement of EV technologies—including battery systems, autonomous capabilities, and other innovations—underscores their fundamental role in achieving a sustainable future.
Hybrid methods have emerged as a promising solution, combining multiple approaches such as Coulomb counting with Kalman filtering to optimize accuracy, robustness, and computational efficiency. The selection of an appropriate SOC estimation method ultimately depends on application-specific requirements. While basic applications may find simpler methods sufficient, high-performance EVs typically require advanced techniques to ensure reliable operation across diverse conditions.
Accurate SOC estimation in LIBs is crucial for the safe and efficient operation of EVs. While laboratory settings offer controlled conditions for SOC characterization, real-world driving presents significant challenges. Variable initial SOCs, temperature fluctuations, unpredictable load variations, and diverse driving patterns distort voltage signals, rendering conventional SOC estimation methods unreliable. Existing studies often simplify scenarios with constant loads or predetermined SOC levels, which do not generalize well to the complexities of real-world driving. This discrepancy between controlled experiments and real-world conditions creates a critical need for robust SOC estimation methods capable of handling diverse and noisy data. The primary objective of this study is to develop a novel data augmentation technique using synthetic data to improve the accuracy and robustness of SOC estimation models in real-world EV applications. Generating synthetic data mitigates the limitations associated with relying exclusively on real-world datasets, which are often constrained in terms of size, diversity, and their ability to represent all possible driving conditions. Training models on a combination of real and synthetic data enhance their generalization capabilities for unseen real-world scenarios, thereby improving the accuracy of SOC estimation under challenging conditions.
This model synthesizes dynamic data representing real-world driving scenarios, enabling more effective training of deep learning-based SOC estimation models. TS-p2pGAN integrates environmental, vehicle, battery, and heating system variables, concatenating them with time-series features to generate synthetic SOC and motion data. It ensures robust temporal dependencies among variables and accommodates varying sequence lengths, offering efficient representations of complex time-series data. By training on historical time-series data, TS-p2pGAN captures temporal patterns and generates plausible future trajectories, enhancing dataset diversity and machine learning model performance, especially when real-world data collection is challenging. This capability makes TS-p2pGAN suitable for a wide range of time-series analysis and prediction tasks.
The key contributions of this study are as follows:
Data augmentation framework for EV driving scenarios: A GAN-based framework for synthesizing multivariate time-series data specifically for EV driving scenarios is presented. This framework addresses the limited availability of real-world data by enabling the generation of larger and more diverse datasets suitable for training and evaluating SOC estimation models.
TS-p2pGAN model architecture: The TS-p2pGAN model, incorporating an integrated transformation network and a multiscale discriminator, is designed to handle high-dimensional, extended time sequences. This architecture aims to capture complex temporal dependencies within the data and generate synthetic time series that preserve these dependencies.
Evaluation protocol using quantitative and qualitative metrics: An evaluation protocol employing both quantitative and qualitative metrics is implemented to assess the quality and characteristics of the generated synthetic data. This protocol provides insights into the model’s performance and facilitates further development.
Validation with real-world driving data: The model’s performance is evaluated using data from 70 real-world driving trips, demonstrating its ability to generalize to real-world conditions and its potential for practical application in EV SOC estimation.
4. Conclusions
The challenge of limited access to real-world data significantly impacts machine learning applications in EV and power battery dynamics analysis, as traditional time-series data augmentation methods often struggle to maintain essential signal characteristics while expanding datasets. To address this challenge, TS-p2pGAN, a novel model designed for generating variable-length synthetic time-series data while preserving original signal properties, is introduced. The model’s architecture uniquely incorporates a transformation net generator for point-to-point translation, utilizing gradient flow from multiple discriminators to a single generator across various scales to effectively capture complex EV parameter influences, including SOC and motor output torque.
Validation, conducted using an open dataset of 70 EV driving trips with comprehensive battery condition data, demonstrated TS-p2pGAN’s superior performance compared to TTS-GAN and TimeGAN in generating realistic and accurate time-series data. The model particularly excelled in preserving temporal dynamics, crucial for maintaining inter-variable relationships across time sequences. Quantitative analysis revealed impressive results, with RMSE values consistently below 3% and MAE values under 1.5% across all trips. Qualitative assessments through t-SNE and PCA visualizations further confirmed the high fidelity of generated data, while discriminative and predictive capability tests highlighted TS-p2pGAN’s advantages over TimeGAN in time-series generation.
The practical implications of TS-p2pGAN extend beyond data generation, offering significant potential for enhancing SOC and motor torque estimation, ultimately contributing to EV energy consumption optimization. The model’s ability to effectively leverage both spatial and temporal features surpasses traditional methods in learning complex time-series patterns while maintaining data integrity.
Despite these achievements, TS-p2pGAN’s reliance on paired datasets presents a notable limitation in diverse real-world environments where such data are often scarce. Future research could enhance the model’s versatility by exploring integration with unpaired learning approaches, particularly through CycleGAN architectures augmented with physics-informed constraints. While this limitation exists, the model demonstrates remarkable robustness through its ability to generate synthetic parameters that maintain consistency with both physical constraints and vehicle dynamics across various driving conditions. Ultimately, TS-p2pGAN marks a breakthrough in synthetic time-series generation for electric vehicle applications, delivering a framework that successfully balances high fidelity, practical utility, and real-world applicability.
Future work could explore the application of this data augmentation framework across diverse domains, including digital energy management systems for LIBs, autonomous vehicle comfort systems, and broader EV technologies. Real-time implementation and validation of TS-p2pGAN in diverse on-road scenarios are crucial to evaluate its performance under dynamic and variable conditions. Collaborating with industries, such as automotive manufacturers, and integrating the framework into existing control systems can significantly enhance its practical utility. The framework’s ability to generate synthetic parameters that adhere to physical constraints and maintain consistency with underlying system dynamics across different conditions demonstrates its robustness and potential for wide-ranging applications. Furthermore, the framework can be employed to accurately estimate SOC for LIBs’ battery management systems (BMSs), improving their reliability and efficiency. This offers significant contributions to fields requiring high-fidelity synthetic time-series data, particularly in enhancing the performance of LIBs and BMS technologies.
Source link
Shyr-Long Jeng www.mdpi.com