Keywords: DC-DC bidirectional converter; Mild hybrid electric vehicle; Deep learning; Neural network

This paper presents a non-intrusive approach for modeling a bidirectional DC-DC converter used in mild hybrid electric vehicles. A black-box identification methodology is proposed to find a model based on the data acquired from the input/output terminals. Measured data include the steady state and transient response, and different operating conditions of the DC-DC converter, including the buck and boost modes. A deep learning architecture based on a long short-term memory neural network (LSTM-NN) is applied. The trained network is tested under a set of operating points different from those used during the training stage. The proposed method is compared with three black-box modeling techniques commonly used in power converters, proving its superior performance. Results presented in this paper indicate that the proposed model is able to replicate the behavior of the bidirectional converter without a priori knowledge of the converter circuitry. This approach can also be applied to other power devices.
Universitat Politècnica de Catalunya, Electrical Engineering Department, Rambla Sant Nebridi 22, 08222 Terrassa, Catalunya, Spain.
E-mail addresses: (G. Rojas-Dueñas), (J. Roger Riba), (M. Moreno-
Received 3 May 2021; Received in revised form 22 June 2021; Accepted 26 June 2021
Available online 10 July 2021
0263-2241/© 2021 Elsevier Ltd. All rights reserved.
neural network is tested under new conditions, and the predicted outputs are compared to the actual measured outputs of the bidirectional converter. Finally, the root mean squared error (RMSE) and the coefficient of determination (R²) for the two predicted signals are calculated.
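As an illustration of the two metrics named above, the following minimal sketch computes the RMSE and the coefficient of determination with their standard definitions. The function names and the sample voltage values are hypothetical and are not taken from the paper's dataset.

```python
import math


def rmse(measured, predicted):
    """Root mean squared error between two equally long sequences."""
    if len(measured) != len(predicted) or not measured:
        raise ValueError("sequences must be non-empty and of equal length")
    squared = sum((m - p) ** 2 for m, p in zip(measured, predicted))
    return math.sqrt(squared / len(measured))


def r_squared(measured, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    if len(measured) != len(predicted) or not measured:
        raise ValueError("sequences must be non-empty and of equal length")
    mean = sum(measured) / len(measured)
    ss_res = sum((m - p) ** 2 for m, p in zip(measured, predicted))
    ss_tot = sum((m - mean) ** 2 for m in measured)
    return 1.0 - ss_res / ss_tot


# Hypothetical output-voltage samples (not data from the paper).
v_measured = [12.0, 12.1, 11.9, 12.2]
v_predicted = [12.0, 12.0, 12.0, 12.1]
print("RMSE:", rmse(v_measured, v_predicted))
print("R2:", r_squared(v_measured, v_predicted))
```

A lower RMSE and an R² closer to 1 both indicate a closer match between the predicted and measured signals.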
Fig. 3. LSTM cell unit.
2.1. Mild hybrid electrical vehicle (MHEV)

The system to be evaluated in this paper is an MHEV, a topology used by major manufacturers such as Audi, Toyota, Nissan or Honda, among others [23]. This type of vehicle improves fuel efficiency by supplying power to the electric motor in certain situations or by applying regenerative braking. MHEVs require a higher voltage (48 V) to feed the electrical systems of the vehicle in order to reduce conduction losses [5,24]. The MHEV is a transition technology that sits midway between the conventional 12 V supply system and the 48 V system proposed by some manufacturers of premier cars [7].

This paper focuses on the dual-voltage architecture that considers two batteries and a DC-DC bidirectional converter, the latter being the device to be modeled. Understanding the working principle and the topology of the electrical system of the MHEV is fundamental in order to obtain an accurate model of the power converter.

Fig. 2 shows the dual architecture of this electrical system and its main components. At the 48 V side there is a battery, a starter generator, a drive (DC-AC or AC-DC depending on whether the machine works as a motor or as a generator, respectively) and loads, whereas at the 12 V side there is a battery and a set of low voltage loads. Fig. 2 does not specify the topology of the DC-DC converter since it may change depending on the manufacturer of the automobile and its requirements [7]. However, for the purpose of this paper this is irrelevant, since the proposed black-box approach is able to model the bidirectional converter regardless of its topology.

Each battery is in charge of stabilizing its own network. When the belt-driven starter generator (BISG) is operating as a generator, the converter works in the buck mode, the two batteries are charged and the power is supplied to the loads. When the BISG operates as a motor, the DC-DC converter operates in the boost mode and the power flows from the 12 V side to the 48 V side [5]. Some manufacturers include a starter on the low voltage side in order to start the motor in case of failure of the 48 V network.

modeling time-series problems with good accuracy. However, their main problem is the difficulty of learning long-term dependencies of the data because of the gradient decay. LSTM-NNs overcome this problem [26]. This type of network is able to discard unrelated information during the training process, leading to the preservation of the backpropagation error over time. The weights of the neural network are updated based on the memory units of the LSTM cells, which are not affected by external perturbations [27]. A great advantage of LSTM-NNs, compared to traditional RNNs, is that they keep learning at every time step, even in case of numerous time steps.

The fundamental unit of the LSTM-NN is the cell, which includes memory blocks that store, write, read and erase information during the training process. This is accomplished thanks to the presence of gates, which allow the network to update the weights of the neurons at each iteration [28].

Fig. 3 shows the architecture of LSTM cells, where c refers to the cell memory state, h is the hidden state, x is the input value and σ represents the sigmoid function. Subscript t refers to the time step of the cell, since there are as many cells as time steps in the time series. The LSTM is a block where different cells are connected sequentially, so that each cell generates a future output depending on the past values. Fig. 3 shows the input, forget, output and input modulation gates of an LSTM cell. The first three are controlled by the sigmoid function σ, and they generate an output between 0 and 1 based on the significance of the h_{t-1} input value [28], whereas the input modulation gate uses the ‘tanh’ activation function. Pointwise operations are applied inside the cell in order to connect the four gates and to obtain the output values. The interaction between the gates and the capacity of forgetting irrelevant information allows the network to learn long-range temporal dependencies [9]. This is desirable for the problem studied in this paper, because the outputs of a DC-DC converter depend on the previous states [29].

This architecture allows calculating the output values just by applying mathematical calculations to the input data. Eq. (1) presents the operations that take place at the different gates of the LSTM cell.
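The gate operations of an LSTM cell can be sketched in a few lines of code. The example below uses the common textbook formulation (sigmoid input, forget and output gates, a tanh input modulation gate, and pointwise updates of c and h); it is an assumption that this matches the paper's Eq. (1) term by term. The helper names (`affine`, `lstm_step`, `random_params`), the layer sizes and the random weights are hypothetical and chosen only for illustration.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def affine(W, U, b, x, h):
    """Return W @ x + U @ h + b for plain nested lists."""
    return [
        sum(w * xi for w, xi in zip(W_row, x))
        + sum(u * hi for u, hi in zip(U_row, h))
        + bias
        for W_row, U_row, bias in zip(W, U, b)
    ]


def lstm_step(x, h_prev, c_prev, params):
    """One time step of a textbook LSTM cell.

    params maps each gate name ("i", "f", "o", "g") to a (W, U, b) triple.
    """
    i = [sigmoid(z) for z in affine(*params["i"], x, h_prev)]    # input gate
    f = [sigmoid(z) for z in affine(*params["f"], x, h_prev)]    # forget gate
    o = [sigmoid(z) for z in affine(*params["o"], x, h_prev)]    # output gate
    g = [math.tanh(z) for z in affine(*params["g"], x, h_prev)]  # input modulation gate
    # Pointwise operations combine the four gates into the new states.
    c = [fk * ck + ik * gk for fk, ck, ik, gk in zip(f, c_prev, i, g)]
    h = [ok * math.tanh(ck) for ok, ck in zip(o, c)]
    return h, c


def random_params(n_in, n_hidden, rng):
    """Hypothetical random weights; a trained network would supply these."""
    def mat(rows, cols):
        return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]
    return {
        gate: (mat(n_hidden, n_in), mat(n_hidden, n_hidden), [0.0] * n_hidden)
        for gate in "ifog"
    }


rng = random.Random(0)
params = random_params(n_in=2, n_hidden=3, rng=rng)
h, c = [0.0] * 3, [0.0] * 3
sequence = [[0.1, 0.2], [0.3, -0.1], [0.0, 0.5]]  # one input vector per time step
for x_t in sequence:
    # The same cell is applied at every time step, carrying h and c forward.
    h, c = lstm_step(x_t, h, c, params)
print(h, c)
```

Because the forget gate multiplies the previous memory state c, the cell can retain or discard past information at each step, which is the mechanism the text credits for learning long-range dependencies.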
Table 3
Bayesian Optimization Range.
Hyper-parameter                 Minimum   Maximum
Neurons                         5         150
Learning rate (LR)              0.001     0.1
Gradient decay factor (GDF)     0.8       0.99
L2 regularization value         1e-10     1e-2
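The bounds in Table 3 define a search space that can be written down directly in code. The sketch below draws candidate configurations from that space with plain random sampling, which is a simpler stand-in and not the Bayesian optimization algorithm used in the paper. The dictionary keys, the log-scale sampling of the learning rate and L2 value, and the helper names are assumptions made for illustration. The final check uses the optimum reported in the text (29 neurons, LR = 0.03426, GDF = 0.974, L2 = 0.00169).

```python
import math
import random

# Bounds copied from Table 3.
SEARCH_SPACE = {
    "neurons": (5, 150),
    "learning_rate": (0.001, 0.1),
    "gradient_decay_factor": (0.8, 0.99),
    "l2_regularization": (1e-10, 1e-2),
}
# Assumption: parameters spanning several decades are sampled on a log scale.
LOG_SCALE = {"learning_rate", "l2_regularization"}


def sample_candidate(rng):
    """Draw one random configuration inside the Table 3 bounds."""
    candidate = {}
    for name, (low, high) in SEARCH_SPACE.items():
        if name == "neurons":
            candidate[name] = rng.randint(low, high)
        elif name in LOG_SCALE:
            value = 10 ** rng.uniform(math.log10(low), math.log10(high))
            candidate[name] = min(max(value, low), high)  # guard against rounding
        else:
            candidate[name] = rng.uniform(low, high)
    return candidate


def in_bounds(candidate):
    """Check that every hyper-parameter lies within its Table 3 range."""
    return all(
        SEARCH_SPACE[name][0] <= value <= SEARCH_SPACE[name][1]
        for name, value in candidate.items()
    )


rng = random.Random(0)
candidates = [sample_candidate(rng) for _ in range(30)]  # 30 trials, as in the text

reported_optimum = {
    "neurons": 29,
    "learning_rate": 0.03426,
    "gradient_decay_factor": 0.974,
    "l2_regularization": 0.00169,
}
print(in_bounds(reported_optimum))
```

A Bayesian optimizer would replace `sample_candidate` with a model-guided proposal step, but the bounds and the validity check would stay the same.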
Table 4
Method    RMSE (Iin)    RMSE (Vout)    R² (Iin)    R² (Vout)    Time
Fig. 8. Transient response measurement and estimation of (a) Output voltage (boost mode to buck mode), (b) Output current (boost mode to buck mode), (c) Output voltage (buck mode to boost mode), (d) Output current (buck mode to boost mode).

Fig. 9. Scatter plot comparing measured and estimated signals. (a) output voltage (LSTM-NN), (b) output current (LSTM-NN), (c) output voltage (NARX), (d) output current (NARX), (e) output voltage (Polytopic), (f) output current (Polytopic), (g) output voltage (State space), (h) output current (State space).

experiment. In this case, the transient response of the bidirectional converter can be clearly appreciated.

A total of 750 experiments were assigned to the training dataset, whereas the validation and test datasets contain 125 experiments each. The next step is to tune the hyper-parameters of the NN.

Table 3 presents the minimum and maximum bounds of the variables to be tuned by the Bayesian optimization algorithm (BOA). A total of 30 NNs were trained by the BOA, each training process run for 100

The BOA took 12 min to train the 30 neural networks. It was found that the optimal point has the following hyper-parameters: 29 neurons, LR = 0.03426, GDF = 0.974 and L2 = 0.00169. Subsequently, the LSTM-NN was trained with these hyper-parameters for 1000 epochs using the ‘adam’ solver.

Fig. 7 shows the evolution of the training progress.

The following step consists of validating the NN under conditions different from those used during the training stage. Table 4 shows the RMSE and R² values obtained after evaluating 125 experiments from the test dataset. It also compares these values to the ones obtained from the predictions made using the other three methods. The proposed methodology outperforms the other three techniques because the RMSE is lower and the R² coefficient is closer to 1.

To better understand the accuracy of the proposed model, Fig. shows the actual and predicted outputs of two experiments of the test dataset. Fig. a) and b) show an experiment in which the converter changes its operating mode from boost to buck, whereas in Fig. c) and d) the operating mode changes in the opposite way. In both cases, the estimation of the LSTM-NN is close to the measurements. Fig. 8 also shows that the accuracy of the transient response is better than that estimated by the other three models. The LSTM-NN model accurately replicates the steady state and transient response values, while the other methods exhibit some discrepancies. It is worth noting that the estimation of the
