Page 153 - Artificial Intelligence for Computational Modeling of the Heart


Chapter 4 Data-driven reduction of cardiac models 125

Figure 4.4. Deep neural network model employed for computing cFFR_ML: fully
connected architecture with four hidden layers.

is defined independently at each centerline location. Since local
coronary hemodynamics are influenced by the local anatomy as well
as by the proximal and distal anatomy, features are defined based
on local, proximal and distal anatomical characteristics. The
coronary circulation has a tree-like structure; thus, there is a
single upstream path but typically multiple downstream paths.
Hence, to define the features of the distal anatomy, a main
downstream path is selected based on the healthy reference radius
and on the number and length of the downstream branches.
A deep neural network with four hidden layers is used as the
machine learning model (Fig. 4.4). Each neuron in each layer is
connected to all neurons in the next layer, i.e., a fully connected
architecture is employed (no convolutional layers). A total of 28
features are extracted from the anatomical model for each location
and fed to the input layer of the network. The first hidden layer
contains 256 neurons, and the number of neurons then decreases by
a factor of 4 in each subsequent hidden layer (256, 64, 16 and 4).
All hidden-layer activation functions are sigmoidal, while the
output layer, a single neuron, has a linear activation function.
Random (Xavier) initialization is performed for all weights. Each
layer is first trained as an autoencoder, which reduces the overall
model training time. The loss is defined as the mean squared error
between the CFD-based FFR values and the ML-predicted values, and
the parameters are optimized using a stochastic gradient descent
algorithm. To further reduce the model training time, an optimized
GPU-based implementation was employed.
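
The architecture described above can be illustrated with a minimal sketch in plain Python. It is not the authors' implementation: the uniform variant of Xavier initialization, the zero biases, and all function names (`xavier_layer`, `init_network`, `predict`, `mse`) are assumptions made for illustration, and the autoencoder pretraining, the stochastic gradient descent updates and the GPU implementation are omitted.

```python
import math
import random

# Layer widths from the text: 28 input features, a first hidden layer of 256
# neurons, each following hidden layer 4x smaller, and a single output neuron.
N_FEATURES = 28
HIDDEN = [256 // 4**i for i in range(4)]  # [256, 64, 16, 4]
LAYER_SIZES = [N_FEATURES] + HIDDEN + [1]


def xavier_layer(fan_in, fan_out, rng):
    """Assumed uniform Xavier variant: weights in +/- sqrt(6 / (fan_in + fan_out)).

    Biases start at zero (also an assumption; the text does not mention them).
    """
    limit = math.sqrt(6.0 / (fan_in + fan_out))
    weights = [[rng.uniform(-limit, limit) for _ in range(fan_in)]
               for _ in range(fan_out)]
    return weights, [0.0] * fan_out


def init_network(rng):
    """Create one (weights, biases) pair per fully connected layer."""
    return [xavier_layer(n_in, n_out, rng)
            for n_in, n_out in zip(LAYER_SIZES[:-1], LAYER_SIZES[1:])]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def predict(network, features):
    """Forward pass: sigmoid on the four hidden layers, linear on the output."""
    activations = features
    last = len(network) - 1
    for index, (weights, biases) in enumerate(network):
        pre = [sum(w * a for w, a in zip(row, activations)) + b
               for row, b in zip(weights, biases)]
        activations = pre if index == last else [sigmoid(z) for z in pre]
    return activations[0]


def mse(targets, predictions):
    """Mean squared error between CFD-based FFR targets and ML predictions."""
    return sum((t - p) ** 2 for t, p in zip(targets, predictions)) / len(targets)
```

Calling `predict(init_network(random.Random(0)), features)` with a list of 28 feature values returns a single untrained prediction for one centerline location; in practice the weights would first be fitted by minimizing `mse` over the training set.
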
The synthetic datasets were randomly split into training and
validation datasets using a 5:1 split. During training and
validation, when the learning rate, momentum and other relevant
hyperparameters were tuned, the model was never evaluated on
patient-specific datasets.
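
A random 5:1 split of this kind could be sketched as follows. The helper name `split_train_validation`, the fixed seed and the reading of 5:1 as five parts training to one part validation are assumptions for illustration; the text does not specify how the split was implemented.

```python
import random


def split_train_validation(samples, ratio=(5, 1), seed=0):
    """Shuffle the samples and split them into training and validation parts."""
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)
    # With ratio 5:1, five sixths of the samples go to the training set.
    n_train = len(shuffled) * ratio[0] // sum(ratio)
    return shuffled[:n_train], shuffled[n_train:]


# Toy example with 12 placeholder sample indices.
train, validation = split_train_validation(range(12))
print(len(train), len(validation))  # prints: 10 2
```

Keeping the patient-specific datasets entirely outside both parts, as the text describes, means they remain an untouched test set after hyperparameter tuning.
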
In the following, the specific features that were defined as input
to the network are described in detail.
