Figs. 8.9 and 8.10 compare the original NMR T2 distributions with those synthesized by the deep learning models from the inversion-derived and raw logs, respectively. The deep learning models exhibit better NMR T2 synthesis than the shallow-learning models, and the synthetic NMR T2 distributions are smooth and appear physically consistent.
4.6 Cross validation
Because of the limited size of the training dataset, the deep learning models
were trained for a fixed number of training steps. To ensure that the deep
learning models do not overfit, cross validation is recommended. In our
study, we test the efficacy of cross validation on the VAE-NN model. A
validation set is built by randomly selecting 100 samples from the 300-ft
interval of the shale formation, leaving 375 samples in the training set. The
validation loss is monitored during training, and training is automatically
stopped when the validation loss starts to increase, thereby preventing
overfitting. The VAE-NN model trained in this manner exhibits an R2 score of
0.60 on the testing dataset containing 100 samples. Notably, cross validation
degraded the testing/generalization performance because setting aside a
validation set further reduced the already limited data available for this
study. In our case, the separate validation set shrinks the training set,
resulting in a poorly trained, highly biased deep learning model. Cross
validation is better suited to cases where the model has access to a large
training dataset; in such cases, the validation set can be used to monitor
overfitting.
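The chapter does not reproduce the training code, but the validation-based early stopping described above can be sketched as follows. This is a minimal illustration assuming a Keras/TensorFlow setup; the array names (X, y), the stand-in network architecture, the input/output dimensions, and the patience value are hypothetical placeholders, not the authors' implementation.

```python
# Minimal sketch of validation-based early stopping for a log-synthesis network.
# All names and dimensions are illustrative assumptions, not the authors' code.
import numpy as np
import tensorflow as tf
from sklearn.model_selection import train_test_split

# Hypothetical dataset: 475 depth samples, 10 input logs, 64-bin NMR T2 output.
X = np.random.rand(475, 10).astype("float32")
y = np.random.rand(475, 64).astype("float32")

# Hold out 100 samples as the validation set; 375 samples remain for training.
X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=100,
                                                  random_state=0)

# Simple stand-in network mapping conventional logs to the NMR T2 bins.
model = tf.keras.Sequential([
    tf.keras.layers.Dense(32, activation="relu", input_shape=(10,)),
    tf.keras.layers.Dense(32, activation="relu"),
    tf.keras.layers.Dense(64),
])
model.compile(optimizer="adam", loss="mse")

# Stop training automatically when the validation loss stops improving.
early_stop = tf.keras.callbacks.EarlyStopping(
    monitor="val_loss", patience=20, restore_best_weights=True)

model.fit(X_train, y_train,
          validation_data=(X_val, y_val),
          epochs=1000, batch_size=32,
          callbacks=[early_stop], verbose=0)
```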
5 Comparison of the performances of the deep and shallow models
Both shallow- and deep learning models can be implemented for NMR T2 log
synthesis. Shallow models are easier to apply because of their simpler
architectures, lower training-resource requirements, and greater
explainability. The log-synthesis performance of the shallow models in terms
of the R2 score is in the range of 0.6–0.75. A simple ANN with two hidden
layers exhibits the best R2 among the shallow-learning models; nonetheless,
the ANN performs poorly at generalizing the NMR T2 distribution, so its
synthesis is not always physically consistent (Figs. 8.2 and 8.3).
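The R2 scores quoted throughout this comparison could be computed as sketched below. The array names and the depth-by-depth averaging convention are assumptions for illustration; the chapter does not specify the exact evaluation code.

```python
# Minimal sketch of the R2-score evaluation, assuming scikit-learn.
# T2_measured and T2_synthesized are hypothetical arrays of shape
# (n_depths, n_T2_bins) holding the inversion-derived and model-synthesized
# NMR T2 distributions for the testing dataset.
import numpy as np
from sklearn.metrics import r2_score

T2_measured = np.random.rand(100, 64)      # placeholder testing data
T2_synthesized = np.random.rand(100, 64)   # placeholder model output

# Average the depth-by-depth R2 between measured and synthesized distributions.
r2_per_depth = [r2_score(T2_measured[i], T2_synthesized[i])
                for i in range(T2_measured.shape[0])]
print("Average R2 score:", np.mean(r2_per_depth))
```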
The four deep learning models implemented in this study process the same
dataset as the shallow-learning models. The two-step training process used for
VAE-, VAEc-, and GAN-based deep neural networks leads to stable and
physically consistent NMR T2 distributions, which resemble those in
the training set (Figs. 8.5 and 8.6). The performance of the four deep
learning models is better than that of the shallow-learning models. The average
R2 score for the log synthesis accomplished by the deep models ranges from