Page 88 - Machine Learning for Subsurface Characterization
P. 88

Shallow neural networks and classification methods Chapter  3 73


             Flag-5 captures difference in the deviation of pore size distributions about a
             dominant pore size. These flags help improve the prediction performance as
             they provide additional relevant information to the predictive models about
             each depth.
                An expert geologist provided us formation tops and dominant lithology
             along the well length. We used this information to assign integer values to
             Flag-1 for each depth. Values for Flags 2–5 were manually assigned to each
             depthbyexamining theNMR T 2 distribution for that depth. After manually
             creating the flags, which are categorical features, we trained a k-nearest neigh-
             bor (KNN) classifier (supervised learning) to correctly predict the categorical
             values of the five flags by processing a specific combination of conventional
             logs and inversion-derived logs. KNNtrainingshouldbedoneonlyon the
             training dataset used for training the ANN models for NMR T 2 synthesis. This
             avoids contaminating the performances of the ANN models when synthesiz-
             ing the NMR T 2 distributions for the testing dataset. Once the KNN is trained,
             new conventional and inversion-derived logs from new wells without the
             NMR T 2 distributions can be processed to generate the Flags 2–5. The
             KNN predictions of Flags 2–5 along with Flag 1 obtained from the Geologist
             are fed into the ANN models along with the conventional and inversion-
             derived logs for purposes of NMR T 2 synthesis in the new wells. Our hypoth-
             esis is that the Flags 1–5 act as additional informative features that help
             improve the performances of the ANN models. Table 3.B1 lists the substantial
             improvement in predictive performance of the first ANN model when the flags
             were generated and then used as additional features when training and testing
             the ANN model. Fig. 3.C1 shows that Flags 1–4 are very important features for
             the proposed NMR T 2 synthesis task.
                When creating the training dataset, Flags 2–5 can be easily generated by
             examining the available T 2 distributions acquired along the well length and then
             assigning specific values to the Flags for each training depth depending on the
             characteristics of the T 2 distribution at that depth. However, for testing data or
             any new data, T 2 distribution needs to be predicted and is unavailable; conse-
             quently, we cannot manually assign specific values to these flags. So, for the
             testing data and new data, we first train a KNN classifier on the training dataset
             with manually assigned flags to relate the available conventional logs to the
             flags. Following that, the trained KNN classifier is used to process the available
             conventional logs in the testing dataset and new dataset to generate the flags.
             Flags 2–5 need to be predicted prior to the primary objective of generating
             the T 2 distribution. The goal of KNN classifier is to first learn to relate the avail-
             able “easy-to-acquire” conventional and inversion-derived logs to the Flags 2–5
             in the training dataset, in the presence of NMR T 2 distribution. The trained KNN
             is then used to process the easy-to-acquire conventional logs to generate the
             Flags 2–5, in the absence of NMR T 2 distribution. Twenty-two conventional
             and inversion-derived logs are used as features to predict the Flags 2–5one
             by one with four separate KNN classifiers.
   83   84   85   86   87   88   89   90   91   92   93