Page 222 - Machine Learning for Subsurface Characterization
P. 222

192 Machine learning for subsurface characterization


            3.2 Data preparation
            Data preparation, or data preprocessing, affects the convergence time of neural
            networks during the training phase. Each input has a different range; for
                                                              3
                                                           3
            example, illite volume fraction ranges from 0.1 to 0.5 m /m , whereas water
                                              3
                                                 3
            and oil volume fraction is below 0.1 m /m . Each input data is scaled to
            have zero mean and unit variance, which is also referred as the standard
            scaler. Such scaling is suitable when there are only a few outliers in the data.
            As the number of outliers increase, it is advisable to use robust scaling.
            Output/target data generally do not require either scaling or normalization.
            Outliers in targets were screened and removed by fitting NMR T2
            distribution, discretized into 64 T2 bins, with a cubic spline and then
            removing those with three peaks and unusual characteristics when fitting the
            NMR T2. Depth shift was performed to ensure that the 10 inversion-derived
            logs and the corresponding NMR T2 distribution correspond to the same
            depth. This ensures that the deep neural networks are well trained to relate
            10 features (inversion-derived logs) with 64 targets (NMR T2 distribution
            discretized into 64 T2 bins).
            4 Neural network architectures for the NMR T2 synthesis

            4.1 Introduction to NMR T2 synthesis
            We implement four deep neural network architectures that can learn to process
            the seven mineral volume content logs, namely, illite, chlorite, quartz, calcite,
            dolomite, anhydrite, and kerogen, and three fluid saturation logs, namely, bound
            water, free water, and oil, to synthesize NMR T2-distribution responses along a
            well length. These 10 logs will be referred as the inversion-derived logs. The
            first half of the VAE-NN, GAN-NN, and VAEc-NN architectures learn to
            abstract and understand the concept of NMR T2 distribution. These three
            architectures first train deep neural networks to accurately reproduce the
            NMR T2 distribution in the training dataset by extracting certain
            representative features of the NMR T2 distribution; following that, a simple
            multilayer neural network is connected with the pretrained deep neural
            network to reliably synthesize the NMR T2 distribution from the 10
            inversion-derived logs. Instead of individually predicting T2 amplitude for
            each of the 64 T2 bins of the NMR T2 distribution, we implement deep
            networks to simultaneously predict T2 amplitudes for the 64 T2 bins to
            generate the complete NMR T2 spectra, thereby providing greater constraint

            FIG. 7.1, CONT’D formation tops or lithology indicators provided by the geology expert. Tracks
            1,2,3,and4aretheconventionallogsonwhichinversionisperformedtoobtaintheinversion-derived
            logs in Track 5. CMR_ADT in Track 6 is NMR T2 distribution response that is the target for the data-
            driven modeling. NMR T2 distribution response at a specific depth comprises 64 T2 amplitudes as a
            function of T2 times. Deep neural networks were trained to relate the 10 inversion-derived logs to the
            entire NMR T2 spectra, comprising 64 T2 amplitudes as a function of T2 times.
   217   218   219   220   221   222   223   224   225   226   227