Page 312 - Machine Learning for Subsurface Characterization
P. 312

Classification of sonic wave Chapter  9 273


             discontinuities (lower row). Three concentration parameters (kappa, k, and
             inverse of dispersion) equal to 0, 5, and 1000 are selected for this study. When
             the kappa equals 0, the orientations of discontinuity follow a uniform distribu-
             tion. When the kappa equals 5, the orientations are centered at 0 degree and pri-
             marily dispersed between  50 and 50 degrees. When the dispersion parameter
             equals 1000, all the fractures are almost parallel to the y-axis, which is 0 degree.
             These three types of materials represent high, intermediate, and low dispersivity
             of discontinuities around the primary orientation.
                When compressional wave propagates through such materials containing
             discontinuities, the multipoint measurements of compressional wavefront travel
             times are too complicated for humans to recognize and differentiate in terms of
             the dispersion of the embedded network of discontinuities that give rise to the
             measured travel-time signature. There do not exist simple simulation techniques
             and physical law to characterize such embedded network of discontinuities
             based on the measurements of the travel times. Machine learning provides a
             possible way to extract complex relationships/patterns from the multipoint
             wavefront travel-time measurements and relate them to the dispersion of the
             embedded network of discontinuities. To that end, DFN model was used to gen-
             erate 10,000 samples (realizations) for each of the three types of dispersion
             around the primary orientation. In total, the training/testing dataset contains
             30,000 samples. The compressional wavefront travel times originate from
             one source and are measured at 28 sensors/receivers located on the three bound-
             aries of the material, as shown in Fig. 9.22. The boundary on which one trans-
             mitter/source is placed is known as the source-bearing boundary. There are no
             receivers on the source-bearing boundary. The receivers are referred using indi-
             ces ranging from 0 to 27. For purposes of developing data-driven models, each
             sample in the dataset contains 28-dimensional feature vector, representing the
             travel times measured at the 28 receivers. Each sample was assigned a label,
             denoting a material type, for purpose of model training and testing. The
             user-assigned label categorically represents the dispersion around primary ori-
             entation of the embedded network of static discontinuities in a sample. The
             dataset of compressional wavefront travel times with associated user-assigned
             labels is processed by the nine classifiers to learn to noninvasively characterize
             the material containing discontinuities in terms of dispersion around primary
             orientation. The classification task is to classify the 28-dimensional travel-time
             measurements into the four classes representing the dispersion of the embedded
             network of discontinuities.


             5.1.2 Model accuracy
             Nine classifiers are applied to the dataset: KNN, SVM with linear kernel, SVM
             with radial basis function (RBF) kernel, decision tree, random forest, AdaBoost,
             Naı ¨ve Bayes, ANN, and voting classifier. These nine classifiers cover the most
             popular classifiers used in machine learning and data-driven approaches.
   307   308   309   310   311   312   313   314   315   316   317