Page 193 - Machine Learning for Subsurface Characterization
P. 193

Dimensionality reduction and clustering techniques Chapter  6 167


             Though the formulation shown in Eq. (6.5) is simple, calculating the
             incorporated parameters from well logs is a challenge.

             4.2.1 Step 1: NMR decomposition using factor analysis
             Wireline NMR T2 distribution (Fig. 6.5, Track 1) is an overall response of all
             the fluid phases/components filling a range of pore sizes, such that the T2
             signature of a fluid phase overlaps with those from other phases/components.
             The NMR T2 distribution used in this study are T2 amplitudes measured at 64
             discrete T2 times in the range of 0.3 ms to 3000 ms. In other words, the T2
             distribution has been discretized into 64 T2 bins. The mix of constituent
             signals masks the information related to the relative volumes and pore sets
             occupied by various fluid phases in the pore. Volume fractions of various
             fluid phases in Eq. (6.5) are obtained by decomposing the NMR T2
             distribution into the contributions of individual fluid phases/components,
             which in our study are oil and water. The proposed decomposition assumes
             fluid phases possess distinct properties and occupy different sets of pore
             spaces. We implement Jain et al.’s [32] factor analysis to decompose the
             subsurface NMR-T2 distribution responses acquired in the shale formation.
             Factor analysis is a statistical method to extract unobserved latent variables,
             so it can only be applied to formations that share similar formation
             characteristics. Factor analysis describes variability among several observed,
             correlated variables in terms of a potentially lower number of unobserved
             variables called factors. Hence, it serves as a dimensionality reduction
             technique. For example, the variations in the 64 bins of NMR T2 distribution
             mainly reflect the variations in unobserved (underlying) variables/factors,
             such as bound water, bound oil, free water, and free oil.
                Much like cluster analysis that involves grouping similar samples into clusters,
             factor analysis involves grouping similar features into fewer constructed features.
             The purpose of factor analysis is to explain observed features/variables (in our
             study, the measurements for the 64 bins of NMR T2 distribution) in terms of a
             much smaller number of variables called factors. These factors represent
             underlying concepts that cannot be adequately measured by the original
             variables. Factor analysis simplifies data. We are implementing an exploratory
             factor analysis to compute the underlying factors that influence the NMR T2
             distribution response measured over 64 discrete bins without setting any
             predefined structure to the outcome. Factor analysis involves factor extraction
             followed by factor rotation. Factor extraction involves making a choice about
             the type of model as well as the number of factors to extract. Factor rotation
             comes after the factors are extracted, with the goal of achieving simple structure
             in order to improve interpretability. Factor analysis is applied to units with
             relatively similar pore structures. In our study the upper and lower shales are
             remarkably similar. The middle shale is associated with lower TOC and larger
             pore sizes. Consequently, factor analysis is jointly applied to the upper and
             lower shales and separately on the middle shale. Five factors were identified in
             the upper and lower shales, and eight factors were identified in the middle
             sections (Fig. 6.4). Each factor represents a fluid phase occupying a set of pores.
   188   189   190   191   192   193   194   195   196   197   198