Page 195 - Vibrational Spectroscopic Imaging for Biomedical Applications
P. 195

W idefield Raman Imaging of Cells and T issues   171


        within that pixel. The number of pixels within an image depends on
        the microchip size of the Raman camera. For example, a typical high
        resolution EMCCD camera may have 512 × 512 pixels; therefore, it is
        possible to have more than 200,000 spectra in an image. However, the
        number of pixels in an image will decrease when the binning param-
        eters are increased. Nonetheless, there are copious spectra that make
        up an image and require the use of chemometric techniques to ana-
        lyze the data. Chemometrics is defined as the use of mathematics and
        statistics on chemical data in order to extract useful information that
        can be used for decision making. The following section will describe
        some of the chemometric techniques used in the analysis of a wide-
        field Raman image. Each analytical technique will be briefly described,
        but for a more rigorous examination of the techniques, several books
        on chemometrics are referenced herein. 64–66

        6.5.1 Principal Component Analysis
        In the case of a widefield Raman image of a biological tissue, there are
        abundant associated Raman spectra. Because they consist of the same
        biomaterials (proteins, lipids, nucleic acids, carbohydrates), they are
        all very similar; however, within these similar spectra, there should
        be some finite number of independent variations occurring in the
        spectral data. Hopefully, the largest variations in the spectral data
        would be the changes in the spectrum due to different concentrations
        of biological molecules that comprise the cells or tissue. Other possi-
        ble variations are due to instrument variation (unless removed by
        preprocessing steps), environmental conditions, differences in sam-
        ple preparation, and so on. It is possible to calculate a set of “variation
        spectra” that represent the changes in the Raman scatter at all wave-
        lengths in the spectra; these variation spectra could be used instead of
        the spectral data for comparison. There should be fewer common
        variations amongst the data than the number of spectra; although,
        since they come from the original data, the variation spectra retain
        the interrelationship of the original spectra.
            The variation spectra are called eigenvectors or principal compo-
        nents (PCs). The method of breaking down a set of spectroscopic data
        into its most basic variations is called principal component analysis or
        PCA. It is mathematically defined as an orthogonal linear transforma-
        tion that transforms the data to a new coordinate system such that the
        greatest variance by any projection of the data comes to lie on the first
        coordinate (called the first principal component or first PC), the second
        greatest variance on the second coordinate, and so on.
            For example, imagine all the spectra from a widefield Raman
        image plotted in multi-dimensional space. The first PC would be a
        vector plotted through the data to find a single axis to capture or span
        as much of the variance of the data as possible. This is accomplished
        by a least-squares fit of the data to the new axis. Once the first PC is
   190   191   192   193   194   195   196   197   198   199   200