Page 45 - Handbook of Deep Learning in Biomedical Engineering Techniques and Applications
P. 45

Chapter 2 Deep convolutional neural network in medical image processing  33




               diseases. The extracted images using different modalities as pre-
               sented in Table 2.1 are used as input to the ML/DL model.
               Currently, DL, especially CNN has proven to be an effective
               advancement of ML for the analysis of medical imaging. There-
               fore, in the subsequent section, the architecture of different
               CNN followed by its application to the analysis of different dis-
               eases encountered in different organs of the human body is
               explicitly presented.


               3. Convolutional neural network and its
                   architectures

                  DL is a subset of ML in which several linear, as well as
               nonlinear, processing units are organized in a deep layer design
               so that it could model the abstraction in the data [6]. At present,
               many DL techniques are in use for the diverse area of applications.
               Different DL techniques include Boltzman machines, autoen-
               coders, recurrent neural network, deep belief networks, and deep
               CNNs. Among these techniques, CNN-based models have more
               applications in the domain of medical image processing.
                  CNNs are named so due to the presence of convolutional
               layers in their construction [38]. Detection of certain local fea-
               tures in every location of the particular input image is the main
               work of the convolutional layers. Each of the nodes in a convolu-
               tional layer is linked with only a small subset of spatially con-
               nected neurons for the detection of the local features in the
               input image. To aid in the search of the same local features all
               through the input channels, the weights of the connection are
               shared among the nodes of the convolutional layers. Each set of
               these shared weights is named as the convolutional kernel or sim-
               ply kernel. These convolutional layers along with kernels gradu-
               ally learn to detect local features whose strength throughout the
               input images is evident in the final feature maps. To achieve a hi-
               erarchical set of features and to lessen computational complexity,
               each of the convolutional layer sequences is followed by a pooling
               layer. The pooling layer is evocative of simple as well as complex
               cells in the primary visual cortex [39]. The max-pooling layer plays
               a major role in reducing the feature maps' size by choosing the
               maximum feature response in local neighborhoods and removing
               the exact location of such maximum responses. As a result of this
               process, max-pooling further improves translational invariance.
                  A typical CNN model comprises multiple convolutional and
               pooling layer pairs followed by several consecutive fully con-
               nected layers as presented in Fig. 2.5. Finally, the network ends
   40   41   42   43   44   45   46   47   48   49   50