Page 156 - Artificial Intelligence in the Age of Neural Networks and Brain Computing
P. 156

2. AI Development     145




                  development of a navigation aide for Dr. Strange in his bizarre world. Our data
                  must accord with the task we are using it for.
                     We don’t know “ground truth” for our casesdwe never do. Unless our data is
                  simulateddand thereby not directly applicable to real-world problemsdwe always
                  have a problem knowing the validity of assignment of cases to our classes. Well,
                  perhaps that’s not much of a problem for distinguishing elephants from people; how-
                  ever, I’ll bet that for pet images from the internet, some of our dogs would meow. In
                  the medical context we often use human experts for ground truth, and most people
                  would readily agree that radiologists can make mistakes in the assignment of normal
                  versus lesion present cases from radiographs. What is more startling is the propensity
                  for disagreement among pathologists, see Fig 7.6. If we use pathology to determine
                  ground truth, then for the sake of our equanimity we might use one pathologist rather
                  than a panel; because, if we do the latter, we’ll discover how frequently they disagree.
                  Of course, this is not a recommendation, because we need to use the best possible
                  estimate for our ground truth.
                     This leads to a second “ground truth” concern. Obviously there are easier and
                  tougher cases for any discrimination task, and clearly the uncertainty in ground truth
                  is directly correlated with the degree of case difficulty.
                     Our data are noisy. All data are noisy, especially if they come from sensors of
                  some kind. This at least should be quantifiable. If the data are mediated by human
                  intervention, then additional variation is to be expected. We must try to identify
                  and where possible minimize the noise in our data to avoid propagating error in
                  our features and final outcome.
                     Our data are incomplete. Images can be misplaced; questionnaire items can be
                  left blank; data items can be “out of range” and nonsensical. For each of our data
                  points or cases we need to be sure that all appropriate data are present or we must
                  have a justifiable procedure in place to handle any deficits.
















                  FIGURE 7.6
                  A panel of three typical pathologists. We are particularly concerned by the results reported
                  by the one on the left. For pathology discordance see, for example, Elmore et al. [7],
                  finding that “Among the 3 consensus panel members, unanimous agreement of their
                  independent diagnoses was 75%.”
                                                                   Image Garosha j Dreamstime.
   151   152   153   154   155   156   157   158   159   160   161