Page 169 - Data Architecture

Chapter 4.6: Textual Disambiguation





































               Fig. 4.6.4 Mapping.


           In almost every case, the mapping is built iteratively. The analyst creates a first
           mapping for a document type, processes a few documents, and reviews the results. The
           analyst then makes a few changes and reruns the same documents through textual
           disambiguation with the new mapping specifications. This gradual refinement of the
           mapping continues until the analyst is satisfied.


           The mapping is created iteratively because documents are notoriously complex and
           carry many nuances that are not immediately apparent. Even for an experienced
           analyst, the creation of the mapping is an iterative process.


           Because the mapping is created iteratively, it NEVER makes sense to create a mapping
           and then process thousands of documents using the initial mapping. Such a practice is
           wasteful because the initial mapping is almost guaranteed to need refinement, and
           every document processed with it would then have to be processed again.
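
The refinement cycle described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the way any particular textual disambiguation product works: the mapping is assumed to be a dictionary of field names to regular expressions, and the names `apply_mapping`, `refine_mapping`, and `analyst_review` are hypothetical. The analyst's judgment is stood in for by a function that inspects the results from a small sample and returns rule changes, or nothing when satisfied.

```python
import re
from typing import Callable, Dict, List, Optional

# Hypothetical mapping format: field name -> regex with one capture group.
Mapping = Dict[str, str]
Result = Dict[str, Optional[str]]


def apply_mapping(mapping: Mapping, document: str) -> Result:
    """Run every rule in the mapping against one document."""
    result: Result = {}
    for field, pattern in mapping.items():
        match = re.search(pattern, document)
        result[field] = match.group(1) if match else None
    return result


def refine_mapping(
    mapping: Mapping,
    sample: List[str],
    review: Callable[[List[Result]], Mapping],
    max_rounds: int = 10,
) -> Mapping:
    """Rerun a small sample until the reviewer proposes no more changes."""
    current = dict(mapping)
    for _ in range(max_rounds):
        results = [apply_mapping(current, doc) for doc in sample]
        changes = review(results)  # stand-in for the analyst's inspection
        if not changes:
            break  # analyst is satisfied with the sample
        current.update(changes)
    return current


# A few sample documents, not the full collection.
sample = ["Invoice total: $120.50", "Invoice total: 87.00"]

# First attempt: assumes every amount carries a dollar sign.
initial = {"amount": r"total: \$(\d+\.\d{2})"}


def analyst_review(results: List[Result]) -> Mapping:
    """Simulated analyst: loosen the rule if any amount was missed."""
    if any(r["amount"] is None for r in results):
        return {"amount": r"total: \$?(\d+\.\d{2})"}
    return {}


final = refine_mapping(initial, sample, analyst_review)
```

Only after `refine_mapping` settles on the small sample would the resulting mapping be applied to the full set of documents, which is the point of the warning above about processing thousands of documents with an initial mapping.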


           Fig. 4.6.5 shows the iterative nature of the mapping process.





