Page 188 - Data Architecture

Chapter 4.7: Taxonomies
           sophisticated analysis of text.


           Note that once the text has been processed, the analyst can query on “car” and find
           every mention of any type of car, even though the term “car” appears nowhere in the
           raw text. This is just a glimpse of the value that taxonomies add when they are
           applied to text.
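
The effect described above can be sketched in a few lines of Python. This is a minimal illustration, not the processing used by any particular product: the taxonomy entries, the sample sentence, and the helper names `apply_taxonomy` and `query` are all assumptions made for the example.

```python
import re

# Hypothetical taxonomy: maps each specific term to its broader class.
# The entries are illustrative assumptions, not taken from the source.
CAR_TAXONOMY = {
    "porsche": "car",
    "honda": "car",
    "volkswagen": "car",
    "ford": "car",
}

def apply_taxonomy(raw_text, taxonomy):
    """Return (term, category, position) for every taxonomy term in the text."""
    annotations = []
    for match in re.finditer(r"[A-Za-z]+", raw_text):
        word = match.group().lower()
        if word in taxonomy:
            annotations.append((match.group(), taxonomy[word], match.start()))
    return annotations

def query(annotations, category):
    """Find every mention whose external classification equals the category."""
    return [term for term, cat, _ in annotations if cat == category]

# Hypothetical raw text; note that the word "car" never appears in it.
raw = "She drove her Porsche to work. He parked the Honda next to a Volkswagen."
processed = apply_taxonomy(raw, CAR_TAXONOMY)
car_mentions = query(processed, "car")
```

The query succeeds only because the category is attached to each term during processing. The raw text contributes the specific terms, and the taxonomy contributes the class they belong to.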


           The ability to classify data externally is extremely useful when disambiguating
           nonrepetitive unstructured data.



           Taxonomies and Textual Disambiguation: Separate Technologies



           Taxonomies require their own care and handling: they must be gathered, classified, and
           maintained. Usually, it makes sense to build and manage the taxonomy outside the
           technology used for textual disambiguation. Fig. 4.7.7 shows that arrangement.
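
One way to picture this separation is a sketch in which the taxonomy lives in its own document and the disambiguation step only consumes it. The JSON layout, the category contents, and the function names `load_taxonomy` and `classify` are assumptions made for illustration. In practice the taxonomy might live in a separate file or system, and a string stands in for it here so the example is self-contained.

```python
import json

# Hypothetical taxonomy document, maintained outside the text processor.
TAXONOMY_JSON = """
{
  "car": ["Porsche", "Honda", "Volkswagen"],
  "tree": ["oak", "elm", "pine"]
}
"""

def load_taxonomy(document):
    """Invert {category: [terms]} into a {term: category} lookup."""
    by_category = json.loads(document)
    lookup = {}
    for category, terms in by_category.items():
        for term in terms:
            lookup[term.lower()] = category
    return lookup

def classify(words, lookup):
    """The disambiguation side: it only consumes the lookup it is handed."""
    return {w: lookup[w.lower()] for w in words if w.lower() in lookup}

lookup = load_taxonomy(TAXONOMY_JSON)
result = classify(["Honda", "elm", "bicycle"], lookup)
```

Because `classify` never defines categories itself, the taxonomy can be extended or corrected without touching the processing code, which is the point of keeping the two apart.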













































