Page 87 - Biomimetics : Biologically Inspired Technologies
P. 87

Bar-Cohen : Biomimetics: Biologically Inspired Technologies DK3163_c003 Final Proof page 73 21.9.2005 11:40pm




                    Mechanization of Cognition                                                   73

                    context and accumulated knowledge. The particular example considered is an extension of the
                    architecture of Figure 3.2.
                      The confabulation architecture illustrated in Figure 3.3 allows the meaning content of a previous
                    sentence to be brought to bear on the continuation, by consensus building following a starter (shown
                    in green in Figure 3.3) for the second sentence. The use of this architecture, following knowledge
                    acquisition, is illustrated in Figure 3.4 (where for simplicity, the architecture of Figure 3.3 is
                    represented as a ‘‘purple box’’). This architecture, its education, and its use are now briefly explained.
                      The sentence continuation architecture shown in Figure 3.3 contains two of the sentence modules
                    of Figure 3.2; along with two new sentence meaning content summary lexicons (one above each
                    sentence module). The left-hand sentence module is used to represent the context sentence, when it is
                    present. The right-hand sentence module represents the sentence to be continued.
                      To prepare this architecture for use, it is educated by selecting pairs of topically coherent
                    successive sentences, belonging to the same paragraph from a general coverage, multi-billion-word
                    proper English text corpus. This sentence pair selection process can be done by hand by a human or
                    using a simple computational linguistics algorithm. Before beginning education, each individual
                    sentence module was trained in isolation on the sentences of the corpus.
                      During education of the architecture of Figure 3.3, each selected sentence pair (of which roughly
                    50 million were used in the experiment described here) is loaded into the architecture, completely
                    parsed (including the summary lexicon), and then counts were accumulated for all ordered pairs of
                    symbols on the summary lexicons. The long-term context knowledge base linking the first sentence
































                    Figure 3.3  Two-sentence hierarchical confabulation architecture for English text analysis or generation, illus-
                    trated as the functional machinery of a ‘‘purple box.’’ The sub-architectures for representing the first sentence
                    (illustrated on the left) and that for the second sentence — the one to be continued — illustrated on the right) are
                    each essentially the same as the architecture of Figure 3.2, along with one new lexicon and 20 new knowledge
                    bases. The one additional lexicon is shown above the phrase layer of lexicons of each sub-architecture. This
                    sentence meaning content summary lexicon contains symbols representing all of the 126,000 words and word
                    groups of the phrase-level lexicons (and can also have additional symbols representing various other standard
                    language constructions). Once the first sentence has been parsed; its summary lexicon has an expectation
                    containing each phrase-level lexicon symbol (or construction subsuming a combination of phrase symbols) that
                    is active. The (causal) long-range context knowledge base connects the summary lexicon of the first sentence to
                    the summary lexicon of the second sentence.
   82   83   84   85   86   87   88   89   90   91   92