Page 143 - Data Architecture

P. 143

Chapter 4.3: Parallel Processing

Fig. 4.3.5 Text is parsed then placed in the appropriate processor.

Fig. 4.3.5 shows that in the MPP architecture, the parsing of the data greatly affects the
placement of the data. One record is placed on one node. Another record is placed on
another node.

The great benefit of parsing the data and using the parsing information as the basis for the
placement of data is that the data are efficient to locate. When an analyst wishes to
locate a unit of data, the analyst specifies the value of data that is of interest to the
system. The system uses the algorithm that was used to place the data into the database
(typically a hashing algorithm), and the system locates the data very efficiently.

In the Roman census approach to parallelization, the sequence of events is different from
the MPP approach. In the Roman census approach, query is sent to the system to search
for some data. The data managed by a node are searched and then parsed. Upon parsing,
143

138 139 140 141 142 143 144 145 146 147 148