Page 240 - Mechanical Engineers' Handbook (Volume 2)

P. 240

6 Data Display and Reporting 229

2 h at 15:00, the starting point on a trend line would be 13:00 and the ending point would
be 15:00. Ten minutes later the starting point on the trend line would be 13:10 and the
ending point would be 15:10.
For relational data, there is often a desired time frame as described above (to restrict
the size of the data to be searched) and some selection criteria based on characteristics of
the data itself or of related data, including events. For example, one may wish to ﬁnd those
manufactured units for the past month that were for product X and see how many had quality
defects. As described above under third-party data acquisition systems, there are techniques
for selecting data from process history databases that approach (but do not equal) relational
capability.

Representation of Historical Data
The primary difference between current data and historical data is that there are multiple
data points, normally with an order deﬁned by the time they were acquired (for process
history data) and/or by their relationship to other variables and events (for relational data).
Once the time frame and relationships are selected, the data must have some method of
representation on a display.
Historical data have a number of potential representation techniques. Multiple data val-
ues can be combined into a single number such as average, standard deviation, mode, max-
imum, minimum, range, variance, and so on. These can then be represented by techniques
for individual data elements as described above. The equations in Section 4.4 are examples
of summary statistics formulas.
The simplest form to represent historical data is the list. Create a column for each
variable of interest. If they are all collected at the same time (the ‘‘records’’ described above),
then each time data were collected can be used as the ﬁrst value on the left in each row.
The data for each variable of interest can be placed on the row corresponding to its time
stamp, similar to that for ﬁles shown in Table 4.
Where data were collected with different time stamps, a new row can be created when-
ever a new value is obtained and values only placed in the column–row combinations where
there is a corresponding time stamp between the row’s time stamp and the time stamp of
the variable in question. While useful, the problem with this is that there are now holes in
the data. This list is very useful when viewing a trend line or graphical tool to validate
numbers, to verify time stamps, and diagnose problems with collection.
Often, one desires to view the relationships between data and time or other variables.
Trend plots typically are used to show the relationship between variables and time. The X
axis is typically represents the time range and the Y axis represents the value of the desired
variable. The range of data on the Y axis can vary depending on how one wishes to view
the data. The maximum and minimum of the data can be used to set the top and bottom of
the range of the Y axis, respectively. This can have two undesired effects. It may make small
movements in data appear to be very large when the range is small. In cases where there
are spikes in the data where a value is disproportionately high or low, representing the Y
axis based on the maximum and minimum could make it difﬁcult to view normal variation
in the data. One has to understand the potential use of the data to choose the Y-axis scale
appropriately.
When more than one variable is shown on a trend chart, the selection of scale of the Y
axis becomes more complex. If all the variables are representative of the same domain, such
as all temperatures, then perhaps the same Y axis can be used for all of them. Often, however,
the viewer is attempting to compare relative variations in data, sort of a poor man’s corre-
lation analysis. In this case it may be useful to have multiple Y axes and select the range of

235 236 237 238 239 240 241 242 243 244 245