Page 66 - Statistics for Dummies
P. 66

50
                                         Part I: Vital Statistics about Statistics

                                                    No matter how large a sample is, if it’s based on non-random methods, the
                                                    results will not represent the population that the researcher wants to draw
                                                    conclusions about. Don’t be taken in by large samples — first check to see how
                                                    they were selected. Look for the term random sample. If you see that term, dig
                                                    further into the fine print to see how the sample was actually selected and use
                                                    the preceding definition to verify that the sample was, in fact, selected ran-
                                                    domly. A small random sample is better than a large non-random one.
                                                    Statistic
                                                    A statistic is a number that summarizes the data collected from a sample.
                                                    People use many different statistics to summarize data. For example, data
                                                    can be summarized as a percentage (60% of U.S. households sampled own
                                                    more than two cars), an average (the average price of a home in this sample
                                                    is . . .), a median (the median salary for the 1,000 computer scientists in this
                                                    sample was . . .), or a percentile (your baby’s weight is at the 90th percentile
                                                    this month, based on data collected from over 10,000 babies).
                                                    The type of statistic calculated depends on the type of data. For example,
                                                    percentages are used to summarize categorical data, and means are used to
                                                    summarize numerical data. The price of a home is a numerical variable, so
                                                    you can calculate its mean or standard deviation. However, the color of a
                                                    home is a categorical variable; finding the standard deviation or median of
                                                    color makes no sense. In this case, the important statistics are the percent-
                                                    ages of homes of each color.
                                                    Not all statistics are correct or fair, of course. Just because someone gives you
                                                    a statistic, nothing guarantees that the statistic is scientific or legitimate. You
                                                    may have heard the saying, “Figures don’t lie, but liars figure.”
                                                    Parameter
                                                    Statistics are based on sample data, not on population data. If you col-
                                                    lect data from the entire population, that process is called a census. If you
                                                    then summarize the entire census information from one variable into a
                                                    single number, that number is a parameter, not a statistic. Most of the time,
                                                    researchers are trying to estimate the parameters using statistics. The U.S.
                                                    Census Bureau wants to report the total number of people in the U.S., so it
                                                    conducts a census. However, due to logistical problems in doing such an
                                                    arduous task (such as being able to contact homeless folks), the census num-
                                                    bers can only be called estimates in the end, and they’re adjusted upward to
                                                    account for people the census missed.










                                                                                                                           3/25/11   8:17 PM
                             08_9780470911082-ch04.indd   50                                                               3/25/11   8:17 PM
                             08_9780470911082-ch04.indd   50
   61   62   63   64   65   66   67   68   69   70   71