Page 63 - Intermediate Statistics for Dummies
P. 63

06_045206 ch02.qxd  2/1/07  9:42 AM  Page 42
                                42
                                         Part I: Data Analysis and Model-Building Basics
                                                       Designing the data-collection instrument: Poorly designed instruments
                                                        (including surveys) can result in inconsistent or even incorrect data.
                                                        For example, a survey question’s wording plays a large role in whether
                                                        or not results are biased. A leading question can make people feel like
                                                        they should answer a certain way. For example: “Don’t you think that
                                                        the president should be allowed to have a line-item veto to prevent gov-
                                                        ernment spending waste?” Who would feel they should say no to that?
                                                       Collecting the data: In this case, bias can infiltrate the results if some-
                                                        one makes errors in the recording of the data or if interviewers deviate
                                                        from the script.
                                                       Deciding how and when the data is collected: The time and place you
                                                        collect data can affect whether your results are biased. For example, if
                                                        you conduct a telephone survey during the middle of the day, people
                                                        who work from nine to five aren’t able to participate. Depending on the
                                                        issue, the timing of this survey could lead to biased results.
                                                    Bias can creep into a data set very easily. The best way to deal with bias is to
                                                    avoid it in the first place. You can do this in two major ways:
                                                       Use a random process to select the sample from the population. The
                                                        only way a sample is truly random is if every single member of the popu-
                                                        lation has an equal chance of being selected. Self-selected samples aren’t
                                                        random.
                                                       Make sure that the data is collected in a fair and consistent way. Be
                                                        sure to use neutral question wording and time the survey properly.
                                                    Settling the variance controversy:
                                                    The battle of n–1 versus n
                                                    Not all statistical formulas are free of bias. In other words, some statistics
                                                    have good characteristics (like offering great precision) and some not-so-
                                                    good characteristics (like not giving the best possible result in all situations).
                                                    Statisticians definitely prefer statistics that are both precise and unbiased,
                                                    and the techniques you find in this book have both qualities. However, pre-
                                                    cise and unbiased statistics doesn’t always happen naturally; sometimes the
                                                    basic idea requires a little tweaking to get a statistic that actually meets the
                                                    standards of the statistical powers that be (of which I am not one). The clas-
                                                    sic example of this need to fine-tune is the formula for the variance of a data
                                                    set, which I describe in the following section.
   58   59   60   61   62   63   64   65   66   67   68