Page 43 - Statistics II for Dummies
P. 43

Chapter 2: Finding the Right Analysis for the Job  27


                                In order for you to compare these complex relationships, you must build a
                                model to evaluate each group’s impact on political affiliation (or some other
                                categorical variable). This kind of model-building is explored in-depth in
                                Chapter 8, where I discuss the topic of logistic regression.

                                Logistic regression builds models to predict the outcome of a categorical
                                variable, such as political affiliation. If you want to make predictions about a
                                quantitative variable, such as income, you need to use the standard type of
                                regression (check out Chapters 4 and 5).



                      Statistics for Quantitative Variables


                                Quantitative variables, unlike categorical variables, have a wider range of
                                statistics that you can do, depending on what questions you want to ask. The
                                main reason for this wider range is that quantitative data are numbers that
                                represent measurements or counts, so it makes sense that you can order,
                                add or subtract, and multiply or divide them — and the results all have
                                numerical meaning. In this section, I present the major data-analysis tech-
                                niques for quantitative data. I expand on each technique in later chapters of
                                this book.



                                Making estimates

                                Quantitative variables take on numerical values that involve counts or mea-
                                surements, so they have means, medians, standard deviations, and all those
                                good things that categorical variables don’t have. Researchers often want to
                                know what the average or median value is for a population (these are called
                                parameters). To do this requires taking a sample and making a good guess,
                                also known as an estimate, of that parameter.

                                To find an estimate for any population parameter requires a confidence
                                interval. For categorical variables, you would find a confidence interval to
                                estimate the population mean, median, or standard deviation, but by far the
                                most common parameter of interest is the population mean.

                                A confidence interval for the population mean is the sample mean plus or
                                minus a margin of error. (To calculate the margin of error in this case, see
                                Chapter 3.) The result will be a range of likely values you have produced for
                                the real population mean. Because the variable is quantitative, the confi-
                                dence interval will take on the same units as the variable does. For example,
                                household incomes will be in thousands of dollars.












          06_466469-ch02.indd   27                                                                    7/24/09   9:31:38 AM
   38   39   40   41   42   43   44   45   46   47   48