Page 339 -
P. 339

CHAPTER


                  Automated data collection                         12

                  methods








                  12.1  INTRODUCTION
                  Data are the building blocks of research. As the recorded output of research efforts,
                  data are the raw materials that must be processed, analyzed, and interpreted to pro-
                  vide answers to research questions. Data collection is therefore a critical phase in any
                  research effort.
                     Data collection is also often one of the most challenging aspects of research.
                  Timing user task completion with a stopwatch, furiously writing notes describing user
                  interactions with software systems, coding notes from ethnographic observations, and
                  many other tasks are laborious, time consuming, and often—as a result—error-prone.
                     Fortunately, human-computer interaction (HCI) researchers can use the comput-
                  ers that are the subject of our research as powerful data collection tools. Software
                  tools can be used to collect vast amounts of user interaction data, often with little or
                  no direct effort on the part of the researcher administering the study. Interaction log-
                  ging software tracking keystrokes and mouse clicks, special-purpose instrumented
                  software designed to track use of specific features in tools, web site access logs, and
                  home-grown customized tools for tracking what users do and when can simplify data
                  collection, increase consistency, and decrease error.
                     Approaches to automated data collection can generally be placed on a spectrum
                  of ease of use and flexibility (Figure 12.1). Existing software tools such as web-
                  site access log analyzers can often be easily used or adapted for research purposes,
                  but capabilities might be limited. System observation and logging software may be
                  somewhat more powerful, but installation and configuration issues can be challeng-
                  ing. Custom-built or modified software can be crafted to meet the precise research
                  needs, but the development effort can be substantial.
                     All of these automated methods for computerized data collection are capable of
                  producing voluminous data sets. This can pose a substantial problem for researchers:
                  while generating data is easy, deciding which data to collect and how best to analyze
                  that data can be challenging. Although many projects involving automated data col-
                  lection (including data collected from human subjects—see Chapter 13) follow a
                  familiar arc of planning, data collection, cleaning, analysis, and iteration to refine
                  methods and techniques, details vary between projects due to technological differ-
                  ences in data acquisition methods and analytic differences associated with varying
                  research questions. Some of these issues will be discussed here and in Chapter 13,


                  Research Methods in Human-Computer Interaction. http://dx.doi.org/10.1016/B978-0-12-805390-4.00012-1  329
                  © 2017 Elsevier Inc. All rights reserved.
   334   335   336   337   338   339   340   341   342   343   344