Page 437 -
P. 437

428    CHAPTER 14  Online and ubiquitous HCI research





                           CAPTCHA AND reCAPTCHA—CONT'D
                             Subsequent work merged CAPTCHA's goal of using human intelligence
                           tasks as security with the ESP games notion of using these tasks to
                           accomplish useful work, leading to the reCAPTCHA tool (von Ahn et al.,
                           2008). reCAPTCHA was designed to solve the problem of digitizing
                           text that had proven challenging for optical-character recognition (OCR)
                           systems. reCAPTCHA provides users with images including text that has
                           proven difficult for computer vision systems to interpret. Specifically, the
                           original reCAPTCHA asked users to decipher words that have each failed
                           to be consistently recognized by two different OCR programs. Each time a
                           reCAPTCHA is used, the user is asked to interpret images containing two
                           words: one for which the interpretation is known, and another which has not
                           yet been classified. If the user provides a correct answer for the known word,
                           the answer for the other word is assumed to be correct. Each word is presented
                           to multiple users, and words can be promoted to become known words if
                           sufficient accurate human guesses are provided. All words are distorted in an
                           attempt to defeat computer vision programs (Figure 14.1A) (von Ahn et al.,
                           2008). reCAPTCHA has been used on many web sites to provide the security
                           that motivated the design of the original CAPTCHA, primarily verification
                           of user registration and login on web sites. reCAPTCHA was purchased by
                           Google in 2009 (Zlatos, 2009), with subsequent evolution of the tool including
                           variants for labeling images (Figure 14.1B) and predictive tools capable of
                           identifying users as human based on interactions with the widget, without the
                           need for image labeling (Shet, 2014).
                             reCAPTCHA's use of images highlights a key design challenge. The image-
                           labeling tasks in the ESP game were purely entertainment on the part of the
                           users. CAPTCHAs, on the other hand, are often used on sites that might be the
                           sole route for users to access functionality needed for personal or professional
                           purposes. As a result, accessibility becomes a key concern, as some users—
                           particularly those with low vision or blindness—might struggle with some of
                           the images used in tools like reCAPTCHA. This problem is magnified by the
                           nature of the tools—by definition, the images used in reCAPTCHA are those
                           that have been in some ways hard to process. reCAPTCHA has always had an
                           audio option, which has generally asked users to type a sequence of spoken
                           digits. Alternative CAPTCHA tests have been the subject of multiple research
                           efforts (Sauer et al., 2010; Davidson et al., 2014).


                            Although reCAPTCHA is likely the most familiar human computation task, the
                         notion of using games to motivate participation has been used in many different do-
                         mains. Online games have been particularly successful in scientific fields, with the
                         Fold.It game (http://fold.it) harnessing the power of multiple users to generate high-
                         quality protein models (Khatib et al., 2011; Eiben et al., 2012) and bioinformatics
   432   433   434   435   436   437   438   439   440   441   442