CAPTCHA

Identifying humans

Paul Krzyzanowski

April 25, 2024

Introduction

CAPTCHA (Completely Automated Public Turing test to tell Computers and Humans Apart) is not a technique to authenticate users but rather a technique to determine whether a system is interacting with a human being or with automated software. It is used primarily to prevent automated software, or bots, from performing actions such as spamming, creating large numbers of accounts, or scraping large amounts of content from websites.

The main idea behind CAPTCHA is to create a test that computers find difficult to solve but that humans can solve easily. CAPTCHAs typically involve tasks such as identifying distorted text, solving simple puzzles, or recognizing objects in images, which require perceptual and cognitive skills that remain challenging for software to mimic effectively.
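As an illustration of the basic mechanic, here is a minimal sketch of the server-side bookkeeping for a text CAPTCHA. The code is hypothetical: a real system would render the answer as a distorted image and bind the token to a user session, neither of which is shown here.

```python
import hashlib
import secrets
import string

def new_challenge(length=6):
    """Generate a random answer string and the token the server stores.

    The answer is what would be rendered as a distorted image; the server
    keeps only a hash of it.
    """
    answer = "".join(secrets.choice(string.ascii_uppercase) for _ in range(length))
    token = hashlib.sha256(answer.encode()).hexdigest()
    return answer, token

def verify(user_input, token):
    """Check the user's response against the stored token (case-insensitive)."""
    digest = hashlib.sha256(user_input.strip().upper().encode()).hexdigest()
    return secrets.compare_digest(digest, token)

answer, token = new_challenge()
print(verify(answer, token))       # a correct response passes
print(verify(answer + "A", token)) # an incorrect one fails
```

Storing only a hash of the answer, and comparing with a constant-time check, keeps the expected response out of anything the client (or an attacker who reads server state) can trivially inspect.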

History

The history of CAPTCHAs began in the late 1990s when the Internet started facing significant issues with automated bots. The first CAPTCHA system was developed by researchers at AltaVista in 1997 to prevent automated URL submissions to their search engine, which were skewing search engine rankings. This system, developed by Andrei Broder and his colleagues, involved asking users to identify and type the characters that appeared in a distorted image.

Shortly thereafter, in 2000, a more formalized version of the CAPTCHA was created by Luis von Ahn, Manuel Blum, Nicholas Hopper, and John Langford from Carnegie Mellon University. They termed their invention “CAPTCHA” as an acronym for “Completely Automated Public Turing test to tell Computers and Humans Apart.” This innovation was aimed at enhancing security systems and ensuring that the users were human by requiring them to solve tasks that computers were unable to perform efficiently at the time, such as recognizing distorted text. This system was focused on presenting distorted words from a relatively small set of words.

Around the same period, a similar concept called “BaffleText” was developed independently by researchers at Palo Alto Research Center. BaffleText focused specifically on preventing automated scripts from accessing web services by presenting randomly generated, linguistically based strings of text that were visually distorted.

These early developments laid the groundwork for various forms of CAPTCHAs used across the web today, evolving in complexity to counter increasingly sophisticated bots.

Why we still need CAPTCHAs

CAPTCHA mechanisms are necessary for multiple reasons, despite the challenges in implementing them:

  1. Preventing automated exploits
    CAPTCHAs are an essential first line of defense against basic automated bots that attempt to exploit websites for malicious purposes, such as spamming forums, comment sections, or emails; brute-force attacks to guess user credentials; and mass account creation for fraud or resource abuse.

    While advanced bots may bypass CAPTCHAs, many simpler bots still fail, making CAPTCHA an effective deterrent against low-level threats. CAPTCHA systems are part of a multi-layered security strategy to reduce the risk of automated attacks even when a CAPTCHA isn’t foolproof.

  2. Minimizing Resource Abuse
    Many services are designed with a limited allocation of resources (e.g., free trials, APIs, or cloud computing credits). CAPTCHAs ensure that these resources are only accessed by legitimate users, preventing exploitation by bots that can overwhelm the system and deny resources to real users.

  3. Enhancing Data Quality
    Automated bots can skew analytics and pollute data by flooding systems with fake inputs, registrations, or interactions. CAPTCHAs help ensure that collected data represents genuine human behavior, making it more reliable for decision-making, marketing, or research.

  4. Human Verification in Specific Contexts
    In some scenarios, it is still critical to verify the presence of a human, such as online voting or surveys, confirming the authenticity of e-commerce purchases, or preventing ticket scalping for events (including things like restaurant reservations).

Problems with CAPTCHAs

Man-in-the-middle attacks

CAPTCHAs are susceptible to a form of man-in-the-middle attack in which the puzzle is relayed to low-cost (or free) human workers whose job is to solve CAPTCHAs. These services are called CAPTCHA farms, and their goal is to apply pools of human labor to help bots (i.e., to do the tasks that cannot be scripted effectively).

In this attack, when the bot is presented with a CAPTCHA test, it forwards the request to a CAPTCHA farm so it can be completed by a human. The human-generated response is sent back to the bot, which can present it to the website.
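The relay described above can be sketched as follows. Everything here is an illustrative simulation: the website, the session identifier, and the farm are stand-in functions, and in a real attack the farm would be an HTTP service backed by human workers.

```python
# Illustrative simulation of the CAPTCHA-farm relay (all names hypothetical).

def website_issue_challenge():
    """The site serves a puzzle along with an opaque session identifier."""
    return {"session": "abc123", "puzzle": "distorted image of 'XK4P'"}

def farm_solve(puzzle):
    """Stand-in for the human labor pool: a person reads the image."""
    return "XK4P"

def bot_attack():
    """The bot never solves anything; it just forwards and replays."""
    challenge = website_issue_challenge()
    answer = farm_solve(challenge["puzzle"])   # relayed to a human solver
    return {"session": challenge["session"], "answer": answer}

response = bot_attack()
print(response)  # the bot submits the human's answer to the website
```

The key point the sketch makes is that nothing in the challenge binds the solver to the session: any answer that matches, from anyone, is accepted.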

Accessibility

Traditional CAPTCHAs, particularly those that require users to decipher distorted text or images, can be challenging for individuals with visual impairments. Audio CAPTCHAs were created as an alternative but are often challenging for those with hearing impairments (and often challenging for anyone).

Improved image recognition

As machine learning and artificial intelligence technologies have advanced, so has the capability of bots to solve CAPTCHA challenges. Algorithms can interpret distorted text, recognize objects in images, and even solve audio CAPTCHAs, reducing their effectiveness as a security measure. This led to a race to create ever-more challenging puzzles.

User frustration

As image and audio processing abilities advanced and CAPTCHAs became more challenging, their presence degraded the user experience. Not only is a CAPTCHA an extra step in interacting with a service, but it became one that presents humans with tests they often fail, leading to multiple attempts and possible abandonment.

Evolution

Ever-improving OCR technology also made text-based CAPTCHAs susceptible to attack. By 2014, Google found that it could use AI techniques to crack CAPTCHAs with 99.8% accuracy.

Getting value out of CAPTCHA: reCAPTCHA

reCAPTCHA is a variant of CAPTCHA that not only challenges users to prove they are human but uses the interaction to help digitize text, annotate images, and build machine learning datasets. It takes advantage of OCR (optical character recognition) situations that algorithms struggle with.

The initial version of reCAPTCHA was developed by Luis von Ahn and his team at Carnegie Mellon University and later acquired by Google in 2009. This version presented users with two words—one that the computer knew and one that it didn’t. By solving these CAPTCHAs, users were helping to digitize books, newspapers, and old radio shows. Google then used this not just for digitizing content but also parsing things such as house numbers in Google Street View.
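The two-word scheme can be sketched as a simple voting process. This is a simplification of the original reCAPTCHA idea, with hypothetical function names: the user must get the known control word right, and only then is their reading of the unknown word counted as a vote toward its digitization.

```python
from collections import Counter

votes = Counter()   # accumulated human readings of one unknown word

def submit(control_expected, control_typed, unknown_typed):
    """Record a reading of the unknown word iff the control word is correct."""
    if control_typed.strip().lower() != control_expected.lower():
        return False                 # failed the CAPTCHA; discard the reading
    votes[unknown_typed.strip().lower()] += 1
    return True

submit("university", "university", "chimney")
submit("orange", "orange", "chimney")
submit("river", "rivet", "chinmey")   # failed control word: vote not counted
print(votes.most_common(1))           # consensus reading of the unknown word
```

Once enough independent users agree on the unknown word, it can be promoted to a known word and used as a control in future challenges.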

Beyond text recognition

An alternative to text-based CAPTCHAs involves image recognition, such as “select all images that have mountains in them” or “select all squares in an image that have street signs.” This adds a layer of difficulty for programmatic solving: the bot must parse the request, and the image is broken up into blocks, where a block may hold only part of the target pattern (e.g., part of a street sign).
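Grading such a grid is straightforward on the server side. The sketch below is an assumption about how it might work, not a description of any deployed system: the server knows which tiles contain the target, and may allow a small tolerance for ambiguous tiles (those holding only a sliver of the object).

```python
# Hypothetical grading of a 3x3 image-grid CAPTCHA; tile indices are 0..8.
TARGET_TILES = {0, 1, 4}   # tiles the server knows contain the street sign

def grade(selected, targets=TARGET_TILES, tolerance=0):
    """Pass if the selection differs from the answer key by at most
    `tolerance` tiles (missed targets plus wrongly selected extras)."""
    missed = targets - set(selected)
    extra = set(selected) - targets
    return len(missed) + len(extra) <= tolerance

print(grade([0, 1, 4]))             # exact match passes
print(grade([0, 1]))                # a missed tile fails with zero tolerance
print(grade([0, 1], tolerance=1))   # but passes when one-tile slack is allowed
```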

Other solutions involve dragging a puzzle piece into place or rotating an object to a correct alignment.

noCAPTCHA

A more recent variation of CAPTCHA is Google’s No CAPTCHA reCAPTCHA. This simply asks users to check a box stating “I’m not a robot.” This provides a user-friendly interaction in that there is no puzzle to solve. Behind the scenes, however, the system performs a complex analysis of the user’s engagement with the CAPTCHA checkbox and the entire website. It examines cues such as the user’s mouse movements, IP address, session duration, and cookies that are indicative of typical human-driven behavior versus automated scripts.

If the initial risk analysis is inconclusive—perhaps because the user behavior is unusual or indicative of potential automation—the system may present additional challenges. These are similar to traditional CAPTCHAs and may include image recognition tasks or more complex puzzles.
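The two-stage flow above has the general shape of a risk score with a fallback. Google does not publish its model, so the weights, signal names, and threshold below are entirely invented; the sketch only shows the structure of such a decision.

```python
# Purely illustrative risk model; all signals and weights are hypothetical.
SIGNAL_WEIGHTS = {
    "has_google_cookies": 0.35,
    "humanlike_mouse_path": 0.35,
    "plausible_session_duration": 0.15,
    "ip_reputation_ok": 0.15,
}

def risk_decision(signals, threshold=0.7):
    """Combine observed signals into a score; fall back to a puzzle if low."""
    score = sum(w for name, w in SIGNAL_WEIGHTS.items() if signals.get(name))
    if score >= threshold:
        return "pass"        # the checkbox click alone is accepted
    return "challenge"       # present an image puzzle or similar test

print(risk_decision({"has_google_cookies": True,
                     "humanlike_mouse_path": True,
                     "plausible_session_duration": True}))
print(risk_decision({"ip_reputation_ok": True}))
```

The point of the structure is that an inconclusive score does not reject the user outright; it simply escalates to a traditional challenge.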

invisible reCAPTCHA

The latest variation of this system is the invisible reCAPTCHA. The user doesn’t even see the checkbox: the reCAPTCHA frame is positioned tens of thousands of pixels above the page origin, so its JavaScript code runs but the frame itself is out of view. If the server-based risk analysis does not get sufficient information from the Google cookies, it relocates the reCAPTCHA frame down to a visible part of the screen.

As with noCAPTCHA, if the risk analysis part of the system fails, the software presents a CAPTCHA (recognize text on an image) or, for mobile users, a quiz to search for items within an image.


The Future of CAPTCHA in the Age of Advanced AI

Recent developments in artificial intelligence (AI) and machine learning are rendering traditional CAPTCHA systems increasingly obsolete. Advanced AI models are now capable of bypassing even the most sophisticated CAPTCHA challenges, such as image recognition tasks and noCAPTCHA systems, with ease. This raises concerns about the efficacy of CAPTCHA as a tool for distinguishing humans from bots.

AI-powered bots are not just solving CAPTCHA challenges but are beginning to mimic human behaviors convincingly. This has led to the emergence of AI agents that can seamlessly interact with systems, further blurring the line between human users and automated bots.

Challenges with Bot Detection

  1. Behavioral Mimicry: Advanced AI can replicate human-like behaviors, such as realistic mouse movements and session interactions, which are core elements of modern CAPTCHAs like noCAPTCHA reCAPTCHA.

  2. AI as Users: As AI agents become mainstream, the distinction between legitimate AI-based tools (e.g., customer support bots) and malicious bots becomes increasingly complex, potentially undermining trust in bot detection systems.

  3. Evolving Threats: CAPTCHA solutions were designed to counter bots with limited intelligence. The new generation of bots, however, uses neural networks and reinforcement learning to adapt, making traditional methods ineffective.

Implications for Security

As CAPTCHA becomes less effective, security systems need to evolve. Future approaches may focus on:

  • Behavioral Biometrics: Analyzing unique patterns in typing, swiping, or other actions to identify users.
  • Contextual Awareness: Using data such as user location, device history, and network activity to assess the legitimacy of interactions.
  • Human-AI Collaboration: Designing systems that acknowledge the growing role of AI agents and incorporate them into a trusted framework while mitigating malicious use cases.

A Post-CAPTCHA World

The rise of sophisticated bots and AI agents demands a fundamental rethink of how online systems distinguish between legitimate and illegitimate interactions. At the same time, organizations need to balance usability with robust security measures to adapt to these challenges.


References

“Are you a robot? Introducing No CAPTCHA reCAPTCHA,” Google Search Central Blog, December 3, 2014.

Last modified December 21, 2024.