Navigating the Wonderland of AI Alt Text Authoring

Drawing on the themes and characters of Alice in Wonderland, this blog post explores why a purely automated approach to generative AI alt text falls short on context, accuracy, and equity. The article details the crucial importance of a Human in the Loop (HITL) strategy and the practical steps of Context Engineering that aids in building a resilient workflow that moves beyond simple error correction and trains AI to become a wiser, more discerning alt text author over time.

Scribely Team

October 27, 2025

5 minutes

Alice pulls back a curtain with one hand while clutching a skeleton key with the other. She wears a dress with short, puffed sleeves and a flaring, calf-length skirt under an apron. Her hair hangs loosely around her shoulders as she leans forward to look at a knee-high door revealed by the curtain.
Image Description
Image Description Goes Here
ALT

Introduction

As an industry, we are constantly seeking scalable solutions to the critical challenge of image accessibility, and generative AI has emerged as a particularly powerful, dual-natured tool. Effectively integrating AI, however, requires us to understand its inherent limitations and carefully guide its alt text authoring. This article will examine why simply implementing automated AI alt text authoring is insufficient, demonstrating that designing Human-in-the-Loop (HITL) workflows is essential to ensure that image descriptions meet both technical compliance and functional user needs for true equity and context.

The world of alt text authoring has become curiouser and curiouser with the introduction of Generative AI as an author. AI promises to solve image accessibility at scale, offering instant, affordable alt text. 

During a recent talk at the New York Public Library Accessible Technology Conference Scribely’s Chief Product Officer, Erin Coleman, discussed AI’s dual nature as a powerful yet growing alt text author by examining its strength and weaknesses, and shared how to build resilient workflows with human-in-the-loop strategies to create virtuous alt text cycles where expert human input goes beyond simple error correction to help AI become a wiser more discerning alt text author. 

This article explores the three biggest flaws in AI-generated alt text, using the iconic images and narrative of Alice in Wonderland to show why a human partner is non-negotiable.

What Are the Main Flaws in AI-Generated Alt Text?

Like the nonsensical rules of Wonderland, automated AI descriptions can fail at the three essentials of meaningful access: context, accuracy, and equity.

A toothy grin nearly spits the face of a wide-eyed cat perched in a tree. The Cheshire Cat’s body dissolves into the shadows of the canopy.

1. Lack of Context: The Grin Without the Cat

The Problem: AI alt text can lack context. It sees an image in isolation, completely divorced from your brand, your website, or the image's purpose.

In Wonderland, Alice asks the Cheshire Cat, "Would you tell me, please, which way I ought to go from here?" His answer: "That depends a good deal on where you want to get to."

This fundamental question of "where you want to get to"— the intent— is precisely where AI alt text struggles. AI can describe the "what" of an image but not the "why" an image is important in context. Without context, AI generates a description that is like the Cheshire Cat's grin floating in mid-air: a technically present feature, but one that's disembodied from any real meaning or purpose.

Alt text that lacks context is functionally useless. It fails to tell the user why the image is there. A well constructed Alt text workflow and context engineering needs to provide the AI Alt text author important information about the reason for the image it is describing – like image intent, surrounding circumstances, brand tone.

Slumping in an armchair at the end of a banquet table, Alice glares at the March Hare, Dormouse, and Mad Hatter, who cluster on the long side. A few blades of hay are loosely woven on the March Hare’s head, and a card on the Mad Hatter’s oversized top hat reads “In this Style 10/6.” Teacups and pots line the table.

2. Lack of Accuracy: "Always Tea Time"

The Problem: AI alt text can be factually inaccurate. Because of AI models’ architecture and the statistically-driven fluency of pre-trained language an AI alt text author can "hallucinate"— a term for when it confabulates, fabricates, or states a delusion as fact.

This means the AI may misidentify objects or invent details that simply aren't there.

At the Mad Tea Party, the Mad Hatter insists it's "always tea time," a frustrating, looping inaccuracy. In the same way, an AI can confidently misidentify a product, invent a person in a landscape, or, like the March Hare offering Alice wine when there was none, describe details that don't exist in the visual truth of the image.

The Queen of Hearts bellows and points imperiously down at Alice, who stands with her arms crossed protectively. Alice is surrounded by a gaggle of people drawn in the style of playing cards, all enclosed within a garden wall.

3. Lack of Equity: "Sentence First, Verdict Afterwards!"

The Problem: In AI alt text authoring there is a potential to mirror and amplify societal biases. AI models are trained on the internet, a dataset filled with imperfect, biased human-generated content.

This means an AI's output can easily replicate and scale harmful stereotypes, resulting in descriptions that are inequitable, reinforce narrow cultural norms, or use damaging language.

Like the Queen of Hearts' mandate of "Sentence first—verdict afterwards!", the AI often generates a description based on arbitrary, biased patterns it learned from its training data. As human alt text managers, we must know when to assert ourselves in the workflow—like Alice standing up to the Queen's absurdity—to ensure fairness and create reason.

What is Context Engineering for AI Alt Text?

To get quality AI alt text authoring we must build a workflow. We call this approach Context Engineering: the method of capturing and feeding essential  knowledge to the AI before it writes the first draft.

We must instruct the AI Alt text author the “why” of an image in the context that it’s in so that AI alt text descriptions are focused by context.

It’s about engineering the AI’s steps to get a better description, rather than leaving the process up to chance.

What Information Does AI Need for Quality Alt text?

To move beyond a simple "grin," an AI needs the "cat." This information includes:

  • Authorial Intent: Why was this image placed here? (e.g., “This image is a testimonial to show a happy customer.”)
  • Brand Guidelines: How should the tone sound? (e.g., Professional, playful, direct.)
  • Equity Constraints: What language should be used or avoided when describing people?
  • Experience-Level Context: What is the caption, the page title, the surrounding text, or the product description?

How Do You Implement Context Engineering?

Alongside the image, context information can be provided to the AI Alt text model is to control the output. This can be done by:

  1. A well-formed prompt that provides strong contextual guidance that constrains the model’s vast output space. Including the surrounding text or summary of the intended purpose in the prompt to guide the AI model to produce contextually appropriate descriptions. 
  2. Provide the model with associated labeled metadata that allows it to generate far more detailed, accurate, and grounded information than it could from an image alone. 

What is a Human-in-the-Loop (HITL) Workflow for Alt Text?

Three playing cards with human heads, hands, and feet stand around the base of a rose bush on a tall, spindly trunk. The two, five, and seven of spades all hold paint brushes, and one holds a pot of paint as they glare at a dark rose.

A Human-in-the-Loop (HITL) workflow is a resilient process that strategically combines AI's speed with essential human skill. It's not just about error correction; it's about creating a virtuous cycle that makes the AI smarter and more aligned with your goals over time.

To prevent the AI from generating "false appearances"—like the playing cards' nonsensical task of painting white roses red—this workflow ensures a human is there to "paint" the context correctly.

The Steps of a Resilient HITL Workflow

  1. Ingest & Contextualize: A human provides the image(s) and the contextual inputs (from your Context Engineering process).
  2. First Draft: The AI generates a first draft based on those specific human inputs.
  3. Review & Refine: A human expert evaluates the draft for accuracy, context, and equity, making necessary edits.
  4. Approval & Publication: The human-approved alt text is published.
  5. Feedback Loop: This is the most crucial step. The human-corrected version and its context are fed back into the system to fine-tune the AI model, making it a wiser, more discerning author for future images.

The Future of AI Alt Text: Augmentation, Not Replacement

The lesson from Wonderland is clear: Generative AI is a powerful tool, but without human input, it can easily lead users astray.

True innovation in AI image description lies not in replacing humans, but in augmenting their expertise. As Alice muses, “And what is the use of a book... without pictures or conversations?” Images are critical to our conversations, and we must build systems that make them truly accessible for everyone.

Ready to build your resilient alt text workflow? Scribely’s Context Engineering approach and platform are designed to help you get started.

Aerial view of a person using a credit card to make a purchase on an e-commerce product page. Their open laptop is resting on a wooden surface next to a pink pencil holder and Apple magic mouse.
Image Description
Image Description Goes Here
ALT

Check out Scribely's 2024 eCommerce Report

Gain valuable insights into the state of accessibility for online shoppers and discover untapped potential for your business.

Read the Report

Cite this Post

If you found this guide helpful, feel free to share it with your team or link back to this page to help others understand the importance of website accessibility.

Table of Contents

Scribely's Alt Text Checker

With Scribely's Alt Text Checker, you can drop a URL and scan for common alt text issues. Download a report and get organized on next steps to making your images accessible.

Free Scan

Related Articles

Alice pulls back a curtain with one hand while clutching a skeleton key with the other. She wears a dress with short, puffed sleeves and a flaring, calf-length skirt under an apron. Her hair hangs loosely around her shoulders as she leans forward to look at a knee-high door revealed by the curtain.

Image Description

Image Description Goes Here

ALT
Abstract digital artwork of geometric shapes with warm orange, blue, and pink tones, creating a layered, architectural concept with sharp angles and overlapping surfaces.

Image Description

Image Description Goes Here

ALT
A black and white isometric illustration depicting a centralized digital network. In the center, a large platform supports an orb representing an AI or neural network with smaller orbs connected. This central hub is connected by lines to various floating user interface windows. Four people stand at the smaller orbs using laptops to interact with the technology to illustrate an interconnected workflow.

Image Description

Image Description Goes Here

ALT
A screenshot of the Instagram "Create new post" screen. On the left, there is a preview of an image featuring a single, vibrant red poppy in a sunlit field of green and yellow wheat. On the right, under the post settings, the "Accessibility" menu is highlighted with a red rectangle, showing the user where to find the option to add alt text.

Image Description

Image Description Goes Here

ALT
A minimalist photograph shows three white, Scrabble-like tiles that spell the word 'ALT.' The tiles are perfectly centered against a solid coral-colored background.

Image Description

Image Description Goes Here

ALT
Collage of 4 photos of the disability rights movement featuring the 504 Sit-in, Disability Independence Day, the 0 Busters at Gallaudet, and the Capitol Crawl.

Image Description

Image Description Goes Here

ALT
The Met Gala 2025 steps featuring deep blue carpet with golden daffodils scattered throughout the scene. Title on image reads, "The Top 10 Looks from Met Gala 2025 with Accessible Image Descriptions."

Image Description

Image Description Goes Here

ALT
Cluttered workspace with open books filled with interior design and architecture images, a pair of black-rimmed glasses, crumpled pieces of paper, notebooks, and a laptop.

Image Description

Image Description Goes Here

ALT
Person points at colorful charts and graphs displayed on a laptop screen, analyzing data in a collaborative work setting with a colleague across the table writing in a notepad.

Image Description

Image Description Goes Here

ALT
A hand holds a white digital stylus, poised over a tablet screen, ready to draw or write. Colorful computer monitors and a keyboard fill the blurred background.

Image Description

Image Description Goes Here

ALT
Overhead view of two people sorting through a collection of abstract art prints laid out before them on a surface. They both point at a piece featuring a dark square with simple white line drawings.

Image Description

Image Description Goes Here

ALT
A freshly sharpened yellow pencil lies on lined paper, surrounded by scattered shavings and graphite dust.

Image Description

Image Description Goes Here

ALT
Hand holds a marker to an easel pad showing a hand-draw visualization of an image workflow that includes a user interface, database, and website creation.

Image Description

Image Description Goes Here

ALT
Person sits in a dimly lit room staring blankly into the light of their smartphone screen, head falling towards the couch like they're drained of energy.

Image Description

Image Description Goes Here

ALT
Closeup of a smart phone fixed to a tripod recording a man with short braids and a floral shirt. He sits in front of a low beige sofa as he smiles and points at the camera.

Image Description

Image Description Goes Here

ALT
First person view of a person holding a smartphone and swiping social media with a blurred view of a photo gallery on a Mac behind it.

Image Description

Image Description Goes Here

ALT
Several dusty and disintegrating framed portraits piled atop one another in an empty, run-down space.

Image Description

Image Description Goes Here

ALT
Media
April 19, 2022

Why NFTs Need Alt Text Now

Three people wearing pink smile together as they look at a smartphone screen. The phone has a bright pink case. One person with long pink hair and another with short brown hair laugh.

Image Description

Image Description Goes Here

ALT
Laptop screen with an image of Vimeo's logo next to YouTube's logo. Vimeo's video player user interface is at the bottom of the screen. Text below reads, "Vimeo and YouTube are letting us down." Scribely decorative squiggles separate the laptop from headphones and audio wave icons. Scribely logo in the bottom right corner.

Image Description

Image Description Goes Here

ALT
Person on the far side of a computer screen with their head buried in both hands under an icon for an accessibility overlay.

Image Description

Image Description Goes Here

ALT
Grid of four GIF screenshots featuring four Disabled women doing various reactions with white caption text on each screenshot like “Spill the tea, girl” and “That’s hot.”

Image Description

Image Description Goes Here

ALT
Close up of a person opening a journal at a wood table. They hold a pen in one hand, and a pot of tea and a mug sit in front of the journal.

Image Description

Image Description Goes Here

ALT
The Met Gala 2024 steps draped in a cream-to-seafoam-green ombré carpet, bordered by lush white blooms and topiary greenery. Title on image reads, "The Top 10 Looks from Met Gala 2024 with Accessible Image Descriptions."

Image Description

Image Description Goes Here

ALT
Screenshot of Scribely’s Alt Text Checker. Text reads “Identify alt text issues on your website. Enter your URL below, and Scribely’s Alt Text Checker will scan your webpage for alt text issues and suggest next steps for improvement.” above a fillable field with “Enter your URL” to the left and an Analyze button to the right.

Image Description

Image Description Goes Here

ALT
Front of a digital camera resting on a tripod with a small fuzzy microphone attached to the top via a red cord with a blurred building in the background.

Image Description

Image Description Goes Here

ALT
Resources
April 3, 2023

How to Make Video Accessible

GIPHY logo in all capital, block letters and the cursive Scribely logo, both in white text against a violet-purple background.

Image Description

Image Description Goes Here

ALT
Glimpsed between two open, silver laptops, a person points at a screen as a slightly smaller pair of hands of a younger person rest near the keyboard.

Image Description

Image Description Goes Here

ALT
Blue flag with a ring of 12 yellow stars printed on a 100 Euro bill, which overlaps an American the D of an American dollar bill.

Image Description

Image Description Goes Here

ALT
Resources
September 1, 2024

European Accessibility Act (EAA)

Graphic. Text below an illustration of an open laptop reads, “A Visual Description & Accessibility Glossary” in white text against a sage-green background. The cursive Scribely logo is in the bottom right corner.

Image Description

Image Description Goes Here

ALT
View down onto an open, silver laptop as a person with long red fingernails touches the built-in mousepad. They hold a green credit card in the other hand.

Image Description

Image Description Goes Here

ALT
Woman throws both arms up as she smiles widely, her eyes closed amid a shower of glittering confetti. She wears a teal-green, velvety jacket.

Image Description

Image Description Goes Here

ALT
Person against wood paneling holds one arm across her body to cup the opposite elbow. She holds that second hand to her chin and index finger on her jawline. She looks up, head tipped to the left and smiling.

Image Description

Image Description Goes Here

ALT
Person facing away from us works at a computer with a wide screen. The person wears headphones, and a laptop sits next to a lamp on the desk.

Image Description

Image Description Goes Here

ALT
Pincers at the end of a robotic arm hold a dark pink Gerbera daisy against a sky-blue background.

Image Description

Image Description Goes Here

ALT
Two different hands reach towards one another, nearly touching, as if they are about to shake hands.

Image Description

Image Description Goes Here

ALT
Resources
August 12, 2020

A Guide to Inclusive Language

Person with shaggy, chin-length hair sits with their back to us as they look at a computer screen. They wear headphones and a black and white plaid shirt.

Image Description

Image Description Goes Here

ALT
Accessibility
November 19, 2020

Talking Images: A Screen Reader Revolution

Two smiling people sit on the ground on either side of a low coffee table. Studio-style microphones are set up in front of each person, and one of them touches the mousepad of a laptop.

Image Description

Image Description Goes Here

ALT
Six dancers wearing all black pose in a tightly knit group in front of a concrete wall under a blue sky.

Image Description

Image Description Goes Here

ALT
Person smiles as they move toward us, listening to their device with earphones with a white wire. Out of focus, others walk along the city street in the background.

Image Description

Image Description Goes Here

ALT
Smiling person captured mid-jump in front of white aluminum siding. The person’s long hair floats up as they tuck their heels close to their hands, which are down by their sides.

Image Description

Image Description Goes Here

ALT
Dozens of people facing away from us gather in a courtyard or square. Two people in the middle of the crowd bow their heads and lift their right fists high.

Image Description

Image Description Goes Here

ALT
Person sitting, folded up in a shopping cart. Out of focus, they rest one elbow on the edge of the cart and rest their forehead in that hand. A text box reads, “2023 E-Commerce Content Accessibility Report.” The cursive Scribely logo is above.

Image Description

Image Description Goes Here

ALT
Dancer strikes a pose resting on one hand and one foot, their hips lifted. Their other hand and leg cross over their body. They are on a brick walkway leading to Voorhees Town Center.

Image Description

Image Description Goes Here

ALT

Ready to get started?

Turn intentions into actions, start here!