Why does automated accessibility testing only catch around 30-40% of real issues?

Question

Accepted Answer

Automation can verify structure — presence of alt text, label associations, contrast ratios, ARIA syntax — but it cannot evaluate meaning. Whether alt text is meaningful, whether the reading order makes sense, and whether a custom widget is usable with a screen reader all require human judgment. The 30-40% figure comes from research by WebAIM and Deque. Automated tools like axe-core evaluate rules that have deterministic answers: is this contrast ratio above 4.5:1? Is this input labelled? Does this element have a valid role? These checks pass or fail without needing to understand the content. What automation cannot evaluate: Alt text quality: an image of a customer signing a contract with alt text "image123" passes the axe check (alt attribute is present and non-empty) but is meaningless to a screen reader user. Keyboard interaction flow: axe can check that elements are focusable, but it can't verify whether the Tab order makes logical sense, whether arrow key navigation within a custo

Why does automated accessibility testing only catch around 30-40% of real issues?

Short answer

Detail

// WHAT INTERVIEWERS LOOK FOR