Foundations
AI is everywhere in QA discourse and most of it is noise. Build the mental model first so you can spot what's actually useful for your work.
A mental model for AI in QA
If you treat an LLM like a search engine, it'll disappoint you. If you treat it like a senior colleague who hallucinates 5% of the time, you'll get a lot done.
You'll learn to
- Explain what an LLM does (and doesn't do) in one paragraph
- Identify the QA tasks where AI gives 5x speedups, and where it gives nothing
- Recognise hallucination patterns specific to test code
- Decide when to trust AI output, when to verify, and when to discard
- Frame AI as a force-multiplier, not a replacement
Prompting fundamentals for testing work
Most 'AI doesn't work for QA' complaints are really 'I asked it badly'. Prompting is a skill you can learn in an afternoon and refine for years.
You'll learn to
- Structure prompts: context, task, constraints, examples
- Use few-shot examples to anchor output style
- Specify the test framework, language, and conventions explicitly
- Iterate on output through follow-up turns, not single shots
- Build reusable prompt templates for recurring tasks
MCP and agentic workflows
Stop copy-pasting between chat windows and your editor. Wire AI directly into your test runner, your repo, and your browser.
Playwright MCP for browser automation
MCP lets an AI agent drive a real browser, read pages, and take actions. The leap from 'AI suggests test code' to 'AI runs the test and reports back'.
You'll learn to
- Set up the Playwright MCP server locally
- Connect it to Claude Code, Cursor, or another MCP-aware client
- Use the agent to explore an app and propose test cases
- Have the agent run tests and read failure output
- Know the limits: when MCP helps, when it adds friction
Claude Code for repo-wide QA tasks
When you need to refactor 200 test files, add coverage for a new feature across a service, or migrate from Cypress to Playwright — a chat window can't help. Claude Code can.
You'll learn to
- Run Claude Code against a real test repo and stay in control
- Delegate large refactors with clear acceptance criteria
- Get the agent to read your code conventions before writing more
- Review diffs critically — assume nothing without reading
- Build a workflow where the agent reports progress at each step
Assemble your AI toolkit
The win isn't any one tool. It's the workflow: a code assistant in your editor, an MCP server for your stack, a prompt library you trust, and the judgement to know which to reach for.
You'll learn to
- Pick a code assistant that fits your codebase and team
- Set up MCP servers for the tools you actually use
- Maintain a personal prompt library for recurring tasks
- Know when to drop AI entirely and just write the code
- Share the workflow with your team in a way they'll adopt
Quality, safety, and limits
Knowing where AI fails is what separates a serious adopter from a hype-driven one. This is the phase that makes you trusted, not just fast.
Validating AI-generated tests
An AI-generated test that runs green tells you nothing if the test asserts the wrong thing. Most failures hide in plausible-looking code that doesn't actually test the behaviour.
You'll learn to
- Write a checklist for reviewing AI test output
- Use mutation testing or similar to expose weak assertions
- Catch silent failures: tests that pass when they should fail
- Cross-check the generated test against the requirement
- Decide what level of human review each AI artefact needs
What AI can't (and shouldn't) replace
OptionalExploratory testing, ambiguity-spotting, stakeholder conversations — these are the parts of QA that resist automation. Knowing which is which protects your career as the tools improve.
You'll learn to
- Name the QA work AI is bad at and explain why
- Defend exploratory testing as a high-value human activity
- Recognise when teams over-delegate and what they lose
- Position yourself for the work AI makes more valuable, not less
Data privacy and code-leak risks
Pasting prod data or proprietary code into a hosted LLM is a security event waiting to happen. Knowing the policies and the alternatives matters.
You'll learn to
- Read and apply your company's AI usage policy
- Identify what can and can't be pasted into a hosted model
- Use local or enterprise AI offerings when sensitivity demands
- Mask or fake data before sharing it with any model
- Recognise the difference between training-on-input and not