On this page10 sections
Release Readiness Report
Evaluate a provided test run summary and defect backlog, then produce a structured release readiness report with a go/no-go recommendation and risk assessment.
Role
QA lead
Difficulty
IntermediateTime limit
60–90 min
Category
release readiness
Scenario
You are the QA lead for a B2B SaaS product's quarterly release. The engineering manager has asked you to produce a release readiness report before the go/no-go meeting in two hours. You have been given a test run summary, an open defect list, and notes from the regression run. The team is under pressure to ship on schedule. Your job is to make an honest, evidence-based assessment and deliver a written report that the engineering manager, product manager, and support lead can all read and act on — even if the conclusion is uncomfortable.
Requirements
- 1.Read and synthesise the provided test run summary, defect list, and regression status into a structured report — interpret the data rather than restating it; note that 9 of 14 failures share a single root cause, and that 2 failures are likely flakes, not real defects
- 2.Triage each open defect by severity and assess its release impact: classify each as a blocker, an acceptable risk with a documented mitigation, or a deferral candidate — and provide a one-line rationale for each classification
- 3.Assess the completeness of test coverage: identify the gap created by the partial reporting-module regression and state how that gap affects your confidence in the go/no-go recommendation
- 4.Review the environment and configuration state: confirm or flag any risks related to feature flag parity between staging and production, database version alignment, or outstanding infrastructure changes
- 5.Write an unambiguous go/no-go recommendation backed by at least two specific data points from the provided summary — the recommendation must be evidence-based, not a gut call, and must include a conditional path if a mitigation could change the outcome
- 6.Define rollback triggers: specify at least two observable, measurable conditions that would cause the on-call team to initiate a rollback in the first hours after deployment — vague triggers ('if things go wrong') do not satisfy this requirement
- 7.Write the report in a structure that serves two audiences simultaneously: an executive summary paragraph that a non-technical product manager can act on, and detailed findings sections that an engineering manager can use to make and justify decisions
Starter data
- ›Test run summary: 847 automated tests run against staging (release candidate v2.4.1-rc2); 831 passed, 14 failed, 2 skipped. The 14 failures break down as: 9 in the reporting module (all caused by a single PDF generation defect), 3 in user-import (CSV with non-ASCII characters), 2 intermittent (both passed on the immediate re-run — suspected flaky).
- ›Open defect list: [SEV-1] PDF export crashes for reports containing more than 500 rows — status: Open, no workaround available, affects enterprise accounts. [SEV-2] CSV user-import fails for filenames or field values containing accented characters (e.g. 'Renée') — status: Open, workaround: encode the CSV as UTF-8 before upload. [SEV-3] Dashboard chart legend overlaps on screens narrower than 768 px — status: Open, cosmetic only. [SEV-3] Email notification delivery delayed by more than 60 seconds in certain timezones — status: Open, intermittent, no data loss. [SEV-4] Tooltip text truncated on hover in Firefox — status: Closed/fixed in this release.
- ›Regression status: full regression suite completed on staging; 100% pass on all core user flows — login, CRUD operations, billing, and permission enforcement. The reporting module regression was blocked by the SEV-1 defect and marked 'partial' — 9 reporting-module test cases could not execute against the RC build.
- ›Environment and config: staging is running v2.4.1-rc2. Both staging and production use PostgreSQL 15.3 — no database version drift. Three feature flags are toggled ON in staging for pre-launch testing of next-quarter features; all three are confirmed OFF in production. No outstanding infrastructure or dependency change is scheduled for this release window.
- ›Rollback information: the previous production release is v2.3.8, deployed six weeks ago. A written rollback procedure exists in the runbook. The procedure was last successfully tested three months ago and has not been rehearsed since the underlying infrastructure was patched in the intervening period.
Expected deliverables
- ✓A structured release readiness report with all seven sections clearly labelled: Executive Summary, Test Results Summary, Defect Triage, Coverage Gaps, Environment and Configuration Assessment, Go/No-Go Recommendation, Rollback Plan and Triggers
- ✓A defect triage table listing every open defect with: severity, current status, release impact classification (blocker / acceptable risk / deferred), and a one-line rationale for the classification
- ✓A go/no-go recommendation that is explicit ('Go', 'No-Go', or 'Conditional Go with stated conditions') and cites at least two specific data points from the provided test summary or defect list
- ✓At least two rollback triggers stated as specific, observable conditions — for example, a named feature failing a post-deploy smoke test or an error rate exceeding a stated threshold — not vague statements
Evaluation rubric
| Dimension | What reviewers look for |
|---|---|
| Data synthesis and interpretation | Does the report interpret the test results rather than restating the raw numbers? Does the candidate notice that 9 of 14 failures are caused by a single defect (not 14 distinct problems), that the 2 intermittent failures are likely flakes and should not inflate the blocker count, and that all core user flows are clean? A report that presents '14 test failures' as equivalent to '14 distinct risks' shows poor signal-to-noise judgement. |
| Risk-based defect triage | Is the SEV-1 (PDF crash for >500-row reports) correctly identified as a release blocker with no available workaround? Is the SEV-2 UTF-8 workaround assessed for adequacy given the actual user base (enterprise non-technical users vs developer-facing tooling)? Are the SEV-3 defects weighted appropriately — acknowledged but not treated as blockers? |
| Coverage gap assessment | Does the report explicitly flag that the reporting module regression is partial, quantify the gap (9 test cases not executed), and state what this means for confidence in the reporting module — rather than treating the 831/847 pass rate as representative of the whole system? Omitting this gap is the most common critical error in release readiness assessments. |
| Go/no-go clarity | Is the recommendation unambiguous? Given a SEV-1 open with no workaround, the correct outcome is No-Go (or Conditional Go with explicit acceptance criteria). Does the candidate provide a conditional path — what would need to be true for the decision to change — rather than leaving the team with no forward options? |
| Rollback plan quality | Are rollback triggers observable and specific enough that an on-call engineer can act on them at 2 a.m. without making a judgment call? Does the candidate flag the risk that the rollback procedure has not been tested since the infrastructure was patched, and recommend a dry run before go-live? |
| Stakeholder communication | Can a non-technical product manager read the executive summary and understand the recommendation and its rationale without reading the rest of the report? Is the full report structured so that an engineering manager can find the detailed evidence they need quickly? A summary that says 'please read section 4 for our recommendation' fails this dimension. |
Sample solution outline
- ›Executive Summary: 'The v2.4.1 release is NOT recommended for go-live in its current state. A SEV-1 defect causes PDF export to crash for reports exceeding 500 rows — a core workflow for the majority of enterprise accounts — and no workaround is available. The reporting module regression is also incomplete as a direct consequence of this defect. All other core user flows (login, CRUD, billing, permissions) are clean. Recommended path: resolve the SEV-1 defect and complete the reporting regression before scheduling go-live. If the business must ship on the current date, product and engineering must formally accept the SEV-1 risk in writing and commit to a hotfix release within five business days.'
- ›Defect triage table: SEV-1 PDF crash → Blocker (no workaround; core enterprise workflow; directly caused 9 of 14 test failures). SEV-2 CSV import → Acceptable risk — document UTF-8 workaround prominently in release notes and assess whether enterprise admins (likely non-technical) can reasonably apply it. SEV-3 chart legend overlap → Deferred — cosmetic, mobile-only, does not block any user task. SEV-3 email delay → Deferred — intermittent, no data loss, monitor in production post-deploy. SEV-4 tooltip (closed) → No action required.
- ›Coverage gap: 9 test cases in the reporting module could not be executed against the RC build because the SEV-1 defect crashed the feature under test. The 831/847 pass rate is therefore misleading as a headline metric — the module most affected by an open blocker has the least test coverage. Confidence in the reporting module is low regardless of the automation pass rate.
- ›Environment assessment: staging and production are on the same PostgreSQL version — no database drift risk. The three feature flags toggled ON in staging are confirmed OFF in production — no flag leakage risk. The rollback procedure was last tested three months ago; recommend a 30-minute tabletop walkthrough of the rollback steps before go-live to confirm the procedure still applies to the current infrastructure state.
- ›Go/no-go recommendation: NO-GO. Evidence: (a) SEV-1 is open with no workaround and directly impacts the enterprise user segment; (b) the reporting module regression is incomplete, leaving a coverage gap in the system's most-affected area. Conditional GO path: SEV-1 is resolved or formally accepted in writing by product with a customer communication plan and a committed hotfix date; the reporting module regression is then completed against the updated build.
- ›Rollback triggers: (1) PDF export failure rate exceeds 5% of export attempts within the first two hours of production deployment, measured in the application error log; (2) the post-deploy smoke test fails on login, billing, or user permissions flows; (3) more than three enterprise accounts report the PDF crash via the support channel within the first business day post-launch. Any one of these conditions is sufficient to initiate rollback without waiting for additional data.
Common mistakes
- Marking the release as go because 831 of 847 tests pass — a 98.2% pass rate sounds healthy, but 9 of the 14 failures stem from a single SEV-1 defect that blocks a core enterprise workflow; a pass rate without reading what failed gives a false sense of quality
- Counting the two intermittent failures as real defects and including them in the blocker count — the provided data notes that both passed on re-run, making them likely flaky tests rather than new defects; conflating flakiness with blocking defects overstates risk and dilutes the report's credibility
- Accepting the SEV-2 workaround without assessing the target user base — a UTF-8 encoding step is trivial for a developer but is a significant and confusing obstacle for a non-technical enterprise administrator who manages user imports; the suitability of a workaround depends on who must apply it
- Omitting the coverage gap caused by the partial reporting module regression — noting that '831 tests pass' without flagging that the most-affected module has 9 unexecuted test cases gives stakeholders a misleading picture of test completeness
- Writing rollback triggers that are vague — phrases like 'if we see customer complaints' or 'if the error rate is too high' do not give an on-call engineer enough information to make a call at 2 a.m.; triggers must name a specific metric, a threshold, and a time window
- Writing an executive summary that requires the reader to reference other sections to understand the recommendation — the summary must stand alone; a product manager in a go/no-go meeting should be able to make a decision from the first paragraph alone
Submission checklist
- Report contains all seven required sections with clear headings: Executive Summary, Test Results Summary, Defect Triage, Coverage Gaps, Environment and Configuration Assessment, Go/No-Go Recommendation, Rollback Plan and Triggers
- Defect triage table classifies all five open defects as blocker, acceptable risk, or deferred — with a one-line rationale for each classification
- Go/no-go recommendation is explicit and unambiguous, citing at least two specific data points from the provided summary
- At least two rollback triggers are stated as specific, observable, measurable conditions — not vague statements
- The partial reporting module regression and its impact on test coverage confidence are explicitly addressed
- The environment/config section covers feature flag state, database version parity, and the untested rollback procedure
- The executive summary paragraph stands alone — a non-technical reader can understand the recommendation and rationale without reading further
Extension ideas
- +Rewrite the same report as a 'Conditional Go' — assume the product manager has formally accepted the SEV-1 risk with a signed customer communication plan and a committed hotfix date; practise changing the recommendation while keeping the evidence base and the rollback plan intact
- +Build a reusable release readiness report template in Markdown with placeholder sections and inline guidance notes for each field, so the next QA lead can complete the assessment under time pressure without starting from a blank page