TL;DR — Purpose-built tools beat generic LLMs for consistent spec output: use Kiro for AI-agent pipelines, ChatPRD for stakeholder docs, Copilot4DevOps for Azure DevOps shops. Structure acceptance criteria with EARS “shall” statements when specs feed AI coding agents. AI produces syntactically valid requirements that routinely miss business intent — human review is non-negotiable, especially in regulated domains.
Two audiences for every spec in 2026
Functional specs now serve two distinct consumers: human stakeholders needing plain-language rationale, and AI coding agents that parse specs as executable contracts [1]. Optimising for only one audience breaks the other [2].
| Audience | Needs | Breaks when |
|---|---|---|
| Humans | Plain language, rationale, stakeholder narrative | Dense jargon, missing context, no “why” |
| AI agents | Structured headings, “shall” clauses, explicit out-of-scope list, non-functionals | Ambiguous prose, vague UI descriptions, missing constraints |
The 2026 pattern: write human-first, then add a machine-optimised layer — .cursorrules, CLAUDE.md, or EARS-structured acceptance criteria [3]. Keeborg generates both layers in 90 seconds [3]; Kiro forces review of the machine layer before any code executes [4].
Spec formats and AI generation maturity
| Format | Best for | AI generation maturity |
|---|---|---|
| PRD | Product vision, scope, user personas | High — purpose-built tools (ChatPRD, Productboard Spark) |
| FRD | “Shall” statements, business rules | High — EARS-compliant generation available |
| BRD | Stakeholder needs, business objectives | Medium — needs strong context injection |
| User stories + AC | Sprint-ready Jira/ADO tickets | High — native AI features in Jira/ADO |
| BDD / Gherkin | Executable test contracts, living docs | Medium — AI misses edge cases and non-functionals |
| SDD spec | AI coding agent contracts | Emerging — Kiro, GitHub Spec Kit, OpenSpec |
Tool comparison
| Tool | Best for | AI-agent optimised | Draft speed | Key limitation |
|---|---|---|---|---|
| Kiro | AWS greenfield; EARS → code pipeline | ✓ | Minutes | AWS-native; limited brownfield |
| ChatPRD | Stakeholder-facing PRDs, PM coaching | ✗ | 5–10 min | No customer evidence layer |
| Keeborg | Full spec stack for AI coding agents | ✓ | 90 sec | New entrant; niche ecosystem |
| Copilot4DevOps | Azure DevOps-native FRDs, user stories | Partial | Minutes | Azure DevOps only |
| Jama Connect | Regulated industries; INCOSE/EARS scoring | Partial | Minutes | Enterprise pricing |
| aqua | Voice-to-requirement; rapid BA field capture | ✗ | 15 sec | Shallow on complex logic |
| GitHub Spec Kit | Portable, MIT-licensed, agent-agnostic | ✓ | Minutes | High review burden; many files |
EARS notation: requirements AI agents can parse
EARS (Easy Approach to Requirements Syntax) was developed at Rolls-Royce and adopted by Airbus, Bosch, NASA, and Siemens [12]. Its structured templates reduce ambiguity and produce consistent requirements even across different authors — making AI-generated drafts easier to validate [13].
Core pattern: While <precondition>, when <trigger>, the <system> shall <response>.
| EARS type | Template | Example |
|---|---|---|
| Ubiquitous | The [system] shall [response] |
The system shall encrypt all PII at rest using AES-256. |
| Event-driven | When [trigger], the [system] shall [response] |
When a payment fails, the system shall notify the user within 5 seconds. |
| State-driven | While [state], the [system] shall [response] |
While unauthenticated, the system shall redirect to /login. |
| Conditional | Where [feature enabled], the [system] shall ... |
Where MFA is enabled, the system shall prompt on each new device. |
Kiro generates EARS-structured requirements by default [9]. Inflectra.ai scores existing FRDs against EARS rules and suggests rewrites [14]. Jama Connect’s Advisor runs the same check inline as you type [6].
Spec-driven development: the 2026 pipeline
SDD moves specs from handoff artifacts into agent-executable contracts [10]. Kiro implements this as three lightweight markdown files you review before any code runs:
Requirements doc → Design doc → Task list → AI generates code
Martin Fowler’s team identifies three SDD maturity levels [10]:
- Spec-first — specs precede code, then are discarded → lowest overhead, highest drift risk
- Spec-anchored — specs persist alongside code, updated as features evolve → practical target for most teams
- Spec-as-source — humans edit only specs, never code directly (Tessl, currently experimental)
⚠ The documented failure mode mirrors BDD’s collapse in the 2010s: when only one role owns spec files, they become abandoned documentation [15]. SDD succeeds only with genuine cross-role authorship — analysts contribute business rules, developers add constraints, testers add boundary cases.
Prompt patterns for BAs
Effective prompts follow Role + Context + Task + Constraints + Output format [16]. Three templates that consistently work with Claude, ChatGPT, or Copilot:
Meeting notes → functional requirement
You are a senior BA. Here are raw stakeholder notes: [paste notes].
Convert to one EARS-formatted functional requirement covering:
- Functional context (what business process this supports)
- Business logic (IF/WHEN/THEN rules, numbered)
- Happy path (numbered steps)
- Unhappy path (numbered steps)
- Open questions (items needing stakeholder clarification)
User story + acceptance criteria
You are a product owner. Create a user story for: [feature description].
Format: "As a [persona], I want [goal] so that [benefit]."
Add 5 acceptance criteria in EARS event-driven format (When/Then).
Flag any edge cases or missing non-functional requirements.
FRD gap analysis
Review this FRD: [paste document].
Identify: (1) ambiguous or missing requirements, (2) EARS structure violations,
(3) missing non-functionals (performance, security, accessibility).
Output as a numbered list with severity: high / medium / low.
The quality of the prompt determines the quality of the output — vague prompts produce generic answers; structured prompts with role, context, and output format produce professional-grade drafts requiring minimal editing [16].
Platform-native integrations
For teams already inside an ecosystem, native integrations avoid context-switching:
| Platform | AI capability | Key action for BAs |
|---|---|---|
| Jira / Atlassian Rovo | Drafts AC from Confluence docs, flags gaps | “Generate AC” and “Find edge cases” in ticket view |
| Azure DevOps / C4D | Generates FRDs, user stories, test cases | One-click backlog generation from PRD parent item |
| Confluence Rovo | 20+ agents: summarise, Q&A, content outline | Convert meeting notes to structured requirement pages |
| Notion AI | Agents run autonomously for up to 20 minutes | Process sprint retrospectives into updated backlogs |
BDD / Gherkin: the edge-case caveat
AI generates syntactically valid Given/When/Then scenarios from a user story in under a minute [20]. The consistent failure mode: AI-generated Gherkin misses business intent, skips non-functional requirements, and duplicates happy-path variations. Treat it as a coverage scaffold to edit, not a finished contract [20].
The same trap that ended BDD adoption for many teams applies here: if QA writes scenarios in isolation — without analyst or developer review — they drift from business intent and eventually get ignored [15]. Gherkin’s value in 2026 is as the contract layer that keeps humans, AI agents, and stakeholders aligned — but only when it’s co-owned.
Human-in-the-loop: what AI cannot replace
AI generates requirements that are plausible, consistent, and fast. It cannot [6]:
- Understand stakeholder politics or unstated business constraints
- Know which requirements carry legal, compliance, or safety weight
- Make tradeoff calls when requirements conflict
- Validate that generated language matches real business intent
In regulated industries (ISO 26262, DO-178C, IEC 62304), human review “isn’t optional — it’s a structural requirement baked into every applicable standard” [6]. For everyone else, the practical rule: AI generates, humans approve, no AI-generated spec goes straight to the sprint backlog without a refinement session [5].
The BA/FA/PO role in 2026 shifts from writing specs to validating and refining AI-generated specs — more architectural judgement, less blank-page struggle [21].