Atlas survey

AI-Augmented Functional Specs and Documentation

Tool selection, EARS notation, spec-driven pipelines, and prompt patterns for BAs/FAs/POs writing AI-augmented functional specs in 2026.

21 sources ~7 min read #204 business-analysis · functional-specifications · requirements · AI-tools · documentation · spec-driven-development · EARS

TL;DR — Purpose-built tools beat generic LLMs for consistent spec output: use Kiro for AI-agent pipelines, ChatPRD for stakeholder docs, Copilot4DevOps for Azure DevOps shops. Structure acceptance criteria with EARS “shall” statements when specs feed AI coding agents. AI produces syntactically valid requirements that routinely miss business intent — human review is non-negotiable, especially in regulated domains.

Two audiences for every spec in 2026

Functional specs now serve two distinct consumers: human stakeholders needing plain-language rationale, and AI coding agents that parse specs as executable contracts [1]. Optimising for only one audience breaks the other [2].

Audience Needs Breaks when
Humans Plain language, rationale, stakeholder narrative Dense jargon, missing context, no “why”
AI agents Structured headings, “shall” clauses, explicit out-of-scope list, non-functionals Ambiguous prose, vague UI descriptions, missing constraints

The 2026 pattern: write human-first, then add a machine-optimised layer — .cursorrules, CLAUDE.md, or EARS-structured acceptance criteria [3]. Keeborg generates both layers in 90 seconds [3]; Kiro forces review of the machine layer before any code executes [4].

Spec formats and AI generation maturity

Format Best for AI generation maturity
PRD Product vision, scope, user personas High — purpose-built tools (ChatPRD, Productboard Spark)
FRD “Shall” statements, business rules High — EARS-compliant generation available
BRD Stakeholder needs, business objectives Medium — needs strong context injection
User stories + AC Sprint-ready Jira/ADO tickets High — native AI features in Jira/ADO
BDD / Gherkin Executable test contracts, living docs Medium — AI misses edge cases and non-functionals
SDD spec AI coding agent contracts Emerging — Kiro, GitHub Spec Kit, OpenSpec

Tool comparison

Tool Best for AI-agent optimised Draft speed Key limitation
Kiro AWS greenfield; EARS → code pipeline Minutes AWS-native; limited brownfield
ChatPRD Stakeholder-facing PRDs, PM coaching 5–10 min No customer evidence layer
Keeborg Full spec stack for AI coding agents 90 sec New entrant; niche ecosystem
Copilot4DevOps Azure DevOps-native FRDs, user stories Partial Minutes Azure DevOps only
Jama Connect Regulated industries; INCOSE/EARS scoring Partial Minutes Enterprise pricing
aqua Voice-to-requirement; rapid BA field capture 15 sec Shallow on complex logic
GitHub Spec Kit Portable, MIT-licensed, agent-agnostic Minutes High review burden; many files

[5] [6] [7] [8] [9] [10] [11]

EARS notation: requirements AI agents can parse

EARS (Easy Approach to Requirements Syntax) was developed at Rolls-Royce and adopted by Airbus, Bosch, NASA, and Siemens [12]. Its structured templates reduce ambiguity and produce consistent requirements even across different authors — making AI-generated drafts easier to validate [13].

Core pattern: While <precondition>, when <trigger>, the <system> shall <response>.

EARS type Template Example
Ubiquitous The [system] shall [response] The system shall encrypt all PII at rest using AES-256.
Event-driven When [trigger], the [system] shall [response] When a payment fails, the system shall notify the user within 5 seconds.
State-driven While [state], the [system] shall [response] While unauthenticated, the system shall redirect to /login.
Conditional Where [feature enabled], the [system] shall ... Where MFA is enabled, the system shall prompt on each new device.

Kiro generates EARS-structured requirements by default [9]. Inflectra.ai scores existing FRDs against EARS rules and suggests rewrites [14]. Jama Connect’s Advisor runs the same check inline as you type [6].

Spec-driven development: the 2026 pipeline

SDD moves specs from handoff artifacts into agent-executable contracts [10]. Kiro implements this as three lightweight markdown files you review before any code runs:

Requirements doc → Design doc → Task list → AI generates code

Martin Fowler’s team identifies three SDD maturity levels [10]:

  • Spec-first — specs precede code, then are discarded → lowest overhead, highest drift risk
  • Spec-anchored — specs persist alongside code, updated as features evolve → practical target for most teams
  • Spec-as-source — humans edit only specs, never code directly (Tessl, currently experimental)

⚠ The documented failure mode mirrors BDD’s collapse in the 2010s: when only one role owns spec files, they become abandoned documentation [15]. SDD succeeds only with genuine cross-role authorship — analysts contribute business rules, developers add constraints, testers add boundary cases.

Prompt patterns for BAs

Effective prompts follow Role + Context + Task + Constraints + Output format [16]. Three templates that consistently work with Claude, ChatGPT, or Copilot:

Meeting notes → functional requirement

You are a senior BA. Here are raw stakeholder notes: [paste notes].
Convert to one EARS-formatted functional requirement covering:
- Functional context (what business process this supports)
- Business logic (IF/WHEN/THEN rules, numbered)
- Happy path (numbered steps)
- Unhappy path (numbered steps)
- Open questions (items needing stakeholder clarification)

User story + acceptance criteria

You are a product owner. Create a user story for: [feature description].
Format: "As a [persona], I want [goal] so that [benefit]."
Add 5 acceptance criteria in EARS event-driven format (When/Then).
Flag any edge cases or missing non-functional requirements.

FRD gap analysis

Review this FRD: [paste document].
Identify: (1) ambiguous or missing requirements, (2) EARS structure violations,
(3) missing non-functionals (performance, security, accessibility).
Output as a numbered list with severity: high / medium / low.

The quality of the prompt determines the quality of the output — vague prompts produce generic answers; structured prompts with role, context, and output format produce professional-grade drafts requiring minimal editing [16].

Platform-native integrations

For teams already inside an ecosystem, native integrations avoid context-switching:

Platform AI capability Key action for BAs
Jira / Atlassian Rovo Drafts AC from Confluence docs, flags gaps “Generate AC” and “Find edge cases” in ticket view
Azure DevOps / C4D Generates FRDs, user stories, test cases One-click backlog generation from PRD parent item
Confluence Rovo 20+ agents: summarise, Q&A, content outline Convert meeting notes to structured requirement pages
Notion AI Agents run autonomously for up to 20 minutes Process sprint retrospectives into updated backlogs

[17] [18] [19]

BDD / Gherkin: the edge-case caveat

AI generates syntactically valid Given/When/Then scenarios from a user story in under a minute [20]. The consistent failure mode: AI-generated Gherkin misses business intent, skips non-functional requirements, and duplicates happy-path variations. Treat it as a coverage scaffold to edit, not a finished contract [20].

The same trap that ended BDD adoption for many teams applies here: if QA writes scenarios in isolation — without analyst or developer review — they drift from business intent and eventually get ignored [15]. Gherkin’s value in 2026 is as the contract layer that keeps humans, AI agents, and stakeholders aligned — but only when it’s co-owned.

Human-in-the-loop: what AI cannot replace

AI generates requirements that are plausible, consistent, and fast. It cannot [6]:

  • Understand stakeholder politics or unstated business constraints
  • Know which requirements carry legal, compliance, or safety weight
  • Make tradeoff calls when requirements conflict
  • Validate that generated language matches real business intent

In regulated industries (ISO 26262, DO-178C, IEC 62304), human review “isn’t optional — it’s a structural requirement baked into every applicable standard” [6]. For everyone else, the practical rule: AI generates, humans approve, no AI-generated spec goes straight to the sprint backlog without a refinement session [5].

The BA/FA/PO role in 2026 shifts from writing specs to validating and refining AI-generated specs — more architectural judgement, less blank-page struggle [21].

Citations · 21 sources

Click the Citations tab to load…