Context · The Pitch · The Build · The Product · The Demo · Outcome · Rollout · Reflection

Major Australian Bank · 2026

From pitch to proof in two weeks: 90× faster requirements generation.

I pitched an agentic document generator to a skeptical migration programme, built a working system in two weeks, and now have 11 active users at 100% adoption generating requirements, PO briefs, and architecture docs with 10 validation gates and mandatory human review. The Agentic Document Generator (ADG) draws from a governed Context Library to produce citation-traced SDLC documents in minutes, not weeks. This is the first Generation-phase product built on The Context Lifecycle.

AI Product Agentic Workflows Context Engineering Enterprise AI 0→1
My Role
AI Product Lead · Cross-functional rollout
Stack
Cursor (agentic AI) · Claude · GitHub Enterprise · Context Library
Organisation
Major Australian Bank · Personal Banking
Timeline
Pitch → demo in 2 weeks · production rollout ongoing
Status
In production · 11 active users · 100% adoption
Results analysis comparing four methods for generating requirements documentation, showing PRP cascade with Context Library at about 19 minutes versus a human BA at 27 to 40 hours

Measured results from a live migration epic: four methods compared on time, token use, and quality risk. The ADG system (PRP cascade + Context Library) completed in ~19 minutes with low risk. A human BA took 27 to 40 hours.

Documentation was gating a high-stakes migration.

A major platform migration programme had dozens of epics in flight across Product, Business Analysis, Architecture, and Solution Design. The delivery cycle was sequential: teams could not start building until requirements, architecture, and solution design documents were written and signed off. Manual documentation was the bottleneck, and it was getting worse as the programme scaled.

The root cause was not writing speed alone. It was knowledge fragmentation. Context lived across Confluence, Word, PowerPoint, Excel, and people's heads. Collecting it to inform a single document was time-consuming. AI tools like Cursor were already licensed, but without structured context to draw from, trust in AI accuracy stayed low. Prompting was tedious, there was no reusable process, and the learning curve added friction on top.

01

Sequential delivery, blocked on documents

Requirements gathering, architecture, and solution design had to complete before engineering could proceed. Every epic waited on documents that took days to produce from scattered sources.

02

AI without grounding fails at enterprise scale

Raw prompting over unstructured documents produced hallucinations, no lineage, and no constraints. Teams tried Cursor, hit the same wall, and went back to manual work.

03

The platform existed. The consuming product did not.

I had already built the Context Lifecycle and its underlying library infrastructure. What the migration programme needed was a product on top of it: a system that turned governed context into signed-off SDLC documents, on demand.

Cursor can accelerate documentation, but only when it has structured context to draw from. Solve fragmentation, and the rest follows. ADG is the consuming application. The Context Lifecycle is the framework underneath it.

I pitched the value before I had the product.

The migration programme needed a reason to believe before they would commit time. I built a proposal site and presentation framing the problem, the approach, and the projected ROI. The pitch came first. The working system came two weeks later.

Everything in this section is projected, not measured. I was explicit about that with stakeholders. These were the numbers I used to secure attention and sign-off on the plan.

14d → 1–2d
Projected requirements doc cycle: from kickoff-to-sign-off elapsed time, down to one or two days with ADG.
~7.1 hrs
Projected hours saved per person per week across drafting (60% reduction) and information gathering (70% reduction).
66 days
Projected capacity recovered on requirements alone, across 22 in-flight epics in the programme.

Nine features that addressed every objection.

The pitch did not sell AI magic. It sold reliability: easy contribution, team-customised templates, accuracy targeted at 70–80% with mandatory human review, built for speed, reusable across domains, low maintenance, no prompt engineering, human-in-the-loop at every gate, and a system that improves with use.

01

Technically feasible

Built on existing GitHub Enterprise infrastructure. No new platforms to provision. Foundation used proven internal sync scripts. No autonomous agents, no external services: stayed within enterprise AI guardrails.

02

Adoptable (90%+ expected)

Technical users get native Cursor commands that feel like an acceleration of their existing workflow. Non-technical users get a web interface and natural-language chat. No terminal required. A citation-grounded draft in under two minutes was designed to sell the tool faster than any training session.

The architects and engineers in the room were skeptical. They did not commit time to the build. I would have to prove it alone.

Two weeks. Solo build. Then cross-functional rollout.

With no engineering support and no architect time allocated for the initial build, I defined and shipped the entire Agentic Document Generator myself: the agent definitions, the slash commands, the distillation pipeline, the team templates, the 10 validation gates, the adversarial review pass, and the staging output workflow. Two weeks from pitch to a demo-ready system running on real migration content.

Rollout was never solo. From reviews and demos onward, I worked with an Enablement Manager, a principal engineer, a senior delivery manager, an agile coach, and a senior engineer, presenting to business executives and securing approval to expand across the programme.

The constraint was the design brief. If the system only worked when a technical champion was in the room, it would never scale across a 1,000-person domain. Every command had to be self-explanatory. Every output had to ship with lineage. Every draft had to arrive with a human review checklist, not a black box.

2 weeks
Pitch to working demo. Solo definition and build for the first sprint.
4 agents
Collector, librarian, researcher, author. Each mapped to a Context Lifecycle phase.
10 gates
Automated validation on every run, plus author self-review and an adversarial pass before HITL.

ADG: an agentic system built on governed context.

The Agentic Document Generator is a product within the bank's intelligence ecosystem (PIE). It consumes the Context Library I built as part of The Context Lifecycle, customised for the migration programme's Confluence sources, asset team templates, and sub-domain blueprints.

Four agents. Four lifecycle phases. One command each.

01

Collector · Collection · /collect

Reads from external sources (Confluence), converts to machine-readable context files, and places them in the team's sub-domain repo.

02

Librarian · Processing · /process

Validates structure, classifies content, builds relationships, and creates search indexes. Raw pages become governed distillations.

03

Researcher · Retrieval

When a document is requested, searches the library scoped to the product, ranks by relevance with citations, and presents context for review.

04

Author · Generation · /create-requirements · /go-live

Structures output using team templates, applies validation criteria, stages for review, then stamps with reviewer name and date on go-live.

What was in the library at demo time

1,331
Distillations processed. Growing every week as teams contribute.
2,374
Confluence pages collected from the migration programme's sources.
8
Content types: requirements, architecture, change, research, principles, decisions, glossary, personas.

Each distillation carries a title, body, source link, classification, tags, and lineage. The system retrieves by relevance. The document cites every source. When a BA reviews a generated doc and notices a gap, that correction feeds the learning loop. The library improves with use.

Responsible AI: evals baked into every run

Enterprise documentation cannot ship on blind trust. ADG treats responsible AI as workflow design, not a policy slide: human-in-the-loop gates, automated validation, and agent self-critique before a human ever opens the draft.

01

10 automated validation gates

Every generation run passes schema, source-integrity, template, citation, and completeness checks. On a live migration epic: 8 pass · 1 warn · 1 fail. Gate 1 (source-integrity) flagged a missing asset file for BA review before PO sign-off.

02

Author agent self-review

Before staging, the author agent produces a review report: overall confidence rating, what it included, and what it doubts. It flags thin NFR coverage and gaps the BA must validate, not hide.

03

Adversarial pass + mandatory HITL

An adversarial agent challenges the draft before output is staged. The HITL review report gives the BA a checklist: top items to verify, gate summary, and lineage back to library sources. Go-live stamps reviewer name and date.

The demo did the selling. Not the slides.

I ran a live demo on real migration content: a BA types /create-requirements, the author agent asks a few questions, loads team settings and library context, and produces a full requirements draft with citations, a Confluence preview, a lineage file, and a HITL review report. Under two minutes. Citation-grounded. Ready for human review, not blind trust.

Workflow for create-requirements: user types the command, author agent asks questions and loads context, user confirms, agent searches library and drafts with template, outputs five files including requirements doc, preview, session brief, lineage, and HITL review report.

The Generation workflow: seven steps from a single slash command to five staged output files, with human confirmation in the middle and a mandatory review gate at the end.

Measured results: four methods compared

After the demo, I ran a structured analysis comparing four approaches to generating the same requirements document on a live migration epic. This is where projected becomes measured.

Results analysis table comparing four methods: PRP cascade with Context Library at about 19 minutes, human BA at 27 to 40 hours, raw prompt with raw docs at 2.5 to 4 hours, raw prompt with Context Library at 45 to 90 minutes.

Measured on a live epic. Method 0 (ADG: PRP cascade + Context Library) completed in ~19 minutes with low quality risk. A human BA took 27 to 40 hours. Raw prompting without the library was faster than a human but carried high hallucination risk and used 2 to 4× more tokens.

Three layers stack to create the speed: clean distillations with schema (eliminates source noise), pre-configured templates (eliminates formatting guesswork), and automated validation gates (eliminates manual QA cycles). Remove any one layer and speed drops sharply. The Context Library alone is not enough. The templates alone are not enough. Together they produce one-pass generation with no waste.

What the output looked like

On a live migration epic (RS-WSS-001), a single /create-requirements run produced a staged requirements draft, session brief, lineage file, and HITL review notes. The draft included 16 functional requirements, 12 NFRs, and 23 source citations. Overall confidence: medium-high, with explicit gaps flagged for human validation.

Three-panel view of ADG draft output on a live migration epic: Confluence staging preview marked DRAFTING, session brief with lineage metadata, and HITL review notes showing 10 validation gates with 8 pass, 1 warn, 1 fail, medium-high confidence, and a BA checklist before PO sign-off.

Every run ships five files. The HITL panel is the proof layer: author self-review, 10 validation gates, adversarial pass results, and a BA checklist before PO sign-off. Not perfect. Not autonomous.

The skeptics became contributors. Then the programme adopted.

The architects and engineers who did not give me their time during the pitch were at the demo. After seeing a citation-grounded requirements draft generated from their own migration content in under two minutes, the conversation changed. They wanted in: to help refine the system, customise templates for their domains, and start using it themselves.

That conversion unlocked production rollout. The migration programme now has 11 active users (6 BAs, 5 POs) at 100% adoption, generating requirements docs and PO briefs in daily use, with architecture doc templates scaling to architects next.

11/11
Active users at 100% adoption: 6 BAs and 5 POs using ADG in the migration programme.
~19 min
Measured time to first draft on a live epic. Down from 27 to 40 hours for a human BA on the same scope.
90–130×
Speed advantage on first draft, measured on the same epic. Human review gate before sign-off.

Business impact (early)

Dollar ROI is still being measured; this is a new production rollout. Early projected FTE savings from the pitch remain the working estimate: ~7.1 hours saved per person per week across drafting and information gathering, and 66 days of capacity recovered on requirements alone across 22 in-flight epics. I was explicit with stakeholders that these were projections at pitch time; adoption metrics are now measured, dollar impact is next.

Cross-functional team

I defined and built the system in the first two weeks. Scaling it required a team. I led rollout alongside:

Enablement
Manager

Programme sponsorship

Co-presenting to executives and teams, securing approval to expand ADG across the migration programme and into adjacent asset teams.

Principal
Engineer

Architecture review

Technical partner for agent workflow design, validation gate criteria, and enterprise integration with the Context Library.

Senior
Delivery Mgr

Delivery alignment

Aligning ADG rollout with migration programme milestones, epic cadence, and sign-off workflows across BA and PO cohorts.

Agile
Coach

Adoption and change

Co-designing how BAs and POs integrate ADG into existing ceremonies, template customisation, and feedback loops.

Senior
Engineer

Demo and review

Review partner for agent definitions, slash commands, and staging workflow before each programme demo and rollout gate.

What changed

Before After
Requirements docs taking 14+ days from kickoff to sign-off First draft generated in ~19 minutes, with human review gate before sign-off
Context scattered across Confluence, Word, PowerPoint, Excel 1,331 governed distillations, searchable, classified, with source lineage
AI prompting: tedious, repetitive, high hallucination risk Single slash command, template-driven, citation-traced, validation-gated
Architects and engineers skeptical, no time allocated Executive approval secured; architects scaling in for architecture doc templates
Requirements and PO briefs only, manual workflow Requirements + PO briefs live; architecture docs scaling; onboarding pipeline for next teams
Context Lifecycle: platform with no consuming product First Generation-phase proof point: context in, signed-off SDLC documents out

Persona by persona. Doc type by doc type.

ADG did not launch to everyone at once. The roadmap expands by persona and document type: requirements for BAs first, PO briefs next, architecture docs for architects now scaling, then solution design and UX, then a self-serve onboarding pipeline for asset teams across the domain.

ADG rollout roadmap over 10 weeks: Foundation W1-W2, Pilot Alpha W3-W4 with 100 percent PO and BA adoption at W4, Pilot Beta W5-W6 scaling architecture docs to Architects, Expansion to solution designers next, then handover to all teams.

100% adoption for POs and BAs landed at W4 (11/11 active). We are now in Pilot Beta, scaling architecture doc templates to architects. Next: solution designers, then self-serve onboarding for asset teams.

The pace-setter is trust, not ship dates. Each phase proves one persona and doc type on real programme content before the next cohort onboarded. That is how you get 100% adoption instead of a launch announcement nobody uses.

Pitch the promise. Ship the proof. Let the demo convert.

The ADG story is really two stories stacked: a platform story (The Context Lifecycle) and a product story (ADG). The platform gives you governed context. The product turns that context into documents people actually sign off on. Neither works without the other, and having both is what makes this credible as an AI product narrative, not just an infrastructure experiment.

What I learned: skeptical technical stakeholders do not convert on projections, no matter how well-framed. They convert when you show them their own content, cited correctly, in a format they recognise, with a review checklist they would have written themselves. The two-week solo build was not a compromise. It was the strategy. Speed forced focus. Focus produced a demo that sold faster than any slide deck.

What I'd do differently: I would still pitch first (the projections secured the mandate), but I would loop one friendly BA into the build earlier, even without architect time. A co-builder during week one would have surfaced template gaps before the demo, not after it.

Two weeks to demo. 11 users at 100% adoption. 10 validation gates on every run. ~19 minutes vs 27 to 40 hours. The architects who said no became the team scaling it forward.

← Previous
Intelligence Hub
Next →
Financial Wellbeing
Documentation in progress