AE-104CLBuilderLaunch5 hrsRefreshed 2025-09-21
GPT-5 PreparednessClaude 4 ConstitutionalGrok RealtimeGemini PerceptionEdge Ready

Agent Field Test

T1 - Agent Engineering Foundations

Execute controlled agent tests with humans in the loop - Capture success and failure stories - Outline next iteration plan

Design dependable coding agents with clear guardrails and success metrics.

Key outcomes

  • Execute controlled agent tests with humans in the loop
  • Capture success and failure stories
  • Outline next iteration plan

Deliverables

  • Field test journal
  • Incident log
  • Iteration memo

Prerequisites

  • AE-103

Evaluation signals

  • AGENT-FIELD-001
  • SAFE-INCIDENT-001

Persona fit

Agent EngineerDelivery LeadRisk & Governance

Assistant orchestration

View assistant playbook

Scout

Agent

Research horizons, regulation updates, and pattern watchlists.

  • Refresh critical intel within 24 hours of change
  • Maintain 95% citation accuracy
  • Flag module freshness risks automatically

Coach

Agent

Pair with learners during modules, labs, and retrospectives.

  • Median response under 2 seconds
  • Satisfaction above 4.6/5
  • Escalate risky experiments within 10 minutes

Critic

Agent

Guardrails, evaluations, and red-team simulations.

  • Detect 98% evaluation anomalies
  • Zero unlogged high-severity incidents
  • Attach control evidence to every flagged issue

Archivist

Agent

Evidence locker, credential manifests, and knowledge graph links.

  • Tag 100% deliverables with owners and signals
  • Keep schema drift under 1%
  • Generate credential payloads automatically

Companion

Agent

Health, pacing, and personalised nudges across squads.

  • On-time nudges for 90% milestones
  • Keep burnout false positives below 5%
  • Publish weekly sponsor-ready progress snapshots

Navigator

Agent

CTA instrumentation, sponsor digest composition, and mastery guardrails.

  • Cover 95% persona CTAs every sprint
  • Generate sponsor digest drafts within 5 minutes of module completion
  • Hold mastery drift within one tier

Micro paths featuring this module

LaunchAgent Engineer

Stand up a dependable coding agent that ships quality pull requests.

Day 1 - Frame and scope
AE-101AE-102
Day 2 - Prototype loop
AE-103RP-201
Day 3 - Guardrails
AE-104OR-501
Day 4 - Demo narrative
CC-401
Day 5 - Handoff
CC-404OR-503
Launch micro path

Credential alignment

This module contributes evidence across multiple credentials. See the credential framework for details.

Primary documentation