AE-103 | LB | Builder | Build | 6 hrs | Refreshed 2025-09-21
GPT-5 Preparedness | Claude 4 Constitutional | Grok Realtime | Gemini Perception | Edge Ready

Coding Agent Skeleton

T1 - Agent Engineering Foundations

Design dependable coding agents with clear guardrails and success metrics.

Key outcomes

  • Implement agent routing and tool interfaces
  • Instrument evaluation events and telemetry
  • Package a reusable agent starter kit

Deliverables

  • Agent starter repo
  • Tool contract definitions
  • Telemetry checklist

Prerequisites

  • AE-101
  • AE-102

Evaluation signals

  • AGENT-ROUTER-001
  • EVAL-HOOK-001

Persona fit

Agent Engineer, Developer

Assistant orchestration

Scout

Agent

Research horizons, regulation updates, and pattern watchlists.

  • Refresh critical intel within 24 hours of change
  • Maintain 95% citation accuracy
  • Flag module freshness risks automatically

Coach

Agent

Pair with learners during modules, labs, and retrospectives.

  • Median response under 2 seconds
  • Satisfaction above 4.6/5
  • Escalate risky experiments within 10 minutes

Critic

Agent

Guardrails, evaluations, and red-team simulations.

  • Detect 98% of evaluation anomalies
  • Zero unlogged high-severity incidents
  • Attach control evidence to every flagged issue

Archivist

Agent

Evidence locker, credential manifests, and knowledge graph links.

  • Tag 100% of deliverables with owners and signals
  • Keep schema drift under 1%
  • Generate credential payloads automatically

Companion

Agent

Health, pacing, and personalised nudges across squads.

  • Deliver on-time nudges for 90% of milestones
  • Keep burnout false positives below 5%
  • Publish weekly sponsor-ready progress snapshots

Navigator

Agent

CTA instrumentation, sponsor digest composition, and mastery guardrails.

  • Cover 95% of persona CTAs every sprint
  • Generate sponsor digest drafts within 5 minutes of module completion
  • Hold mastery drift within one tier

Micro lessons

Toolchain Blueprint

35 min

Objective: Design routing logic and tool adapters for the coding agent.

Activities

  • List candidate tools
  • Define interface contracts
  • Sketch routing diagram
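
The activities above can be sketched in code. This is a minimal illustration, not the module's actual starter kit: the `Tool` contract, the `FunctionTool` adapter, the `Router` class, and the `lint` intent are all hypothetical names chosen for the example. It shows one way to keep the interface contract separate from the routing table, and to fail loudly when no tool matches.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Protocol


class Tool(Protocol):
    """Hypothetical interface contract every tool adapter must satisfy."""

    name: str

    def run(self, payload: dict) -> dict: ...


@dataclass
class FunctionTool:
    """Adapter that wraps a plain function behind the Tool contract."""

    name: str
    fn: Callable[[dict], dict]

    def run(self, payload: dict) -> dict:
        return self.fn(payload)


class Router:
    """Routes a request to a tool by its declared intent."""

    def __init__(self) -> None:
        self._routes: Dict[str, Tool] = {}

    def register(self, intent: str, tool: Tool) -> None:
        if intent in self._routes:
            raise ValueError(f"intent already registered: {intent}")
        self._routes[intent] = tool

    def dispatch(self, intent: str, payload: dict) -> dict:
        tool = self._routes.get(intent)
        if tool is None:
            # Guardrail: refuse to guess a tool for an unknown intent.
            raise KeyError(f"no tool registered for intent: {intent}")
        return {"tool": tool.name, "result": tool.run(payload)}


# Illustrative wiring: a stub linter registered under the "lint" intent.
router = Router()
router.register("lint", FunctionTool("linter", lambda p: {"issues": 0, "path": p["path"]}))
routed = router.dispatch("lint", {"path": "src/app.py"})
```

Keeping the routing table explicit makes the routing diagram easy to derive from code: each registered intent is one edge from the router to a tool.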

Knowledge checks

  • Which tool handles evaluation resets?
  • How are secrets managed?

Telemetry Wiring

30 min

Objective: Emit Langfuse and OpenTelemetry traces for each agent step.

Activities

  • Install tracing SDK
  • Label spans with guardrail IDs
  • Verify run export to Supabase stub
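
To make the span-labelling idea concrete, here is a standard-library stand-in. It deliberately does not use the Langfuse or OpenTelemetry SDKs, so none of the names below are their APIs: `agent_span`, `EXPORTED_SPANS`, and the guardrail IDs such as `GR-PLAN-001` are assumptions made for illustration. The sketch only shows the shape of the data: one span per agent step, all sharing a trace ID, each tagged with a guardrail ID.

```python
import time
import uuid
from contextlib import contextmanager
from typing import Dict, Iterator, List

# In-memory stand-in for a trace exporter (hypothetical, not a real backend).
EXPORTED_SPANS: List[Dict[str, object]] = []


@contextmanager
def agent_span(step: str, guardrail_id: str, trace_id: str) -> Iterator[Dict[str, object]]:
    """Record one agent step as a span labelled with its guardrail ID."""
    span: Dict[str, object] = {
        "trace_id": trace_id,
        "span_id": uuid.uuid4().hex,
        "step": step,
        "guardrail_id": guardrail_id,
        "status": "ok",
    }
    start = time.perf_counter()
    try:
        yield span
    except Exception as exc:
        # Mark the span as failed, then let the error propagate.
        span["status"] = "error"
        span["error"] = repr(exc)
        raise
    finally:
        # Always export the span, whether the step succeeded or not.
        span["duration_ms"] = (time.perf_counter() - start) * 1000.0
        EXPORTED_SPANS.append(span)


trace_id = uuid.uuid4().hex
with agent_span("plan", "GR-PLAN-001", trace_id):
    pass  # planning work would happen here
with agent_span("prompt_rewrite", "GR-REWRITE-001", trace_id) as span:
    span["rewritten"] = True  # extra attribute attached inside the step
```

In a real setup, the dictionary fields would map onto span attributes in the chosen tracing SDK, and the exporter list would be replaced by the configured trace store.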

Knowledge checks

  • Which span indicates prompt rewrite?
  • Where are traces stored?

Evaluation Dry Run

40 min

Objective: Execute scripts/eval-harness locally and interpret initial metrics.

Activities

  • Select eval config
  • Run realtime and perception probes
  • Record baseline metrics

Knowledge checks

  • What is the realtime freshness score?
  • Which preparedness gap was logged?

Knowledge points

Agent Skeleton Architecture

Define planner, executor, critic, and monitor components with clear data contracts.
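
One possible reading of this architecture is sketched below. The `Step` and `StepResult` dataclasses act as the data contracts; the planner, executor, and critic are placeholder functions, and `Monitor` simply collects events. All names and behaviours here are assumptions for illustration, and a real planner or critic would call a model rather than return fixed values.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass(frozen=True)
class Step:
    """Contract between planner and executor."""

    action: str
    target: str


@dataclass(frozen=True)
class StepResult:
    """Contract between executor and critic."""

    step: Step
    ok: bool
    output: str


@dataclass
class Monitor:
    """Collects one event per step for later inspection."""

    events: List[str] = field(default_factory=list)

    def record(self, event: str) -> None:
        self.events.append(event)


def plan(goal: str) -> List[Step]:
    # Placeholder planner: always proposes an edit followed by a test run.
    return [Step("edit", goal), Step("test", goal)]


def execute(step: Step) -> StepResult:
    # Placeholder executor: pretends every step succeeds.
    return StepResult(step, ok=True, output=f"{step.action} done on {step.target}")


def critique(result: StepResult) -> bool:
    # Placeholder critic: accept only successful, non-empty results.
    return result.ok and bool(result.output)


def run_agent(goal: str, monitor: Monitor) -> List[StepResult]:
    accepted: List[StepResult] = []
    for step in plan(goal):
        result = execute(step)
        verdict = critique(result)
        monitor.record(f"{step.action}:{'accepted' if verdict else 'rejected'}")
        if not verdict:
            break  # stop at the first rejected step
        accepted.append(result)
    return accepted


monitor = Monitor()
results = run_agent("fix failing unit test", monitor)
```

Because each component only sees the dataclass handed to it, any one of them can be swapped out without touching the others.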

Evaluation Baseline Metrics

Capture pass rate, latency, and preparedness averages as the baseline before scaling runs.
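
As a rough sketch of how such a baseline might be computed, the snippet below aggregates a list of per-run records. The `EvalRecord` fields, the 0.0 to 1.0 preparedness scale, and the sample values are assumptions for illustration; they are not output from the module's eval harness.

```python
from dataclasses import dataclass
from statistics import mean
from typing import Dict, List


@dataclass(frozen=True)
class EvalRecord:
    passed: bool
    latency_ms: float
    preparedness: float  # assumed 0.0-1.0 score; the real scale is not specified


def baseline(records: List[EvalRecord]) -> Dict[str, float]:
    """Summarise eval runs into pass rate plus mean latency and preparedness."""
    if not records:
        raise ValueError("need at least one eval record")
    return {
        "pass_rate": sum(r.passed for r in records) / len(records),
        "mean_latency_ms": mean(r.latency_ms for r in records),
        "mean_preparedness": mean(r.preparedness for r in records),
    }


# Made-up sample runs, used only to exercise the function.
runs = [
    EvalRecord(True, 800.0, 0.5),
    EvalRecord(False, 1200.0, 0.25),
    EvalRecord(True, 1000.0, 0.75),
    EvalRecord(True, 1000.0, 0.5),
]
metrics = baseline(runs)
```

Storing the resulting dictionary alongside the eval config makes later runs directly comparable against the same baseline.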

Micro paths featuring this module

Launch | Agent Engineer

Stand up a dependable coding agent that ships quality pull requests.

  • Day 1 - Frame and scope: AE-101, AE-102
  • Day 2 - Prototype loop: AE-103, RP-201
  • Day 3 - Guardrails: AE-104, OR-501
  • Day 4 - Demo narrative: CC-401
  • Day 5 - Handoff: CC-404, OR-503

Credential alignment

This module contributes evidence across multiple credentials. See the credential framework for details.

Primary documentation