References & Further Reading

The research this course draws on. Follow the sources — the course’s opinions should belong to its telemetry and the literature, not its author.

This course is built mostly from a real, shipped harness, so it cites few formal papers. The reflective-loop unit (06-reflection.md ) is where the named literature appears.

Papers

  • Reflexion — Shinn et al. (2023). An agent critiques its own trajectory in words and uses that as feedback. Supports the reflective loop in Unit 06 (06-reflection.md ).
  • Self-Refine — Madaan et al. (2023). Iterative self-feedback. Supports the reflective loop in Unit 06 (06-reflection.md ).
  • Building Effective Agents — Anthropic. Names the same shape as the evaluator-optimizer loop, cited in Unit 06 (06-reflection.md ).

Note: concepts like OODA, open/closed loops, control theory, and OpenTelemetry appear throughout the course but are not academic citations. For the operational side, see the repo’s Observability Standard rather than a paper.

Last modified June 26, 2026