Twelve-factor agents: Building reliable LLM applications

June 1, 2025 technology llm architecture

Dexter Horthy has put online a guide on building production level LLM applications. The guide presents a structured approach for developing LLM-powered software, in similar fashion as the original 12 factor framework.

12 factors for LLM applications

Building LLM applications presents challenges when moving from prototype to production. The gap between demonstration quality and production readiness requires proven engineering principles. The original 12 Factor Apps methodology provided structure for SaaS development (web-based software delivered as a service to users). Dexter created a similar framework for LLM applications.

Most production AI agents are not fully autonomous systems. They consist of traditional software with LLM components integrated at specific points. This guide documents patterns for building functional systems.

Diagram showing how LLMs make real-time decisions to determine execution paths — Let the LLM make decisions in real time to figure out the path

The twelve principles are:

Natural language to tool calls
Own your prompts
Own your context window
Tools are just structured outputs
Unify execution state and business state
Launch/pause/resume with simple APIs
Contact humans with tool calls
Own your control flow
Compact errors into context window
Small, focused agents
Trigger from anywhere, meet users where they are
Make your agent a stateless reducer

The complete guide is available at 12-factor-agents on GitHub.

Two factors stand out as particularly important yet factor 4 is for example often overlooked in the current AI hype with all these marketing terms:

Factor 4: Tools are just structured outputs

This principle defines tools as structured JSON outputs from LLMs that trigger deterministic code. All the function calls you see today are simply JSON outputs. The pattern separates decision-making from execution. The LLM determines actions while application code controls implementation. This provides flexibility in execution logic without being constrained to specific function calls.

Factor 10: Small, focused agents

This principle recommends building specialised agents for specific tasks rather than monolithic, everything-in-one systems. Context window limitations affect LLM performance as complexity increases. Limiting agents to 3-10 steps maintains manageable context and improves reliability. This approach facilitates debugging and testing while allowing gradual scope expansion.

The original 12 Factor Apps

The 12 Factor Apps methodology, created by Adam Wiggins, established principles for building web applications. It addressed configuration management, dependency isolation, and deployment practices in SaaS development.

The 12-factor agents methodology addresses similar fragmentation in LLM application development. Both methodologies aim to create systematic approaches for building scalable software. Just as 12 Factor Apps gives order to web application development, this new framework provides structure for the field of LLM powered applications.

View this page on GitHub.