Loop Engineering: How Claude Code and Codex are Replacing the Prompt

Every developer using LLMs has experienced the same friction: the exhausting ritual of re-explaining project context, coding standards, and architectural constraints at the start of every new session. We have spent the last two years treating AI as a sophisticated chat interface, exchanging single turns of input and output in a hope that the model remembers the nuance of our codebase. But the industry is hitting a ceiling with this conversational approach. The friction of the prompt is becoming a bottleneck, and a new paradigm is emerging to solve it. We are moving away from telling the AI what to do and toward designing the systems that tell the AI how to operate.

The Architecture of the Recursive Loop

This shift is known as Loop Engineering. Rather than a linear sequence of prompts, Loop Engineering treats AI interaction as a recursive goal system. In this model, a developer defines a high-level objective, and the AI operates within a loop, executing tasks, verifying results, and iterating until the goal is achieved. Boris Cherny, the lead for Claude Code at Anthropic, describes the core of modern AI development not as prompt writing, but as the construction of these loops. This approach aligns with what Peter Steinberger calls the transition from prompting to designing the loops that execute those prompts, effectively moving the developer from the role of an operator to that of a system architect.

To function, a loop requires more than just a powerful model; it requires a harness. Both Claude Code and Codex implement this through five critical components. First are Automations, the heartbeat of the system. While Codex allows users to set execution prompts and intervals via an Automations tab, Claude Code integrates deeper into the developer workflow using the `/loop` command, cron jobs, and GitHub Actions. This allows the agent to continue working on a task even after the developer has closed their laptop.

Second is the implementation of Worktrees. When multiple agents operate on a codebase simultaneously, file conflicts are inevitable. To solve this, these tools leverage `git worktree` to isolate work directories while sharing the same repository history. Codex builds this in natively, while Claude Code utilizes the `--worktree` flag and the `isolation: worktree` configuration. This ensures that helper agents can spin up an independent environment, complete a task, and clean up after themselves without polluting the main development branch.

Third, the system addresses the goldfish memory of LLMs through Skills. Instead of repeating conventions in every prompt, developers maintain a `SKILL.md` folder. This directory stores build steps and coding standards that the agent reads at the start of every execution, filling the gaps in the model's immediate intent.

Fourth, the Model Context Protocol (MCP) serves as the connective tissue via Plugins and Connectors. MCP allows the agent to step outside the text editor and interact with the real world—querying databases, reading issue trackers, calling APIs, or sending notifications to Slack. Finally, the system employs Sub-agents to separate concerns. By defining agent teams in `.codex/agents/` using TOML files or via `.claude/agents/`, developers can split the workload between a writer and a reviewer. When paired with memory systems like Linear boards or markdown files, the AI maintains continuity across sessions that would normally be lost in a standard chat window.

The Maker-Checker Tension and the Risk of Cognitive Surrender

The real power of Loop Engineering is not just automation, but the introduction of verifiable stop conditions. In a standard chat, the AI tells you it is finished, and you have to trust it. In a loop, the `/goal` command creates a hard requirement that must be mathematically or logically proven true before the loop terminates. This introduces the Maker-Checker pattern. To prevent the common failure where an AI validates its own incorrect code, a separate, often smaller, model is deployed as the Checker. For instance, a developer might set a goal such as test/auth must pass all tests and lint must be clean. The Maker agent writes the code, and the Checker agent runs the tests. The loop only breaks when the Checker confirms the condition is met.

However, this leap in productivity introduces three systemic risks that the modern engineer must manage. The first is the displacement of verification responsibility. A loop that reports done is not providing a proof of correctness; it is merely reporting that its internal conditions were met. The final sign-off remains a human obligation.

The second risk is comprehension debt. As loops generate larger volumes of code with less human intervention, a gap opens between the actual state of the codebase and the developer's mental model of it. When a developer ships code they didn't personally write or deeply review, they are accumulating a debt that will eventually be paid during a critical system failure.

Finally, there is the danger of cognitive surrender. The sheer convenience of a functioning loop can lead to a state of passive acceptance, where the developer stops critically analyzing the AI's output because it usually works. This erosion of critical thinking is the most dangerous byproduct of automation, as it transforms the engineer from a creator into a mere supervisor of a black box.

Success in the era of Loop Engineering is no longer measured by who can write the most clever prompt, but by who can design the most robust state management and role distribution systems. The loop is a lever for leverage, but the engineer's primary value now lies in their ability to resist the gravity of automation and maintain the rigorous judgment required to ensure software quality.

Loop Engineering: How Claude Code and Codex are Replacing the Prompt

The Architecture of the Recursive Loop

The Maker-Checker Tension and the Risk of Cognitive Surrender

Related Articles