The modern integrated development environment is beginning to feel like a relic. For decades, the mark of a great engineer was their fluency in a language, their ability to navigate complex syntax, and the speed at which they could translate a requirement into a working function. But a strange phenomenon is taking hold in the high-end dev community. Engineers are reporting a sudden, jarring detachment from the keyboard. The act of typing code is no longer the primary bottleneck of production; instead, the bottleneck has shifted to the human's ability to judge whether the generated output is actually good.
The Commoditization of the Commit
By the second half of 2025, the industry hit a critical performance threshold. The release of models such as Opus 4.5, GPT-5.2, and Gemini 3 between November and December 2025 fundamentally altered the economics of software creation. These models reached a level of proficiency that mirrors seasoned engineers, collapsing tasks that previously required hours of deep focus into a few minutes of prompt refinement. The impact is not theoretical. Boris Cherny, the creator of Claude Code, reported that as of December 2024, 100% of his committed code was authored by AI, and he had completely abandoned the use of a traditional IDE.
This shift is accelerating toward a total inversion of the development process. Anthropic CEO Dario Amodei predicted that by March 2025, AI would be responsible for writing 90% of all code. This trajectory suggests a world where the marginal cost of producing a line of code is effectively zero. Malte Ubl, the CTO of Vercel, has analyzed this trend as the convergence of software production costs toward zero. When the ability to generate syntactically correct, functional code becomes a commodity, the competitive advantage of the engineer can no longer reside in the act of writing. The value proposition is migrating away from the how and toward the what.
The Internal Evaluation Function
If writing code is now a commodity, the new premium is a quality referred to as taste. In technical terms, taste is the quality of an engineer's internal evaluation function. This is the cognitive mechanism that allows a human to look at two different AI-generated implementations and instinctively know which one is superior. This function operates across three distinct layers of sophistication.
First is recognition. This is the ability to use pattern matching to identify which implementation is more scalable or simpler. It is the difference between a solution that works today and one that will break the moment the user base doubles. Second is the compass. This is the directional capacity to determine if a specific feature is even necessary before it is built, steering the AI toward the most efficient convergence point. Third is vision. This is the highest form of taste, involving the ability to predict the importance of a system's architecture two years into the future, ensuring the current build does not create a legacy nightmare.
These internal evaluation functions manifest in high-stakes technical decisions. For example, the OpenAI Codex team chose Rust over TypeScript when developing their CLI. This was not a decision based solely on raw performance benchmarks, but rather a recognition that the constraints of Rust—its strong typing and explicit memory management—would shape a more disciplined engineering culture. Conversely, Claude Code opted for TypeScript, recognizing that the model's on-distribution strengths were better aligned with that ecosystem. The choice of language is no longer about what the human can write, but about what the system should be.
This new paradigm also introduces a structural shift in how teams operate, most notably through the 30/70 rule. The Codex team, while using prompts for nearly all code generation, maintains a strict human-in-the-loop requirement for the 30% of the codebase that is mission-critical, such as permission systems. The remaining 70% of non-core code is left to AI review. To optimize this, they have introduced `AGENTS.md`, a specialized README file written specifically for AI agents rather than humans, providing the structural context the model needs to increase its success rate during codebase navigation.
As the volume of AI-generated Pull Requests skyrockets, the very nature of the code review is changing. Peter Steinberger has proposed redefining PRs as Prompt Requests. In this model, the reviewer stops obsessing over the generated code itself and instead reviews the prompt that produced it. If the prompt is logically sound and the intent is clear, the resulting code is often a secondary concern. This strategy prevents reviewer burnout and shifts the focus back to the systemic intent.
For the engineer, this means the zone of value creation has narrowed to five critical areas. First is problem selection, finding the high-leverage point where solving one problem resolves several others. Second is system architecture, designing thin business logic layers that allow models to be swapped out with minimal friction. Third is quality judgment, setting the bar for what is actually sufficient for release. Fourth is user empathy, such as designing nuanced UX cues like loading messages that explain a model's current thought process to reduce perceived latency. Finally, there is communication, the ability to frame technical achievements into a narrative the market can understand.
Developers who continue to compete on implementation speed are fighting a losing battle against an exponential curve. The path to longevity in the AI era requires a pivot toward product thinking and business fluency. The most successful engineers will be those who stop seeing themselves as writers of code and start seeing themselves as architects of intent, transitioning into roles akin to founding engineers or tech leads who decide exactly what should exist and why.




