Sources#
Summary#
Andrej Karpathy coined "vibe coding" in 2025 and, a year later, names its serious successor: agentic engineering. The distinction is about which bar moves. Vibe coding raises the floor — anyone can build software now. Agentic engineering preserves the quality bar of professional software while going much faster: "you're not allowed to introduce vulnerabilities due to vibe coding; you're still responsible for your software, but can you go faster — and how do you do that properly?" It's an engineering discipline for coordinating spiky, fallible, stochastic-but-powerful agents without sacrificing quality.
The two bars#
- Vibe coding — floor up. Everyone can vibe-code anything. "Amazing, incredible." Democratization (cf. Printing Press Software Democratization). Quality is not the point; access is.
- Agentic engineering — ceiling up, quality held. You keep the responsibilities of professional software (security, correctness, maintainability) and use agents to go faster without dropping below that bar. "Doing that well and correctly is the realm of agentic engineering."
These are different activities, not points on one line. One lowers the entry cost; the other raises the output ceiling for people who already clear the bar.
"10x is not the speedup"#
Karpathy explicitly retires the old "10x engineer" trope as too small: "10x is not the speedup you gain… people who are very good at this peak a lot more than 10x." The ceiling on agentic-engineering capability is very high, and the spread between mediocre and AI-native practitioners widens, not narrows. (Echoes Harness Shrinkage as Models Improve: the leverage keeps growing as models improve; the binding constraint becomes the operator's taste — see Outsource Your Thinking, Not Your Understanding.)
What an AI-native practitioner looks like#
Asked to contrast a mediocre vs. a fully AI-native user of cloud code / codex / open claw, Karpathy's answer is mundane and important: invest in your setup, use all the tool's features. Same as the engineers who got the most out of Vim or VS Code — now applied to Claude Code / Codex. Mastery is configuration-and-features fluency, not a secret prompt.
Hiring has to be refactored#
A practical corollary: most teams still hire with the old paradigm (puzzles, leetcode). Karpathy argues agentic-engineering hiring should look like "give me a really big project and watch someone implement it well" — e.g., build a secure Twitter-clone-for-agents, then a red-team agent ("codex 5.4 xhigh") tries to break it and can't. Hiring should test verifiable, end-to-end build-and-defend ability, not isolated puzzle-solving. (See The Verifiability Thesis for why "and it can't be broken" is the load-bearing half.)
The human residue#
Even at the high ceiling, the human stays in charge of spec, taste, judgment, and oversight — agents do the fill-in-the-blanks. His MenuGen war story: the agent matched Stripe and Google accounts by email address instead of a persistent user ID — "such a weird thing to do," the kind of mistake Jagged Intelligence (Ghosts, Not Animals) predicts. You must design the spec ("these must be unique user IDs we tie everything to") and supply the taste; the agent handles the API details you've stopped memorizing.
Connections#
- Andrej Karpathy — coined both terms
- Software 3.0 — the paradigm both activities operate in
- Jagged Intelligence (Ghosts, Not Animals) — why agentic engineering needs human oversight: agents make weird, spiky mistakes
- The Verifiability Thesis — the discipline leans on verifiable build-and-defend tasks
- Outsource Your Thinking, Not Your Understanding — taste/judgment/spec is the residual human bottleneck the discipline turns on
- Harness Shrinkage as Models Improve — the ">10x and widening" leverage curve as models improve
- Printing Press Software Democratization — the floor-raising half is the same democratization Boris Cherny describes
- Claude Code Best Practices — concrete agentic-engineering practice (explore→plan→code, verification-driven)
- Verification as the New Bottleneck — Fiona Fung's org-level account of "preserve the quality bar while going faster"
- Claude Code — the surface where this plays out
Open Questions#
- Karpathy hints at "one domain that's very [valuable]" for founders but won't say which (didn't want to "vague-post on stage"). What verifiable RL-environment domain is he gesturing at?
- If the mediocre/AI-native spread keeps widening, what does that do to team composition — a few extreme outliers plus agents, vs. broad mid-level staffing?
Sources#
Cited by 10
- Andrej Karpathy
Co-founder OpenAI, ex-Tesla AI, Eureka Labs; coined "vibe coding," Software 1/2/3.0, "ghosts not animals," "agentic eng…
- Boris Cherny
Creator of Claude Code at Anthropic; phone-driven workflow with hundreds of agents; primary advocate of `/loop` primiti…
- Building Is Cheap, Arguing Is Expensive
"In technical debate, code wins": generate three PRs vs whiteboard; prototype over design doc; reduce design docs
- Claude Code
Anthropic's agentic coding product; created by Boris Cherny late 2024; TypeScript/React; CLI/desktop/web/mobile/IDE sur…
- Harness Shrinkage as Models Improve
Prompt scaffolding shrinks each model release; Cat Wu's pruning discipline; Boris Cherny "100 lines of code a year from…
- Jagged Intelligence (Ghosts, Not Animals)
"Ghosts not animals": jagged statistical circuits, no intrinsic motivation; car-wash/strawberry failures; stay in the l…
- Outsource Your Thinking, Not Your Understanding
"You can outsource your thinking but not your understanding"; understanding as the non-delegable human bottleneck; know…
- Printing Press Software Democratization
Boris Cherny's analogy: 1400s literacy expansion → AI software-writing expansion; domain knowledge displaces coding ski…
- Software 3.0
Karpathy's taxonomy: 1.0 code, 2.0 weights, 3.0 prompting; LLM as programmable interpreter; MenuGen "shouldn't exist";…
- The Verifiability Thesis
LLMs automate what you can *verify* as computers automate what you can *specify*; RL verification rewards → jagged peak…
Related articles
- Andrej Karpathy
Co-founder OpenAI, ex-Tesla AI, Eureka Labs; coined "vibe coding," Software 1/2/3.0, "ghosts not animals," "agentic eng…
- Compute Allocator
The human's evolving role: deciding what's worth spending compute on; ~1% of generated tokens ship, 99% is scaffolding…
- Boris Cherny
Creator of Claude Code at Anthropic; phone-driven workflow with hundreds of agents; primary advocate of `/loop` primiti…
- Cat Wu
Head of Product for Claude Code and Cowork at Anthropic; primary articulator of AI-native product cadence and engineer-…
- Claude Code
Anthropic's agentic coding product; created by Boris Cherny late 2024; TypeScript/React; CLI/desktop/web/mobile/IDE sur…
