Sources#
- OpenAI Codex lead on the new shape of product work
- Predicting model behavior before release by simulating deployment
Summary#
OpenAI is an AI research company and the maker of the GPT‑5 series (including GPT‑5 Thinking and the Codex coding models) and the ChatGPT product. In this vault it is the principal counterweight to Anthropic across two threads: frontier-safety methodology and agent tooling. It is also the company Andrej Karpathy co-founded — the origin point of the Software 1/2/3.0 and vibe-coding framings that recur throughout the wiki.
What it does (in this corpus)#
- Frontier-safety research. OpenAI authored Deployment Simulation (June 2026) — replaying ~1.3M de-identified production conversations to forecast a candidate model's deployment-time behavior before release, and the cross-lab mitigation for evaluation awareness. Earlier, Deliberative Alignment (Guan et al. 2025) is OpenAI's spec-grounded-CoT alignment method and the strongest non-MSM baseline in the alignment cluster.
- Agent tooling and orchestration. OpenAI ships Codex and the surrounding harness layer: the Codex App Server Protocol (JSON-RPC stdio for headless sessions), the Symphony open-source orchestrator (Linear as a control plane for Codex), and the "harness engineering" framing for an agent-first Codex workflow.
- A measurement asset. Its scale of production traffic is what makes Deployment Simulation work at all — the same proprietary-traffic advantage Production-Sourced Evaluation names, here turned toward pre-release safety forecasting rather than capability benchmarking.
- Workforce-economics research. Its June 2026 study The Shift to Agentic AI: Evidence from Codex uses Codex usage telemetry to document the move from conversational to agentic AI across three populations — the OpenAI/Codex counterpart to Anthropic's returns-to-expertise study (which it cites), and the third major usage-telemetry source in this corpus.
- Its own product culture (self-reported). Andrew Ambrosino's June 2026 interview is the wiki's window into how OpenAI builds: nearly all employees use Codex weekly (dogfooding as culture); teams are "very agentic" with "unlimited tokens," so "everybody's building everything" (Implementation Abundance Inverts Product Work); a bottoms-up exploration culture where products disrupt each other internally; large, mostly-IC teams of "former founders" with "high agency and taste"; and the member-of-technical-staff convention (Role Averaging, Not Role Elimination). Blunt internal feedback loops ("a 2,000-message Slack thread about how stupid we are") are named as why the external product works.
Position relative to Anthropic#
The two labs converge on shared problems from different angles, which is why OpenAI sources keep pairing with Anthropic ones in this wiki:
- On evaluation awareness, Anthropic names the problem (the marquee Opus 4.8 concern) and OpenAI ships a mitigation (deployment-distribution replay).
- On alignment training, deliberative alignment (OpenAI) is the direct-CoT-training baseline that Anthropic's Model Spec Midtraining (MSM) outperforms while better preserving Chain-of-Thought Monitorability.
- On agent orchestration, Symphony/Codex (OpenAI) and Claude Code (Anthropic) are the two reference harnesses the agent-tooling pages compare.
Connections#
- Deployment Simulation — OpenAI's pre-release safety method and its most-cited contribution in this corpus
- Deliberative Alignment — OpenAI's spec-grounded-CoT alignment training (Guan et al. 2025)
- Codex — OpenAI's agentic coding/work platform; the tool whose adoption the June 2026 study measures
- Symphony — OpenAI's open-source Codex orchestrator
- Codex App Server Protocol — OpenAI's headless-Codex JSON-RPC protocol
- Conversation-to-Delegation Shift — the thesis of OpenAI's June 2026 Codex usage study; agentic AI as delegated production
- Andrej Karpathy — OpenAI co-founder; originated the Software 3.0 / vibe-coding framings
- Andrew Ambrosino — product & engineering lead for the Codex desktop app; the source for OpenAI's internal product culture
- Implementation Abundance Inverts Product Work — the "everybody's building everything" product-process shift, drawn from inside OpenAI
- Anthropic — the frontier-lab peer it is repeatedly contrasted with on safety methods and agent tooling
- Perplexity — a deep-research competitor that runs Anthropic (not OpenAI) base models; OpenAI Deep Research is benchmarked against it on DRACO
Sources#
- Predicting model behavior before release by simulating deployment — OpenAI, 2026-06-04 (Deployment Simulation; ~1.3M-conversation GPT‑5-series study)
- The Shift to Agentic AI: Evidence from Codex — OpenAI Economic Research, 2026-06-25 (Codex usage across three populations)
- OpenAI Codex lead on the new shape of product work — Lenny's Podcast, 2026-06-28 (Ambrosino on OpenAI's product culture and the Codex desktop app)
- Also referenced in: An open-source spec for Codex orchestration: Symphony., Harness engineering: leveraging Codex in an agent-first world, Model Spec Midtraining: Improving How Alignment Training Generalizes (deliberative-alignment baseline)
Cited by 18
- Agentic Technical Debt
Debt that *compounds* (not just accumulates) because each agentic-coding session re-derives architectural decisions wit…
- Agentic Work Systematization
OpenAI Codex study's 'systematization' margin: the shift from ad-hoc agent use (describe task → agent does it → done) t…
- Andrej Karpathy
Co-founder OpenAI, ex-Tesla AI, Eureka Labs; coined "vibe coding," Software 1/2/3.0, "ghosts not animals," "agentic eng…
- Andrew Ambrosino
Product & engineering lead for the Codex desktop app at OpenAI; a designer→engineer→PM→founder generalist whose June 20…
- Codex
OpenAI's agentic coding and work platform: a CLI (April 2025) plus a desktop app (built Nov 2025, released Feb 2026) bu…
- Conversation-to-Delegation Shift
OpenAI's Codex usage study (June 2026): the move from conversational AI ('asking') to agentic AI ('delegated production…
- Deployment Simulation
OpenAI's pre-release safety method: replay recent production conversations with a candidate model (strip the old final…
- Dogfooding as Product Discipline
Product sense is built by relentless first-hand use ("ant food"); Mr. Peanut catch; cross-source (Cat Wu vibe-checks, G…
- Implementation Abundance Inverts Product Work
Andrew Ambrosino's inversion thesis: when talking to a frontier model can stand up any feature from scratch, implementa…
- Entities — People, Orgs, Tools & Projects
Map of Content for all 39 entity pages. See Home for concept domains.
- Organizational Complements to AI
The general-purpose-technology argument that AI's productivity gains depend on complementary workflow/skill/org-design…
- Parallel Agent Orchestration
OpenAI Codex study's concurrency + runtime margins: the intensive-user workflow where a human oversees a team of agents…
- Polish No Longer Signals Readiness
Andrew Ambrosino's observation that the medium used to encode process-stage — a production-looking artifact meant late-…
- Prototype Over PRD
Dan Carey's prototype-replaces-PRD method: record a why-not-what conversation, transcribe it, hand the transcript to Cl…
- Role Averaging, Not Role Elimination
Andrew Ambrosino's nuanced OpenAI-side take on role collapse: your role is 'the average of what you spend your time on'…
- Symphony
OpenAI's open-source agent orchestrator (March 2026): turns Linear into a control plane for Codex, per-issue workspace,…
- Vibe Coding vs. Agentic Engineering
Vibe coding raises the floor (anyone builds); agentic engineering preserves the quality bar while going faster; ">10x a…
- Why AI Lags at Design
Andrew Ambrosino's four reasons frontier models are worse at visual/product design than at code: design is hard to grad…
Related articles
- Andrew Ambrosino
Product & engineering lead for the Codex desktop app at OpenAI; a designer→engineer→PM→founder generalist whose June 20…
- Codex
OpenAI's agentic coding and work platform: a CLI (April 2025) plus a desktop app (built Nov 2025, released Feb 2026) bu…
- Engineer PM Convergence
Generalists across disciplines; product taste as bottleneck skill; Anthropic Claude Code team as case study; "just do t…
- Harness Shrinkage as Models Improve
Prompt scaffolding shrinks each model release; Cat Wu's pruning discipline; Boris Cherny "100 lines of code a year from…
- Implementation Abundance Inverts Product Work
Andrew Ambrosino's inversion thesis: when talking to a frontier model can stand up any feature from scratch, implementa…
