H
Howardism
Plate IIEntities中文HOWARDISM

Anthropic Institute

PublishedJune 7, 2026FiledEntityDomainEntitiesTagsEntityOrgAI PolicyGovernanceAnthropicReading3 minSourceAI-synthesised

Anthropic's policy/governance research arm; published *When AI builds itself* (Favaro & Clark, 2026) on recursive self-improvement; agenda includes building the verification systems a credible multilateral AI slowdown would require

Illustration for Anthropic Institute

Sources#

Summary#

The Anthropic Institute is Anthropic's research and policy arm focused on the societal and governance implications of frontier AI. It published When AI builds itself (June 2026) — this wiki's primary source on Recursive Self-Improvement — and has a stated agenda to build, in collaboration with others, the systems that a credible AI slowdown or pause would require (Frontier Pause Verification).

What it does#

  • Public-facing trajectory analysis. When AI builds itself combines public benchmarks (Task Time-Horizon Scaling) with previously-unreported internal Anthropic data (AI Accelerating AI Development) to argue AI is already accelerating AI development and to lay out three futures for RSI.
  • Coordination infrastructure. It plans to "conduct research — in collaboration with many others — and take actions to help build the systems that a credible slowdown or pause would require": verification that other developers have actually stopped, and that a bad actor cannot exploit a coordinated slowdown to jump ahead in secret (Frontier Pause Verification).
  • Convening. In the months after the essay, the Institute plans to organize conversations among policymakers, researchers, civil society, and other AI companies, and to publish the results — explicitly inviting voices outside AI companies into the deliberation.

People#

  • Marina Favaro and Jack Clark co-authored When AI builds itself (editorial support from Santi Ruiz; visuals by Shan Carter, Romello Goodman, Nikki Makagiansar from data by Brian Calvert and Jun Shern Chan).

Connections#

Open questions#

  • How does the Institute's policy posture (favoring an option to pause) interact with Anthropic's commercial incentive to ship frontier models? The essay acknowledges the competitive/geopolitical pressure but doesn't resolve it.
  • What concrete verification mechanisms will the Institute prototype, and on what timeline relative to the RSI trend it warns about?

Sources#

  • When AI builds itself — Anthropic Institute, When AI builds itself (Marina Favaro & Jack Clark, June 2026)
§ end
About this piece

Articles in this journal are synthesised by AI agents from a curated wiki and are refreshed automatically as new concepts arrive. Topics, framing, and editorial direction are curated by Howardism.

Cited by 10
  • AI Accelerating AI Development

    The empirical core of *When AI builds itself*: measured evidence AI already speeds AI R&D at Anthropic — >80% of merged…

  • AI R&D Autonomy Evaluation (AECI)

    How Anthropic measures whether a model can automate or dramatically accelerate AI research — the capability that drives…

  • Anthropic

    AI safety company / vendor of Claude; mission-as-tiebreaker culture; ~30–40 PMs across teams; Mike Krieger leads Labs r…

  • Frontier Pause Verification

    The arms-control problem of a credible, verifiable slowdown or pause of frontier AI: detectability is harder than for o…

  • LLM-Driven Vulnerability Research

    Claude Mythos Preview's emergent cybersecurity capabilities: autonomous zero-day discovery, full exploit chains, and An…

  • METR

    Independent AI-evaluation org behind the 'time horizons' benchmark — the task length a model can complete reliably on i…

  • Entities — People, Orgs, Tools & Projects

    Map of Content for all 32 entity pages. See Home for concept domains.

  • Mythos Model

    Anthropic preview-tier frontier model and the first member of the Mythos-class tier (above Opus); gated for safety, use…

  • Open Questions Backlog

    _96 pages with open questions, as of 2026-06-14._

  • Recursive Self-Improvement

    An AI system autonomously designing and developing its own successor; Anthropic Institute's *When AI builds itself* arg…

Related articles
  • AI Accelerating AI Development

    The empirical core of *When AI builds itself*: measured evidence AI already speeds AI R&D at Anthropic — >80% of merged…

  • Recursive Self-Improvement

    An AI system autonomously designing and developing its own successor; Anthropic Institute's *When AI builds itself* arg…

  • Mythos Model

    Anthropic preview-tier frontier model and the first member of the Mythos-class tier (above Opus); gated for safety, use…

  • Responsible Scaling Policy Evaluations

    Anthropic's RSP gates deployment on pre-release capability evaluations in CBRN, automated AI R&D, and high-stakes misal…

  • Claude Opus 4.8

    Anthropic's most capable general-access model (May 2026); upgrade on Opus 4.7 in SWE/agentic/knowledge work; does not a…