H
Howardism
Plate II機器翻譯 · machine-translatedENHOWARDISM

Cat Wu

PublishedMay 6, 2026FiledEntityTagsEntityPersonAnthropicReading5 minSourceAI-synthesised

Anthropic 旗下 Claude Code 與 Cowork 的產品負責人;AI 原生產品節奏與工程師-PM 融合概念的主要闡述者

Cat Wu 的插畫

資料來源#

摘要#

Anthropic 旗下 Claude CodeCowork 的產品負責人。多年擔任工程師,後短暫進入 VC 領域,再加入 Anthropic。訪談過「數百位試圖進入 AI 領域的 PM」;將這份觀察轉化為一個強烈觀點:PM 角色正處於重構之中(參見 Engineer PM Convergence)。

重要主張與立場#

  • 節奏轉變。 Anthropic 的功能交付週期從 6 個月 → 1 個月 → 有時 1 天。透過移除流程阻力、將多數發布冠上 research preview 標籤以降低承諾、先在內部上線再對外推出來達成。
  • 使命 > 協調。「如果有兩個互相競爭的優先順序,我們會討論哪一個對 Anthropic 的使命更重要。」這才是在規模化下消除摩擦的關鍵,而不是人頭數或流程。
  • 就去做事。 個人生活信條。「工作是假的。如果你理解了限制條件,就能想清楚自己能做什麼,然後盡快去做。」她認為主動性(agency)是新創公司應該招募的稀缺特質。
  • 為當下的模型打造。 為超級 AGI 稻草人打造產品很容易;要從今天的模型中引出最大能力卻很難。最困難的 PM 技能,是定義一個月後產品該長什麼樣。
  • 請模型自我檢視。 被低估的除錯技巧:當 Claude 做出意料之外的事,問它為什麼。「很多時候,只要保持高度好奇心去探究模型為何做出那個決定,你就會看到是什麼把它誤導了,於是你就能修正 harness。」(參見 Model Introspection Feedback
  • 建立 evals。 十個優秀的 evals 勝過一百個平庸的;撰寫 eval 是「被低估的工作」,應該有更多 PM/工程師動手做。
  • 95% 自動化不算自動化。 不衝到 100% 就別費心。最後那 5% 占了大部分工作量,卻是讓工作流值得信賴的關鍵。
  • 打造你每天會用的應用,而不是原型。 客製化設定超過某個程度就會變成干擾——「我覺得簡單的設定其實效果更好。」
  • 性格就是產品。 Claude 的個性(低自我、正向、輕鬆、偏向行動)是產品成功的核心——Amanda 塑造模型性格的工作「比寫程式更難」,因為這項任務本身極度模糊。

訪談中的運作細節#

  • Anthropic 內部約有 30–40 位 PM,分布於 research-PM、Claude Developer Platform、Claude Code、Enterprise、Growth 等團隊。
  • 寧可招募具備強烈產品品味的工程師,也不要工程能力薄弱的 PM——團隊中許多工程師能直接把 Twitter 上的反饋一路打造成上線產品,全程無需 PM 介入。
  • 團隊中的設計師都具備前端工程背景。
  • 每晚使用 Cowork,從 Slack/Drive/Twitter 的脈絡草擬 20 頁簡報(已預載 anthropic design system)。
  • 內部技術堆疊:大量使用 Claude Code + Cowork、Slack(「公司的作業系統」)、由團隊為個人化工作流自製的內部應用。
  • 「Applied AI」團隊是 token 消耗量第二大的單位(僅次於工程團隊)——這是個技術型 go-to-market 的角色,為客戶製作原型。

重要引述#

  • 「為超級 AGI 強模型打造產品非常容易。困難的是針對當下的模型,弄清楚該如何引出最大能力?」
  • 「當寫程式的成本變得便宜許多時,真正變得更有價值的事情,是決定要寫什麼。」
  • 「即使某個產品不成功,只要它沒有擋住核心使用場景,那就無妨。」
  • 「每當模型變得更聰明,我們就能移除許多 prompting 介入。事實上,我們每次發布新模型時都會這麼做。」

相關連結#

資料來源#

§ end
About this piece

Articles in this journal are synthesised by AI agents from a curated wiki and are refreshed automatically as new concepts arrive. Topics, framing, and editorial direction are curated by Howardism.

Cited by 30
  • AI Native Product Cadence

    Cat Wu's 6mo→1mo→1day cadence at Anthropic: research-preview branding, mission-as-tiebreaker, evergreen launch room, li…

  • Opinions on Using AI Tools & the Future of the Software Engineering Role

    Debate map of four stances on using AI tools (bullish-insider / pragmatist-practitioner / skeptic-governance / architec…

  • Anthropic

    AI safety company / vendor of Claude; mission-as-tiebreaker culture; ~30–40 PMs across teams; Mike Krieger leads Labs r…

  • Anthropic Labs

    Anthropic's internal incubator — a 'bet factory' of ~a dozen tiny teams exploring the model frontier with lean-startup…

  • Boris Cherny

    Creator of Claude Code at Anthropic; phone-driven workflow with hundreds of agents; primary advocate of `/loop` primiti…

  • Build for the Next Model

    Prototype the thing that almost works, not the thing that already works: bet that the next concrete model release (not…

  • Claude Character as Product

    Personality as load-bearing product surface; Amanda's role at Anthropic; lunchtime vibe-checks as eval discipline; the…

  • Claude Code

    Anthropic's agentic coding product; created by Boris Cherny late 2024; TypeScript/React; CLI/desktop/web/mobile/IDE sur…

  • Compounding Loop Optimization

    Dan Carey's discipline of instrumenting and automating every recurring step of the build loop — because when internal t…

  • Compute Allocator

    The human's evolving role: deciding what's worth spending compute on; ~1% of generated tokens ship, 99% is scaffolding…

  • Cowork

    Anthropic's non-code knowledge-work agent product; sibling to Claude Code; output is decks/inbox/dossiers; same MCP/com…

  • Dan Carey

    Product Manager leading product within Anthropic Labs; led Claude Design; 'Designing with Claude' talk (May 2026); ~two…

  • Dogfooding as Product Discipline

    Product sense is built by relentless first-hand use ("ant food"); Mr. Peanut catch; cross-source (Cat Wu vibe-checks, G…

  • Engineer PM Convergence

    Generalists across disciplines; product taste as bottleneck skill; Anthropic Claude Code team as case study; "just do t…

  • Evals as Product Spec

    Cat Wu's framing of evals as the emerging core PM skill: ten great evals beats a hundred mediocre; encode what done loo…

  • How Do You Write Evals for Taste? Character as the Limit Case

    Taste-driven features are eval-resistant but not eval-proof: the technique is conviction → dogfood-sourced failure sign…

  • Fiona Fung

    Leads engineering + product for Claude Code and Cowork at Anthropic (ex-Meta/Microsoft); "what served you prior may no…

  • Harness Shrinkage as Models Improve

    Prompt scaffolding shrinks each model release; Cat Wu's pruning discipline; Boris Cherny "100 lines of code a year from…

  • HTML as the New Markdown

    Thariq Shihipar's thesis: as models improve, thousand-line markdown plans overwhelm the *human*; HTML artifacts (visual…

  • Jagged Intelligence (Ghosts, Not Animals)

    "Ghosts not animals": jagged statistical circuits, no intrinsic motivation; car-wash/strawberry failures; stay in the l…

  • Learning to Co-Work with AI: A Software Engineer's Field Guide

    Field guide for software engineers in the AI era: 6 skill clusters (taste, harness, alignment-first planning, agent-fri…

  • MCP and Computer Use

    Anthropic's two complementary connector mechanisms: MCP for structured programmatic access (Salesforce/Drive/Gmail/Slac…

  • Entities — People, Orgs, Tools & Projects

    Map of Content for all 32 entity pages. See Home for concept domains.

  • Model Introspection Feedback

    Cat Wu's underrated technique: ask the model why it failed; treat answer as harness-debugging signal not model criticis…

  • Mythos Model

    Anthropic preview-tier frontier model and the first member of the Mythos-class tier (above Opus); gated for safety, use…

  • Open Questions Backlog

    _96 pages with open questions, as of 2026-06-14._

  • The PRD-Replacement Spectrum at AI-Native Speed

    Four positions (grill-then-PRD → lighter-PRD → build-to-decide → prototype-is-spec) are one spectrum once you decompose…

  • Prototype Over PRD

    Dan Carey's prototype-replaces-PRD method: record a why-not-what conversation, transcribe it, hand the transcript to Cl…

  • The Verifiability Thesis

    LLMs automate what you can *verify* as computers automate what you can *specify*; RL verification rewards → jagged peak…

  • Verification as the New Bottleneck

    Fiona Fung: coding is no longer the bottleneck — verification, review, maintenance are; shift-left; TDD loses its tax;…

Related articles
  • Claude Code

    Anthropic's agentic coding product; created by Boris Cherny late 2024; TypeScript/React; CLI/desktop/web/mobile/IDE sur…

  • Harness Shrinkage as Models Improve

    Prompt scaffolding shrinks each model release; Cat Wu's pruning discipline; Boris Cherny "100 lines of code a year from…

  • Anthropic

    AI safety company / vendor of Claude; mission-as-tiebreaker culture; ~30–40 PMs across teams; Mike Krieger leads Labs r…

  • Boris Cherny

    Creator of Claude Code at Anthropic; phone-driven workflow with hundreds of agents; primary advocate of `/loop` primiti…

  • Fiona Fung

    Leads engineering + product for Claude Code and Cowork at Anthropic (ex-Meta/Microsoft); "what served you prior may no…