Howardism · vol. 03 · quiet corner of the web


LLM Evaluation, tagged.

Notes: 3 · Tag: LLM Evaluation · Oldest: 14 Apr 2026 · Newest: 23 May 2026

Every article tagged LLM Evaluation, newest first.

C01
The Verifiability Thesis
LLM Architecture · LLM Evaluation · Agent Engineering
AI Engineering · 23 May 2026 · 5′
C02
Interactivity Benchmarks
LLM Evaluation · Multimodal · Human AI Collaboration
Interaction & Multimodal · 13 May 2026 · 4′
C03
Scale-Dependent Prompt Sensitivity
LLM Evaluation · Prompt Engineering · Inverse Scaling · +2
LLM Architecture · 14 Apr 2026 · 9′