Writing on AI agent quality, statistical process control for LLMs, and what it means to run production AI responsibly.
A comprehensive guide to understanding why your AI agent's quality degrades over time — and what statistical signals appear before users ever notice.
Read articleThe "honeymoon period" is real. Here's the data on why AI agent quality peaks at launch and what causes the inevitable — but preventable — decline.
Read articleHow Shewhart's 1924 control charts apply perfectly to LLM output quality monitoring — a technical deep dive with worked examples.
Read articleThe most common quality monitoring mistake is deploying without a baseline. Here's the exact process for establishing control limits before you go live.
Read articleQuality degradation in AI agents rarely announces itself. By the time users complain, the damage is already compounding. Here's how to quantify it.
Read article