Systematically generating tests that would have caught Anthropic's top-K bug

Most testing strategies miss rare edge cases until customers find them in production. We’ve developed a system that automatically generates targeted unit tests for rare bugs, including a test that would have caught Anthropic’s recent approximate top-K bug.