Why Heuristic Detectors Beat LLMs at Finding Agent Failures

1 / 2

Why Heuristic Detectors Beat LLMs at Finding Agent Failures

DEV Community·Tuomo Nikulainen·18 days ago

#M5SgvJuF

#ai #agents #llm #pisama #agent #failure

Reading 0:00

15s threshold

TL;DR: We built 20 core rule-based detectors that find failures in AI agent traces. On the TRAIL benchmark (Patronus AI), they achieve 60.1% accuracy vs. 11.9% for the best LLM. Zero false positives. Zero LLM cost. On Who&When (ICML 2025), combined with a single Sonnet call for attribution, they beat GPT-5.4 Mini on both agent identification (60.3% vs. 60.3%) and step localization (24.1% vs. 22.4%). pip install pisama Enter fullscreen mode Exit fullscreen mode The assumption everyone makes When an AI agent fails in production (it hallucinates, gets stuck in a loop, ignores instructions, drops context), the standard approach is to throw another LLM at the problem. LLM-as-judge. Agent-as-judge. Feed the trace to GPT-4 and ask "what went wrong?" We tested this assumption. The answer is surprising: for most agent failures, simple heuristics work better.…

Continue reading — create a free account

Join HashtagPLUS to read full articles, follow hashtags, vote, and join the conversation.

Create free account Log in

Menu

Why Heuristic Detectors Beat LLMs at Finding Agent Failures