A production Celery task in Codens Green started returning this, intermittently, only under real load: Control request timeout: initialize Enter fullscreen mode Exit fullscreen mode The string is suspiciously specific. It looks like the kind of message you would see if Claude Code CLI's MCP initialization handshake had timed out on the other side of a pipe. That is what it sounds like. That is not what it was. The task is analyze_code_specification . It spawns Claude Code CLI as a subprocess to analyze a repository against a PRD. It worked in staging, worked locally, worked in CI. It failed in production a few times a day, almost always when more than one analysis was running at the same time. What we eventually shipped: route that task to a dedicated Celery queue, run that queue on a separate EC2 worker tier with 8 GB of memory, pin concurrency to 1. The real bug was the Linux kernel OOM killer terminating Claude Code CLI partway through startup, before it could complete its handshake with the parent task.…