Why I'm Building a Local-First AI Coding Workspace (And How Behavioral Routing Makes It Work) There's a pattern forming in the AI coding tool space that I think is worth paying attention to. GitHub paused Copilot Pro+ signups because agentic workloads broke their cost model. Cursor Pro+ is $60/mo and climbing. Claude Code might leave the Pro tier entirely. The common thread: these tools are all cloud-only, which means every user interaction is a cost event on someone else's infrastructure. At scale, the math stops working. Prices go up, access gets gated, and developers end up paying more for less. I left my role at Wise earlier this year to build a different kind of AI coding tool. One where local inference is the default and cloud is a resource you use intentionally. That tool is called Rada. This post is a technical walkthrough of how it works under the hood. The core thesis Most of what developers ask an AI coding assistant to do doesn't need a frontier cloud model.…