A couple of weeks ago I was hammering through millions of tokens daily, hitting quotas and rate limits left and right, forcing me to switch providers and juggle subscriptions. Then I found Caveman . Most foundational models aim to be helpful assistants, mimicking friendly support staff stuffed with pleasantries. All those "This is a brilliant idea!" and "According to my research using the internet with the playwright web browser, I did find more information regarding that topic bla bla bla..." are bloating up the available context space. What I, as someone running a fleet of AI coding agents daily, really want from my agents is to be efficient communicators: highlight noteworthy information, omit irrelevant, unimportant, or redundant text. Every page of information sent back and forth between me, my agents, and the LLMs costs tokens and pollutes context space. That's where Caveman comes in.…