Key Takeaways The memory wall is a primary constraint on coding agents, not model intelligence. Quadratic attention costs, KV cache growth, and "lost in the middle" degradation create a hard ceiling on how long agents can maintain coherent reasoning. Research breakthroughs compose: 30x+ KV memory reduction is within reach. TriAttention's intelligent pruning and TurboQuant's 3-bit quantization are complementary techniques that stack naturally, while Latent Briefing cuts multi-agent context sharing costs by 49%. Fundamentally different theories of agent memory are emerging. For example, MemPalace bets on structured archival with spatial retrieval; Hippo Memory bets on intelligent forgetting with decay-based consolidation. The field hasn't converged on what wins or perhaps it changes depending on the use case. The harness is becoming an operating system for agent memory.…