I’ve been coding for about 20 years professionally, and frankly I don’t really “code” anymore—if by coding you mean typing syntax on a keyboard. Surprisingly, I don’t feel any less effective or any less of an engineer. I actually feel rather excited as various bottlenecks around me are removed. So yeah, I’m not worried about AI taking my career away. It already took away my coding, and I still love my job. But I am worried about something else: overfitting. As a senior engineer, I want better models and agents! The more I can delegate with confidence, the more actual engineering I can do and the more valuable systems I can architect. Unfortunately, recent models have been overhyped, overfit, and benchmaxxed, all while increasing in price. Why? It’s Goodhart’s Law in practice: “When a measure becomes a target, it ceases to be a good measure.” We’ve made these benchmarks the measure of a model’s efficacy, but in the process we’ve ended up ruining the measurement and polluting the training.…