Goodhart’s law is ruining models, a call to (experienced) arms.

1 / 2

Goodhart’s law is ruining models, a call to (experienced) arms.

DEV Community·Justin Schroeder·about 1 month ago

#HupmJUBn

#ai #webdev #programming #productivity #experienced #coding

Reading 0:00

15s threshold

I’ve been coding for about 20 years professionally, and frankly I don’t really “code” anymore—if by coding you mean typing syntax on a keyboard. Surprisingly, I don’t feel any less effective or any less of an engineer. I actually feel rather excited as various bottlenecks around me are removed. So yeah, I’m not worried about AI taking my career away. It already took away my coding, and I still love my job. But I am worried about something else: overfitting. As a senior engineer, I want better models and agents! The more I can delegate with confidence, the more actual engineering I can do and the more valuable systems I can architect. Unfortunately, recent models have been overhyped, overfit, and benchmaxxed, all while increasing in price. Why? It’s Goodhart’s Law in practice: “When a measure becomes a target, it ceases to be a good measure.” We’ve made these benchmarks the measure of a model’s efficacy, but in the process we’ve ended up ruining the measurement and polluting the training.…

Continue reading — create a free account

Join HashtagPLUS to read full articles, follow hashtags, vote, and join the conversation.

Create free account Log in

Menu

Goodhart’s law is ruining models, a call to (experienced) arms.