Argo Rollouts in Production: Canary, AnalysisTemplates, and the Gotchas Nobody Documents

1 / 2

Argo Rollouts in Production: Canary, AnalysisTemplates, and the Gotchas Nobody Documents

DEV Community·Akshat Sinha·26 days ago

#pOMTIJkI

#alerts #deploys #rollout #argo #canary #rollouts

Reading 0:00

15s threshold

It started with a routine Tuesday deploy. Nothing fancy, a small config change to our ingress controller across a few clusters. We'd done this a hundred times. Standard values.yaml modification and then letting ArgoCD do its magic, watch the rolling update do its thing, grab a Tea ( personal preference, you can grab a coffee as well ). Famous last words. By the time I checked the dashboards, three clusters were throwing 502s. The rolling update had dutifully cycled through pods, but it had no clue that the new config was messing up our TLS termination. It just kept going. That's the thing about Kubernetes Deployments, they're optimistic to a fault. They'll roll out bad code with the same enthusiasm as good code, and by the time your metrics catch up, you've already blasted through all your replicas. I spent the afternoon writing rollback scripts and explaining to stakeholders why "production-ready" Kubernetes had just taken down three environments.…

Continue reading — create a free account

Join HashtagPLUS to read full articles, follow hashtags, vote, and join the conversation.

Create free account Log in

Menu

Argo Rollouts in Production: Canary, AnalysisTemplates, and the Gotchas Nobody Documents