How We Cut API Response Time from 2.3s to 180ms Using Redis + Smart Caching

📰

How We Cut API Response Time from 2.3s to 180ms Using Redis + Smart Caching

DEV Community: redis·Nitin Srivastava·about 1 month ago

#dev #class #code #cache #redis #article

Reading 0:00

15s threshold

p95 latency dropped from 2.3 seconds to 180 milliseconds. Same hardware, same database, same traffic. The only thing that changed was how we cached — and I don't mean slapping @lru_cache on a function. I'm writing this because every Redis caching tutorial I read before this project showed me the same 15-line example: redis.get(key) or fetch_from_db() . That code works in a notebook. It will absolutely wreck you in production the first time real traffic hits it. This is the layered strategy that actually survived. FastAPI + Python on the server, Redis 7 for caching, Postgres behind it. Everything below is from a real project we shipped for a B2B client earlier this year — roughly 800 requests per minute on the hot endpoints, with read-heavy traffic around product catalog and pricing. The endpoint that was killing us The problematic endpoint returned a pricing quote for a product variant, filtered by region, customer tier, and active promotions.…

Continue reading — create a free account

Join HashtagPLUS to read full articles, follow hashtags, vote, and join the conversation.

Create free account Log in

Menu

How We Cut API Response Time from 2.3s to 180ms Using Redis + Smart Caching