#Dynosim


DynoSim: Simulating the Pareto Frontier

NVIDIA Technical Blog·Yongming Ding·3 days ago


#developer #planner #cache #engine #replay #dynosim

Modern LLM serving is hard to tune because each deployment is a stack of interacting choices: model backend, tensor-parallel shape, prefill/decode split…

