ControlNet API: pose, depth, and canny-locked image generation If you've ever shipped a creative tool that needed "same pose, different outfit" or "same building, different style", you know the gap. Plain text-to-image gives you a coin flip on composition. Today we're closing that gap with a single endpoint that locks the bits you care about and regenerates the rest. What it does POST /v1/image/controlnet is guided image generation. You hand us a reference image and tell us which signal to extract from it — canny for edges, depth for 3D structure, openpose for human pose, or scribble for rough line drawings. We pull that signal, freeze it, and then generate a new image around it from your text prompt. The practical upshot: the silhouette, stance, perspective, or line structure of your reference survives the round-trip. Everything else — colour, style, subject, background, lighting — is yours to direct with the prompt.…