When my partner asked this question, it sounded very specific—but it exposed a broad engineering issue many of us hit in production agent systems: When DeepSeek V3.2 selects a tool via tool_choice="auto", what tokens are actually generated, how is that different from older special-token function-calling formats or strict structured calling, and what does that do to ordered system-instruction adherence? I expected a simple “function-calling behavior” answer. What I found is more useful: this is not just a model question. It is a protocol + parser + orchestration question. The core insight For open-weight DeepSeek V3.2 workflows, tool calling in auto mode is typically: model emits textual wrapper content (DSML-like blocks), runtime/parser extracts tool calls from that text, runtime normalizes into tool_calls[] objects. So the system is often text-generation first, structure recovery second.…