This is a submission for the Gemma 4 Challenge: Write About Gemma 4 Note: this post describes a real local architecture I use for development. Exact model names in Ollama, Hugging Face or Kaggle may vary depending on the runtime you use. The important part is not memorizing one command, but understanding how to separate chat, reasoning, multimodal context, code execution and repositories on your own machine. My Local Copilot: Gemma 4 + Open WebUI + OpenHands for Coding Without Leaving My Machine For a long time, I used local models as if they were just another chat window. I pasted an error, copied the answer, went back to my editor, ran tests, copied the next error, and repeated the loop. That works, but it leaves a lot on the table. What makes Gemma 4 interesting to me is not only that it is an open model family with multimodal capabilities and variants that can target different hardware profiles.…