If you’ve been following the AI arms race this year, you know the vibe is currently "Multimodal or Bust." OpenAI has been teasing its massive visual updates, but Google isn't about to let its home turf at Google I/O go uncontested. According to a massive new leak reported by TestingCatalog , Google is internally testing a next-generation model dubbed "Gemini Omni." This isn't just another incremental update to the Gemini 2.0 or 3.0 lines; this is a native, high-fidelity video-to-audio model designed for real-time interaction. If you’re a developer building the next generation of "eyes and ears" for AI agents, this leak just changed your roadmap. Here is what we know about Omni, how it competes with Nano Banana 2, and what the code might look like. 👇 🎥 What is "Gemini Omni"? The "Omni" designation suggests a unified architecture. While earlier models often relied on separate "vision" and "language" encoders that passed tokens back and forth, Omni is rumored to be a native multimodal model.…