TL;DR Nvidia released Nemotron 3 Nano Omni, an open-weight multimodal model that unifies vision, audio, and language in a single architecture with 30B parameters but only 3B active per inference. It claims 9x throughput over comparable open models and tops six benchmarks. Available under Nvidia’s Open Model Agreement for commercial use, it targets edge AI agent deployment on single GPUs, making Nvidia a competitor not just in AI infrastructure but in the models that run on it. Nvidia released Nemotron 3 Nano Omni on Tuesday , an open-weight multimodal AI model that unifies vision, audio, and language understanding in a single architecture designed to power autonomous AI agents on edge devices. The model has 30 billion parameters but activates only three billion per forward pass through a mixture-of-experts design, a ratio that allows it to run on a single GPU while matching or exceeding the multimodal capabilities of models several times its size.…