Gemma 4 26B (MoE) and 31B (Dense) from Google are now available on Vercel AI Gateway . Built on the same architecture as Gemini 3, both open models support function-calling, agentic workflows, structured JSON output, and system instructions. Both support up to 256K context, 140+ languages, and native vision. 26B (MoE): Activates only 3.8B of its 26B total parameters during inference, optimized for lower latency and faster tokens-per-second. 31B (Dense): All parameters are active during inference, targeting higher output quality. Better suited as a foundation for fine-tuning. To use Gemma 4, set model to google/gemma-4-31b-it or google/gemma-4-26b-a4b-it in the AI SDK . import { streamText } from 'ai' ; const result = streamText ( { model : 'google/gemma-4-26b-a4b-it' , // or 'google/gemma-4-31b-it' prompt : ` Break down this codebase into modules, identify circular dependencies, and generate a refactoring plan with implementation steps.…