Run Step 3.7 Flash on NVIDIA GPUs with Enterprise-Ready Multimodal AI

1 / 5

Run Step 3.7 Flash on NVIDIA GPUs with Enterprise-Ready Multimodal AI

NVIDIA Technical Blog·Anu Srivastava·4 days ago

#f1LmagkG

#developer #nvidia #flash #tuning #fine #photo

Reading 0:00

15s threshold

AI applications are moving beyond text generation to multimodal systems that can perceive, search, and reason across images, documents, video, and language in real time—turning fragmented information into actionable insights.   Step 3.7 Flash , the latest from StepFun, brings these capabilities to production and enterprise-scale, available on NVIDIA-accelerated infrastructure. It is a 198B-parameter Mixture-of-Experts vision-language model, with approximately 11B activated parameters per forward pass, optimized for agentic workflows that combine perception, search, and multi-step reasoning at production scale.  With native image and video input, three configurable reasoning levels—low, medium, and high—and a 256k context window, it is designed for enterprise use cases such as financial analysis, concurrent…

Continue reading — create a free account

Join HashtagPLUS to read full articles, follow hashtags, vote, and join the conversation.

Create free account Log in

Menu

Run Step 3.7 Flash on NVIDIA GPUs with Enterprise-Ready Multimodal AI