DeepSeek previews new AI model that 'closes the gap' with frontier models

1 / 2

DeepSeek previews new AI model that 'closes the gap' with frontier models | TechCrunch

TechCrunch·Ram Iyer·about 1 month ago

#dYUyQztN

#apple #amazon #cloudcomputing #evs #google #deepseek

Reading 0:00

15s threshold

Chinese AI lab DeepSeek has launched two preview versions of its newest large language model, DeepSeek V4 , a much-awaited update to last year’s V3.2 model and the accompanying R1 reasoning model that took the AI world by storm . The company says both DeepSeek V4 Flash and V4 Pro are mixture-of-experts models with context windows of 1 million tokens each — enough to allow large codebases or documents to be used in prompts. The mixture-of-experts approach involves activating only a certain number of parameters per task to lower inference costs. The Pro model has a total of 1.6 trillion parameters (49 billion active), which makes it the biggest open-weight model available, outstripping Moonshot AI’s Kimi K 2.6 (1.1 trillion), MiniMax’s M1 (456 billion), and more than double DeepSeek V3.2 (671 billion). The smaller, V4 Flash has 284 billion parameters (13 billion active).…

Continue reading — create a free account

Join HashtagPLUS to read full articles, follow hashtags, vote, and join the conversation.

Create free account Log in

Menu

DeepSeek previews new AI model that 'closes the gap' with frontier models | TechCrunch