Menu

Post image 1
Post image 2
1 / 2
0

DeepSeek previews new AI model that 'closes the gap' with frontier models | TechCrunch

TechCrunch·Ram Iyer·about 1 month ago
#dYUyQztN
Reading 0:00
15s threshold

Chinese AI lab DeepSeek has launched two preview versions of its newest large language model, DeepSeek V4 , a much-awaited update to last year’s V3.2 model and the accompanying R1 reasoning model that took the AI world by storm . The company says both DeepSeek V4 Flash and V4 Pro are mixture-of-experts models with context windows of 1 million tokens each — enough to allow large codebases or documents to be used in prompts. The mixture-of-experts approach involves activating only a certain number of parameters per task to lower inference costs. The Pro model has a total of 1.6 trillion parameters (49 billion active), which makes it the biggest open-weight model available, outstripping Moonshot AI’s Kimi K 2.6 (1.1 trillion), MiniMax’s M1 (456 billion), and more than double DeepSeek V3.2 (671 billion). The smaller, V4 Flash has 284 billion parameters (13 billion active).…

Continue reading — create a free account

Join HashtagPLUS to read full articles, follow hashtags, vote, and join the conversation.

Read More