DeepSeek releases new open-source model V3.1, expanding context length to 128K
DeepSeek has announced the open-source release of its new base model, DeepSeek-V3.1-Base. Shortly after it was published on Hugging Face, the model climbed to fourth place on the platform's trending models list. DeepSeek-V3.1-Base uses a Mixture of Experts (MoE) architecture and expands the context length to 128K tokens, while keeping the same parameter count as V3.
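Since the weights are hosted on Hugging Face, the published configuration can be inspected directly to check details such as context length and expert count. The sketch below is a minimal example using the `transformers` library; the repo id "deepseek-ai/DeepSeek-V3.1-Base" and the specific config field names are assumptions based on the announcement and on common DeepSeek/`transformers` conventions, not confirmed by the article.

```python
# Minimal sketch: fetch the model config from Hugging Face and print a few
# architecture details. Repo id and field names are assumptions.
from transformers import AutoConfig

config = AutoConfig.from_pretrained(
    "deepseek-ai/DeepSeek-V3.1-Base",
    trust_remote_code=True,  # DeepSeek repos ship custom model code
)

# Typical fields on a DeepSeek-style MoE config; exact names may vary by release.
print("context length:", getattr(config, "max_position_embeddings", "n/a"))
print("routed experts:", getattr(config, "n_routed_experts", "n/a"))
print("hidden size:", getattr(config, "hidden_size", "n/a"))
```

Loading only the config keeps the check lightweight; downloading and running the full MoE checkpoint requires far more memory and is out of scope here.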