The prospect of a DeepSeek R2 release has sparked speculation
Hugging Face CEO Clément Delangue posted a tweet that fueled widespread speculation, hinting that DeepSeek R2 might be on the verge of release; the link attached to the tweet points to DeepSeek's Hugging Face repository. Meanwhile, leaked information about DeepSeek R2 is spreading rapidly on social media. It claims the model has 1.2 trillion parameters, 78 billion of them active, and adopts a hybrid MoE (Mixture of Experts) architecture. Its cost is also said to be 97.3% lower than that of GPT-4o, with 5.2 PB of training data and a reported test score of 89.7%. However, the authenticity of this information has not been confirmed, and as of now neither DeepSeek nor Qwen officials have responded.