DeepSeek Open-Sources Again: Releases 3B MoE OCR Model DeepSeek-OCR
DeepSeek has released a new visual text compression model, DeepSeek-OCR. Built on a 3-billion-parameter Mixture-of-Experts (MoE) architecture, the model compresses visual tokens at up to a 20× ratio and can process 33 million pages per day across 20 nodes. On the Fox benchmark it maintains over 85% accuracy across all tested text-length ranges. DeepSeek-OCR supports multiple resolution configurations, multilingual processing, and complex chart interpretation, and can compress multi-turn dialogue history by roughly 10×.
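The compression ratio mentioned above is conventionally the number of text tokens a page decodes into divided by the number of vision tokens used to encode it. A minimal Python sketch of that arithmetic, using made-up token counts (not DeepSeek's code or API), is shown below.

```python
def compression_ratio(num_text_tokens: int, num_vision_tokens: int) -> float:
    """Ratio of decoded text tokens to the vision tokens that encode them."""
    return num_text_tokens / num_vision_tokens

# Hypothetical page: 2,000 text tokens rendered as an image that the
# encoder represents with 100 vision tokens -> a 20x compression ratio.
print(compression_ratio(2000, 100))  # 20.0
```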