NVIDIA open-sources the Llama Nemotron-253B reasoning model, delivering 4 times the throughput of DeepSeek R1.
NVIDIA has announced the open-sourcing of Llama Nemotron-253B, a reasoning model fine-tuned from Llama-3.1-405B. Across multiple benchmarks, Llama Nemotron outperforms Llama 4 and matches the performance of DeepSeek R1 with only about half as many parameters. The model excels at complex mathematical reasoning, scientific question answering, and coding, while delivering 4 times the throughput of DeepSeek R1.
© Copyright Notice
The copyright of the article belongs to the author. Please do not reprint without permission.