The Google T5Gemma model has been released

AI Daily News · by dongdong · updated 2 weeks ago

Google has released T5Gemma, a new family of large language models built on an encoder-decoder architecture and published in 32 variants. T5Gemma targets text generation tasks such as summarization, machine translation, question answering, mathematical reasoning, and reading comprehension. The models are built on the Gemma 2 framework and use a technique called “model adaptation” to convert a pre-trained decoder-only model into an encoder-decoder one. T5Gemma also supports flexible “imbalanced” configurations, for example pairing a 9B-parameter encoder with a 2B-parameter decoder, which allows a trade-off between output quality and inference efficiency. Its key advantage is that it outperforms decoder-only models at the same inference compute, while letting the encoder and decoder be sized independently for a given task.
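As a rough illustration of how such an encoder-decoder checkpoint might be used, the sketch below runs a short summarization pass with Hugging Face Transformers. The checkpoint name “google/t5gemma-9b-2b-prefixlm” and the plain-text prompt format are assumptions made for illustration; check the Hugging Face Hub for the actual released model IDs and recommended usage.

```python
# Minimal sketch: summarization with a (hypothetical) T5Gemma checkpoint.
# The model ID below is an assumption -- verify the real IDs on the Hugging Face Hub.
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

model_id = "google/t5gemma-9b-2b-prefixlm"  # assumed: 9B encoder paired with a 2B decoder

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

article = (
    "Google has released T5Gemma, an encoder-decoder model family built on Gemma 2, "
    "adapted from pre-trained decoder-only checkpoints and published in 32 variants."
)

# The encoder reads the full input once; the smaller decoder generates the summary,
# which is where the quality/efficiency trade-off of imbalanced configurations shows up.
inputs = tokenizer(article, return_tensors="pt").to(model.device)
output_ids = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```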
