Google DeepMind launches DiffusionGemma, boosting text generation speed by 4x
KuCoinFlashShare






On-chain news: Google DeepMind has launched DiffusionGemma, an open-source text generation model that increases speed by 4x. Using diffusion technology, it generates 256 tokens in parallel, achieving over 1,000 tokens/s on H100 and 700+ on RTX 5090. The 26B MoE model activates 3.8B parameters during inference and supports consumer GPUs after quantization. It features bidirectional attention and self-correction, making it ideal for code completion and inline editing. The model is open-sourced under Apache 2.0. New token listings may benefit from such advancements in generation efficiency.
Source:Show original
Disclaimer: The information on this page may have been obtained from third parties and does not necessarily reflect the views or opinions of KuCoin. This content is provided for general informational purposes only, without any representation or warranty of any kind, nor shall it be construed as financial or investment advice. KuCoin shall not be liable for any errors or omissions, or for any outcomes resulting from the use of this information.
Investments in digital assets can be risky. Please carefully evaluate the risks of a product and your risk tolerance based on your own financial circumstances. For more information, please refer to our Terms of Use and Risk Disclosure.