ME News reports that on May 20 (UTC+8), according to monitoring by Beating, wafer-scale chip company Cerebras has launched the trillion-parameter large model Kimi K2.6 in enterprise testing, completely eliminating interconnect latency from traditional board-level communication by integrating chips directly across an entire 12-inch silicon wafer. Third-party evaluation firm Artificial Analysis measured its generation speed at 981 tokens/s—6.7 times faster than mainstream GPU cloud services. In long-text tasks with 10,000 input tokens and 500 output tokens, total response time dropped from 163.7 seconds via Kimi’s official API to just 5.6 seconds—a 29-fold improvement. Since model weights are distributed across multiple wafers while activation values are streamed, inter-layer communication runs entirely on the wafer’s on-chip network fabric, achieving a physical communication bandwidth over 200 times that of NVIDIA’s NVLink in the NVL72 architecture. Combined with distributed computing optimizations, Kimi K2.6 stores weights natively in 4-bit format with minimal loss, uses 16-bit floating-point numbers during computation to preserve precision, and employs customized operator kernels with speculative decoding to achieve real-time performance. (Source: BlockBeats)
Cerebras Tests Kimi K2.6 Model with 29x Speed Boost in Long-Text Tasks
KuCoinFlashShare






On May 20 (UTC+8), Cerebras revealed it had tested the trillion-parameter Kimi K2.6 model using its wafer-scale chips. By mounting chips directly on a full 12-inch wafer, the company reduced communication delays. According to Artificial Analysis, the model generated text at 981 tokens per second—6.7 times faster than standard GPU services. In a long-text test with 10,000 input tokens and 500 output tokens, response time decreased from 163.7 seconds to 5.6 seconds, a 29x improvement. On-chain data continues to underscore performance advancements in AI infrastructure.
Source:Show original
Disclaimer: The information on this page may have been obtained from third parties and does not necessarily reflect the views or opinions of KuCoin. This content is provided for general informational purposes only, without any representation or warranty of any kind, nor shall it be construed as financial or investment advice. KuCoin shall not be liable for any errors or omissions, or for any outcomes resulting from the use of this information.
Investments in digital assets can be risky. Please carefully evaluate the risks of a product and your risk tolerance based on your own financial circumstances. For more information, please refer to our Terms of Use and Risk Disclosure.