According to ME News, on April 17 (UTC+8), monitoring by Beating showed that PrismML has released the Ternary Bonsai series of language models, which use 1.58-bit (ternary) weights to cut GPU memory usage to roughly one-ninth that of a 16-bit model while maintaining high performance. The series comes in three sizes (8B, 4B, and 1.7B parameters) and is open-sourced on Hugging Face with native support for Apple devices.

The term “1.58-bit model” refers to constraining neural network weights to only three values: {-1, 0, +1}. A three-valued weight carries log2(3) ≈ 1.58 bits of information, hence the name. Compared with earlier ultra-compressed 1-bit models (weights limited to {-1, +1}), the added “0” value lets the model drop redundant connections, which helps it retain sophisticated reasoning capabilities despite its extremely small size. The Ternary Bonsai 8B weight file is only 1.75 GB and achieves an average benchmark score of 75.5, which is 5 points higher than its own 1-bit version, and it significantly outperforms similar dense models such as Qwen3 in “intelligence density” (performance per GB of GPU memory).

Energy efficiency and inference speed are further core advantages of the series. On the iPhone 17 Pro Max, the 8B version runs at 27 tokens per second, with energy efficiency improved by approximately 3 to 4 times. For developers seeking to deploy high-performance AI on edge devices such as smartphones and laptops, this means near-full-precision intelligence with minimal memory overhead. The Ternary Bonsai models are currently supported natively on Apple devices via the MLX framework, and the model weights are distributed under the Apache 2.0 license. (Source: BlockBeats)
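To make the “1.58-bit” idea concrete, here is a minimal Python sketch of mapping ordinary weights to {-1, 0, +1}. The report does not describe PrismML's quantizer, so the absmean rounding rule used here (scale by the mean absolute weight, round, then clamp) is an assumption chosen for illustration, and the names `ternarize` and `dequantize` are hypothetical.

```python
import math

# Information content of one weight that can take three values.
BITS_PER_TERNARY_WEIGHT = math.log2(3)  # about 1.585, the "1.58" in the name


def ternarize(weights):
    """Map floats to {-1, 0, +1} plus one shared scale factor.

    Absmean rounding is an assumed scheme for illustration only;
    it is not confirmed to be what Ternary Bonsai uses.
    """
    scale = sum(abs(w) for w in weights) / len(weights)
    if scale == 0.0:
        return [0] * len(weights), 0.0
    # Round w / scale to the nearest integer, then clamp to [-1, 1].
    ternary = [max(-1, min(1, round(w / scale))) for w in weights]
    return ternary, scale


def dequantize(ternary, scale):
    """Approximate reconstruction of the original weights."""
    return [t * scale for t in ternary]


weights = [0.9, -0.05, -1.2, 0.02, 0.4, -0.7]
ternary, scale = ternarize(weights)
print(ternary)  # [1, 0, -1, 0, 1, -1]
print(round(BITS_PER_TERNARY_WEIGHT, 3))  # 1.585
```

In this toy example the two near-zero weights become 0, which is the "dropped connection" case that a 1-bit {-1, +1} scheme cannot represent.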
PrismML Launches 1.58-Bit Ternary Bonsai Models with 9x Lower Memory Use and Enhanced Intelligence
According to monitoring by Beating, PrismML launched the 1.58-bit Ternary Bonsai models on April 17, reducing memory usage to one-ninth that of 16-bit models. The 8B version runs at 27 tokens per second on the iPhone 17 Pro Max and posts an average benchmark score of 75.5. Available on Hugging Face, the models support Apple devices and are open-sourced under the Apache 2.0 license. Market observers note that this release could impact the Fear & Greed Index as adoption of lightweight AI grows.
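The one-ninth memory figure can be sanity-checked against the 1.75 GB weight file and 75.5 average score reported for the 8B model. The sketch below is rough arithmetic only, and it assumes that “8B” means exactly 8 billion weights, that 1 GB is 10^9 bytes, and that weight-file size is a fair proxy for GPU memory. None of these is stated in the report.

```python
PARAMS = 8e9            # assumed: "8B" taken as exactly 8 billion weights
BYTES_PER_GB = 1e9      # assumed: decimal gigabytes
TERNARY_FILE_GB = 1.75  # reported size of the Ternary Bonsai 8B weight file
BENCHMARK_SCORE = 75.5  # reported average benchmark score


def dense_size_gb(params, bits_per_weight):
    """Size of a model that stores every weight at a fixed bit width."""
    return params * bits_per_weight / 8 / BYTES_PER_GB


fp16_gb = dense_size_gb(PARAMS, 16)      # 16.0 GB at 16 bits per weight
ratio = fp16_gb / TERNARY_FILE_GB        # about 9.14, i.e. roughly one-ninth
effective_bits = TERNARY_FILE_GB * BYTES_PER_GB * 8 / PARAMS  # 1.75 bits/weight
density = BENCHMARK_SCORE / TERNARY_FILE_GB  # about 43.1 points per GB

print(f"{fp16_gb:.1f} GB -> {TERNARY_FILE_GB} GB ({ratio:.2f}x smaller)")
print(f"{effective_bits:.2f} bits per weight, {density:.1f} points per GB")
```

Under these assumptions the file works out to about 1.75 bits per weight, slightly above the theoretical log2(3) ≈ 1.58. The report does not say what accounts for the difference.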
Disclaimer: The information on this page may have been obtained from third parties and does not necessarily reflect the views or opinions of KuCoin. This content is provided for general informational purposes only, without any representation or warranty of any kind, nor shall it be construed as financial or investment advice. KuCoin shall not be liable for any errors or omissions, or for any outcomes resulting from the use of this information.
Investments in digital assets can be risky. Please carefully evaluate the risks of a product and your risk tolerance based on your own financial circumstances. For more information, please refer to our Terms of Use and Risk Disclosure.