AIMPACT Update, May 19 (UTC+8): According to monitoring by Beating, Sapient Intelligence has open-sourced HRM-Text, a 1-billion-parameter (1B) text generation foundational model. This is a purely pre-trained model based on the Hierarchical Reasoning Model (HRM) architecture. By introducing latent space reasoning at the base layer of the architecture, it reduces pre-training computational costs by 130 to 600 times compared to conventional models. Specifically, HRM-Text achieved pre-training using only 40 billion (40B) structured tokens—approximately one-thousandth the data volume of comparable standard models. Official benchmarks show that training the 1B version from scratch requires approximately 46 hours on two 8-GPU H100 servers, with a computational cost of about $1,472; the 0.6B version can be trained on a single node in 50 hours at a hardware cost of roughly $800. The complete engineering framework, including data extraction, sequence packing, and PyTorch distributed training, has been simultaneously open-sourced. The dramatic cost reduction is enabled by a unique dual-timescale recurrent design: the model incorporates two sets of Transformer modules—one fast (lower layer) and one slow (higher layer)—that iteratively alternate on the same input batch and exchange information via state addition. This design allows the model to dynamically extend computational depth by increasing recurrence steps while keeping the total number of physical parameters fixed. The steep drop in pre-training barriers has revived opportunities for low-cost validation of previously abandoned model theories due to prohibitive computational costs. Note that only unaligned, purely pre-trained weights are being released; the model can only perform prefix continuation tasks and cannot be used directly as a question-answering assistant. (Source: BlockBeats)
Sapient Open-Sources 1B-Parameter HRM-Text Model with 1,300x Lower Training Cost
KuCoinFlashShare






On May 19 (UTC+8), Sapient Intelligence released on-chain news announcing the open-sourcing of HRM-Text, a 1-billion-parameter text model featuring a hierarchical reasoning architecture. The model leverages latent space reasoning to reduce pre-training costs by 130 to 600 times. It was trained on 40 billion structured tokens—only a fraction of the data used by comparable models. The 1B version can be trained in 46 hours for $1,472 using two 8-GPU H100 servers, while the 0.6B version costs $800. The full engineering framework, including data extraction and PyTorch training, is also open-sourced. Inflation data and training efficiency are key highlights of this release.
Source:Show original
Disclaimer: The information on this page may have been obtained from third parties and does not necessarily reflect the views or opinions of KuCoin. This content is provided for general informational purposes only, without any representation or warranty of any kind, nor shall it be construed as financial or investment advice. KuCoin shall not be liable for any errors or omissions, or for any outcomes resulting from the use of this information.
Investments in digital assets can be risky. Please carefully evaluate the risks of a product and your risk tolerance based on your own financial circumstances. For more information, please refer to our Terms of Use and Risk Disclosure.