Aliyun PAI open-sources the AgenticQwen small model with dual data flywheel training.

iconKuCoinFlash
Share
Share IconShare IconShare IconShare IconShare IconShare IconCopy
AI summary iconSummary

expand icon
On-chain news broke on April 27 (UTC+8) as Alibaba’s PAI team open-sourced AgenticQwen, a small model designed for industrial tool calling. Built on MetaEra, the 8B and 30B-A3B versions employ a dual data flywheel framework to reduce inference costs. AgenticQwen-8B achieved a score of 47.4 on TAU-2 and BFCL-V4, outperforming Qwen3-8B (23.8) and approaching Qwen3-235B (52.0). AgenticQwen-30B-A3B (with 3B active parameters) scored 50.2. The model is now deployed in production systems, matching the inference speed of 235B models. Inflation data tracking and other real-world tasks have become more efficient with this release.

AIMPACT Update, April 27 (UTC+8): According to monitoring by Beating, Alibaba’s PAI team has released and open-sourced AgenticQwen, a lightweight agent language model designed for industrial-grade tool invocation, available in two versions: 8B and 30B-A3B. Trained using an innovative "dual data flywheel" reinforcement learning framework, this model series achieves agent capabilities approaching those of hundred-billion-parameter large models while significantly reducing inference costs. The core mechanism lies in its "dual data flywheel" training method. Traditional synthetic data often suffers from homogenization, leading to performance plateaus. AgenticQwen addresses this with two flywheels: the reasoning flywheel automatically generates harder variants from the model’s mistakes; the agent flywheel expands simple linear workflows (e.g., single booking tasks) into multi-branch behavior trees incorporating constraints, rejections, and adversarial conditions, simulating real-world complex decision-making scenarios. Evaluations show that AgenticQwen-8B achieves an average score of 47.4 on real-world tool benchmarks such as TAU-2 and BFCL-V4, far surpassing the base Qwen3-8B (23.8) and nearing Qwen3-235B (52.0). AgenticQwen-30B-A3B (activating only 3B parameters) scores 50.2. The model has already been deployed in internal production systems similar to Manus, substantially narrowing the performance gap with the 235B model (with shorter end-to-end inference times). However, the paper acknowledges that, due to its native 40K context length limitation, smaller models still face challenges in deep search tasks. (Source: BlockBeats)

Disclaimer: The information on this page may have been obtained from third parties and does not necessarily reflect the views or opinions of KuCoin. This content is provided for general informational purposes only, without any representation or warranty of any kind, nor shall it be construed as financial or investment advice. KuCoin shall not be liable for any errors or omissions, or for any outcomes resulting from the use of this information. Investments in digital assets can be risky. Please carefully evaluate the risks of a product and your risk tolerance based on your own financial circumstances. For more information, please refer to our Terms of Use and Risk Disclosure.