OpenRouter Data Shows 61% of Token Consumption by Chinese AI Models

iconCryptoBriefing
Share
Share IconShare IconShare IconShare IconShare IconShare IconCopy
AI summary iconSummary

expand icon
AI + crypto news platforms like OpenRouter show Chinese-developed language models now using 61% of tokens among the top 10 models. Their share rose from under 1.2% in late 2024 to 51% by April 2026. Leading models include Qwen, DeepSeek, Kimi, GLM, and MiniMax. Developers point to cost efficiency and open-weight flexibility as main reasons. Inflation data and token usage trends highlight the rising role of Chinese AI in global infrastructure.

Here’s a number that should make Silicon Valley uncomfortable: Chinese-developed language models now account for roughly 61% of token consumption among the top 10 models on OpenRouter, the popular API routing platform used by thousands of AI startups. That’s not a typo. The majority of compute flowing through one of America’s key AI infrastructure layers is being routed to models built in Beijing, Hangzhou, and Shenzhen.

The shift didn’t happen overnight, but it accelerated fast. In late 2024, Chinese open-weight models represented less than 1.2% of weekly token consumption on OpenRouter. By 2025, that figure had surged to nearly 30% at peak moments, averaging about 13% for the full year. By April 2026, select snapshots showed Chinese provider traffic hitting 51% of all tokens processed on the platform.

Advertisement

The models driving the shift

The names leading this migration are becoming familiar to anyone paying attention to the global AI race. Alibaba’s Qwen series, DeepSeek’s family of models, Moonshot AI’s Kimi, Zhipu AI’s GLM, and MiniMax have all carved out significant presence on OpenRouter’s leaderboard.

The share of programming and agentic workloads on OpenRouter climbed from about 11% to over 50% through 2025. American startups aren’t just experimenting with Chinese models for chatbot demos. They’re building their core products on them, with code generation and autonomous agents increasingly being handled by models developed outside the US.

Why developers are making the switch

The explanation is deceptively simple: cost and capability. Chinese model providers have aggressively priced their APIs, often dramatically undercutting their American counterparts. The open-weight nature of many Chinese models adds another layer. Startups can inspect, fine-tune, and deploy these models with fewer restrictions than they face with proprietary US alternatives.

What this means for investors

For the Chinese companies themselves, particularly Alibaba, Qwen’s presence on OpenRouter means it’s being integrated into American software products. The geopolitical dimension can’t be ignored either. US policymakers have spent the last two years tightening export controls on advanced chips headed to China. The OpenRouter data suggests a different outcome: Chinese labs have focused intensely on efficiency, producing models that deliver competitive performance at lower computational cost.

Disclaimer: The information on this page may have been obtained from third parties and does not necessarily reflect the views or opinions of KuCoin. This content is provided for general informational purposes only, without any representation or warranty of any kind, nor shall it be construed as financial or investment advice. KuCoin shall not be liable for any errors or omissions, or for any outcomes resulting from the use of this information. Investments in digital assets can be risky. Please carefully evaluate the risks of a product and your risk tolerance based on your own financial circumstances. For more information, please refer to our Terms of Use and Risk Disclosure.