StepFun’s Step 3.7 Flash achieved 409 tokens/s, leading Artificial Analysis’ Output Speed ranking. It also ranks first in end-to-end response time, intelligence-to-output speed ratio, and speed-to-price ratio. The model rose to second place on OpenRouter Trending and received acclaim for its efficiency, multimodal understanding, and tool calling. Altcoins to watch may include projects leveraging such high-performance models. Price analysis indicates strong interest in AI-driven infrastructure.
ME AI message: According to the latest Output Speed ranking from Artificial Analysis, the globally respected large model evaluation platform, StepFun’s newly open-sourced base model, Step 3.7 Flash, leads all mainstream models with an output speed of 409 tokens/s. It also holds a leading position across key metrics including End-to-End Response Time, Intelligence vs. Output Speed, and Output Speed vs. Price.
From an industry perspective, competition among large models is shifting from isolated capability benchmarks to real-world Agent task efficiency. In complete task pipelines involving browsing, retrieval, document understanding, interface analysis, and tool invocation, models are no longer merely answering questions—they function as continuous execution engines. As a result, end-to-end latency, throughput capacity, and cost structure have become critical constraints. Therefore, a systemic balance of higher throughput, lower latency, and superior cost efficiency is emerging as the foundational requirement for scalable Agent deployment.
In this context, Step 3.7 Flash outperforms comparable models across multiple dimensions—including Intelligence vs. Output Speed, End-to-End Response Time, and Output Speed vs. Price—achieving coordinated optimization of intelligence, speed, and cost. This provides essential foundational support for Agent systems requiring high-frequency invocation, continuous operation, and scalable deployment. This trend further confirms that the core competitiveness in the Agent era is shifting from peak model capability to real-world task completion efficiency—essentially, a systemic balance among speed, intelligence, and cost.
Meanwhile, since its release, Step 3.7 Flash has risen to second place on OpenRouter’s Trending list and has become one of the most closely watched open-source models in the global developer community. Developer feedback highlights its outstanding performance in runtime efficiency, multimodal understanding, and Agent tool invocation capabilities. Some developers have compared it with similar models like DeepSeek V4 Flash and concluded that it offers clear advantages in speed and response experience.
Overall, Step 3.7 Flash’s dual recognition in authoritative evaluations and the developer community not only validates its engineering strengths in high throughput and low latency but also reflects how Chinese open-source models are rapidly integrating into the global developer ecosystem and securing a more prominent role in the next phase of Agent infrastructure competition.
(Source: Ifnar)
Disclaimer: The information on this page may have been obtained from third parties and does not necessarily reflect the views or opinions of KuCoin. This content is provided for general informational purposes only, without any representation or warranty of any kind, nor shall it be construed as financial or investment advice. KuCoin shall not be liable for any errors or omissions, or for any outcomes resulting from the use of this information.
Investments in digital assets can be risky. Please carefully evaluate the risks of a product and your risk tolerance based on your own financial circumstances. For more information, please refer to our Terms of Use and Risk Disclosure.