2026 OpenRouter Weekly Token Rankings
Billing Data Does Not Lie

28.9T weekly throughput · China-US shift · DeepSeek matrix · Token vs dollar share

2026 OpenRouter weekly token rankings and billing market structure
If your team picks models from MMLU scores but monthly API bills tell a different story, OpenRouter weekly token rankings offer a more honest signal. They track 7-day rolling real throughput, not vendor lab claims. This guide is for API routing owners and AI platform leads. Using the May 18–24, 2026 window, you get: (1) billing vs benchmark trust comparison; (2) 28.9T global weekly volume and China-US share; (3) that week's Top 10 plus the DeepSeek model matrix; (4) the token share vs dollar revenue split; (5) a six-step weekly tracking runbook; (6) three citable data points and host recommendations.
01

Benchmark leaderboards vs billing throughput: which reflects real AI adoption?

Conclusion first: for production routing, weekly billing beats static benchmarks. OpenRouter aggregates 300+ models from 60+ providers, serves 8M+ users, and processes roughly 100T tokens per month. Its leaderboard ranks by 7-day rolling input+output tokens—actual paid usage, not self-reported scores.

01

Benchmark blind spot: High-scoring models with unstable APIs or extreme pricing lose traffic fast. Leaderboards cannot capture that migration.

02

Billing honesty: Every token maps to compute and spend. Throughput is the market's thermometer for adoption.

03

Agent-era shift: OpenRouter and a16z's 2025 AI Usage Report (100T anonymized tokens) found benchmark scores and market share are nearly inversely correlated. Teams optimize for cost and API stability.

04

Use-case mix: Coding jumped from ~11% of traffic in early 2025 to over 50%—the largest single category. That explains DeepSeek's weekly dominance.

Platform weekly volume grew from ~2.4T tokens a year ago to 28.9T in the May 18–24 window—a roughly 12x annual surge. Weekly observation windows matter more than ever.

02

How to read OpenRouter weekly stats: decoding 28.9T for May 18–24

At openrouter.ai/rankings, four dimensions matter: weekly token total, per-model rank, provider market share, and dollar revenue share vs token share. The last pair exposes pricing-driven "dual truth." Summary for the latest complete week:

MetricValueWoWRead
Global weekly tokens28.9T+7.4%Fifth consecutive weekly rise
China-origin models9.223T+19.89%Outpaces global average
US-origin models4.93T+16.27%Growing in absolute terms, losing share
China vs US rankChina #1 for 4 weeksFirst surpassed US in Feb 2026
TimelineChina model traffic shareNote
Early 2025< 2%Negligible
Feb 2026First to surpass USInflection point
May 2026~45%+Fourth week at #1

Token throughput has graduated from a technical metric to a commercial barometer—investors, builders, and media now vote on the same weekly chart.

03

May 18–24 Top 10: how DeepSeek's three-model matrix took the lead

Three DeepSeek variants landed in the top nine. Combined series volume hit 5.74T tokens (+25.9% WoW), beating Anthropic and Google for the second straight week at provider level.

#ModelVendorWeekly tokensWoWRole
1DeepSeek-V4-FlashDeepSeek3.43T+66%Agent default, ultra-low price
2Tencent Hy3 PreviewTencent3.07T+16%Post-free-tier growth
3Claude Sonnet 4.6Anthropic1.35T1M context, enterprise coding
4DeepSeek-V3.2DeepSeek1.31TLow-cost long tail
5Owl AlphaOpenRouter1.15T+29%Free Agent-specialized
6Gemini 3 Flash PreviewGoogle1.06TMultimodal, academic
7DeepSeek-V4-ProDeepSeek1.00TFlagship (5.74T series total)
8MiniMax M2.7MiniMax806BLong-context value
9Grok 4.1 FastxAI721B2M context, legal workflows
10Step 3.5 FlashStepFun673BFast batch processing

Three tiers emerge: high-value / low-volume (Claude Opus for complex enterprise reasoning); mid-cost / mid-volume (Gemini Flash for multimodal); ultra-low-cost / high-volume (DeepSeek, MiniMax, StepFun for agents and batch jobs). Anthropic's premium paradox: ~12% token share (down from 25% a year ago) but ~46% dollar revenue share. Claude Opus 4.6 alone drives ~$25M/month while moving a fraction of DeepSeek's tokens.

Note: Kimi K2.6 dropped out of the top 10 after ranking #6 prior week. V4-Pro volume derived from 5.74T series total minus V4-Flash and V3.2. Cross-checked against OpenRouter public data and May 25, 2026 press coverage.

04

Six-step runbook: track OpenRouter weekly rankings and adjust routing

01

Fixed cadence: Every Monday, open openrouter.ai/rankings, screenshot 7-day ranks and provider shares, archive internally.

02

Reconcile your bills: Export OpenRouter or vendor invoices. If your token mix diverges sharply from global weekly ranks, routing may be stale.

03

Route by task tier: Agents and batch jobs to DeepSeek-V4-Flash; complex enterprise reasoning to Claude Opus; multimodal to Gemini Flash.

04

Watch new entrants: Hy3 Preview and Owl Alpha surges often precede the next default model. Run 5% shadow traffic A/B tests.

05

Split token vs revenue share: High-token / low-revenue models scale cheaply; high-revenue models belong on critical paths.

06

Bind a stable host: Routing logic fails if laptops sleep through OAuth refresh or choke on parallel dev servers. Put Gateways on 24/7 cloud Mac hosts and bake weekly reviews into SOP.

05

Three citable data points behind the weekly chart

A

12x annual growth: Weekly platform volume rose from ~2.4T to 28.9T. At a reported 26x PS valuation, the weekly chart is now a core investor signal for AI commercialization.

B

Coding dominates: Coding exceeds 50% of OpenRouter traffic (vs ~11% in early 2025), explaining V4-Flash's 3.43T weekly crown—agents prize unit economics over peak reasoning scores.

C

China-US reversal speed: China-origin share climbed from <2% to ~45%+ in under 18 months—open, ultra-low-cost APIs are reshaping global call patterns.

Caution: Weekly figures fluctuate daily. This article uses data through 2026-05-24. Free models like Owl Alpha suit prototypes; review privacy terms before production.

Running multi-model agent routing on a personal Mac introduces sleep disconnects, memory pressure from parallel dev servers, and OAuth refresh failures. VPS hosts lack native Apple Silicon for Xcode and iOS CI. For 24/7 Gateway uptime, parallel dev servers, and multi-region API routing, MESHLAUNCH cloud Mac Mini rental is usually the better production choice: dedicated Apple Silicon, flexible daily/weekly/monthly terms, closing the loop with weekly OpenRouter reviews.

FAQ

Benchmarks test ceilings; weekly ranks track paid throughput. Use both, but follow billing for market direction. See our pricing page for Agent host options.

V4-Flash as default agent router; V4-Pro for flagship coding; V3.2 for low-cost long tail. The 5.74T series total can guide API key quota allocation.

Review every Monday against your invoices; run 5% shadow traffic within seven days of major model launches. Host issues: help center.