DeepSeek, Xiaomi Just Made Frontier AI 99% Cheaper. American Labs Went the Other Way

By Decrypt
May 28, 2026

Quick explainer for the non-developers in the room: When you use ChatGPT or Claude in a browser, you're paying a flat subscription—or nothing. When a company builds a product on top of an AI model, they pay per token, where a token is roughly three-quarters of a word. Every message sent, every reply generated, every document processed: all of it adds up at a rate measured in millions of tokens.

An API is the raw pipe that makes this possible: it lets an app, an agent, or a website call the model from its own environment. Token pricing therefore determines whether an AI-powered product is economically viable or a money pit.

Token plans are a subscription wrapper on top of that. You buy credits upfront; the model eats through them. Xiaomi's billing upgrade gives users 5 to 8 times more tokens at the same price. The Max plan at $100 now gets you 82 billion tokens, up from 1.6 billion.

For context, 82 billion tokens is more than 60 billion words.
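The conversion above can be checked with a few lines of Python. This is a minimal sketch that assumes the article's rule of thumb of roughly 0.75 words per token; the function names are illustrative, and the real ratio varies by tokenizer and language.

```python
# Rule of thumb quoted in the article: a token is roughly three-quarters of a word.
# This is an approximation, not a property of any specific tokenizer.
WORDS_PER_TOKEN = 0.75


def tokens_to_words(tokens: float) -> float:
    """Estimate how many words a given number of tokens represents."""
    return tokens * WORDS_PER_TOKEN


def plan_usd_per_million_tokens(plan_price_usd: float, plan_tokens: float) -> float:
    """Effective price per million tokens for a prepaid token plan."""
    return plan_price_usd / (plan_tokens / 1_000_000)


# Figures stated in the article for the Max plan.
max_plan_price_usd = 100
max_plan_tokens = 82_000_000_000

words = tokens_to_words(max_plan_tokens)
print(f"{words / 1e9:.1f} billion words")  # 61.5 billion words

rate = plan_usd_per_million_tokens(max_plan_price_usd, max_plan_tokens)
print(f"${rate:.4f} per million tokens")
```

The second figure is the effective plan rate implied by the stated price and token allowance, which is separate from the pay-as-you-go API prices.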

Why the cuts are real, not marketing

Fuli Luo, head of Xiaomi's MiMo team and a former core DeepSeek developer who co-built DeepSeek-V2, published a technical explanation on X. The biggest savings come from a smarter way of storing and reusing information the AI has already processed. Instead of repeatedly doing the same work, Xiaomi’s system can remember much more data at once—about five times more than before. That means the AI needs far less computing power, cutting storage and processing costs by around 80%.

Behind the MiMo API Price Reduction: The deepest price cut, up to 99%, is for Input (Cache Hit). The core reason is our inference framework now supports hierarchical KV cache optimization for SWA. Production inference engine tests show this optimization increases cached token…

“Operating at these newly reduced API prices, our production inference engine is running at near full capacity, and we can still essentially break even,” Luo wrote. “If more architectures that save compute and KV [key-value] cache emerge, along with better inference Infra to drive down API costs, this will form an excellent virtuous cycle in the industry.”

The result is a model that is 98% cheaper than GPT-5.5 Pro with competitive performance.
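To make the idea of reusing already-processed input concrete, here is a toy sketch of prefix reuse. It is not Xiaomi's hierarchical KV cache for SWA, whose details the article does not give. It only assumes that a request can reuse work for the longest leading run of tokens it shares with an earlier request; the class name `ToyPrefixCache` and the sample prompts are hypothetical.

```python
def shared_prefix_len(a: list[str], b: list[str]) -> int:
    """Number of leading tokens that two sequences have in common."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


class ToyPrefixCache:
    """Remembers earlier requests and reports how much of a new one is reusable.

    A real inference engine stores key-value tensors, not token lists;
    this toy only counts which tokens would be hits versus misses.
    """

    def __init__(self) -> None:
        self._seen: list[list[str]] = []

    def process(self, tokens: list[str]) -> tuple[int, int]:
        """Return (cache_hit_tokens, cache_miss_tokens) for this request."""
        hits = max((shared_prefix_len(tokens, prev) for prev in self._seen), default=0)
        self._seen.append(list(tokens))
        return hits, len(tokens) - hits


# Hypothetical agent with a stable system prompt and a changing final token.
system_prompt = ["You", "are", "a", "helpful", "assistant", "."]
cache = ToyPrefixCache()

first = cache.process(system_prompt + ["Summarize", "doc", "A"])
second = cache.process(system_prompt + ["Summarize", "doc", "B"])
print(first)   # (0, 9): nothing cached yet
print(second)  # (8, 1): only the final token is new
```

In this toy, almost the whole second request is a cache hit, which is the category of input the quoted post says received the deepest price cut.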

Silicon Valley’s bet

DeepSeek V4-Pro is a 1.6-trillion-parameter model that gives you the knowledge base of a massive model at a fraction of the compute cost. It now permanently runs at $0.435 input and $0.87 output per million tokens. That's a model that scored 80.6% on SWE-bench Verified against Claude Opus 4.6's 80.8%, on a benchmark that measures real GitHub issue resolution, not cherry-picked demos. The pricing gap between two models with essentially the same coding score: 34x on output.
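A short sketch shows how per-million-token prices turn into a bill. The two prices are the ones stated above for DeepSeek V4-Pro; the workload (one million requests a month, each with 2,000 input and 500 output tokens) is a made-up assumption for illustration only.

```python
# Prices stated in the article, in USD per million tokens.
INPUT_USD_PER_M = 0.435
OUTPUT_USD_PER_M = 0.87


def request_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Cost of one API call, billed separately for input and output tokens."""
    return (
        input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    ) / 1_000_000


# Hypothetical workload, not taken from the article.
requests_per_month = 1_000_000
input_tokens_per_request = 2_000
output_tokens_per_request = 500

per_request = request_cost_usd(input_tokens_per_request, output_tokens_per_request)
monthly = per_request * requests_per_month
print(f"per request: ${per_request:.6f}")
print(f"per month:   ${monthly:,.2f}")
```

Under these assumptions the monthly bill comes to roughly $1,305. Multiplying the output price by a competitor's price ratio shows how quickly the same workload grows at higher rates.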

DeepSeek and Xiaomi aren't alone

Kimi K2.5 from Moonshot AI, with 76.8% on SWE-bench Verified, runs $0.60 input and $2.50 output. GLM-5.1 from Z.AI beat Claude Opus 4.6 on a key coding benchmark earlier this quarter. Four Chinese frontier models shipped in a 12-day window in early May, all under one-third of Opus 4.7's per-token cost.

For a visual comparison, this chart shows how Chinese models stack up against the three most popular American AI providers (Anthropic, OpenAI, and Meta) on price-to-quality ratio.

Image: Artificialanalysis.ai

The Q2 2026 price gap between Chinese and American frontier models sits at 15–30x, depending on which models you compare. That is the baseline, before any cache discounts.

What this week's cuts do is collapse that gap further for the specific workloads that actually run in production: agent pipelines with stable system prompts, document processors, retrieval tools, things that hit cache constantly. At $0.003625 per million cached input tokens, DeepSeek V4-Pro's cost for repeated context is functionally a rounding error.
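The effect of cache hits on input cost can be sketched as a weighted average. The sketch assumes that the $0.435 figure quoted earlier is the cache-miss input price and that $0.003625 is the cache-hit price; the hit ratios below are hypothetical.

```python
# USD per million input tokens, as quoted in the article.
# Assumption: $0.435 applies to cache misses, $0.003625 to cache hits.
CACHE_MISS_USD_PER_M = 0.435
CACHE_HIT_USD_PER_M = 0.003625


def blended_input_price(hit_ratio: float) -> float:
    """Average input price per million tokens for a given cache hit ratio."""
    if not 0.0 <= hit_ratio <= 1.0:
        raise ValueError("hit_ratio must be between 0 and 1")
    return hit_ratio * CACHE_HIT_USD_PER_M + (1.0 - hit_ratio) * CACHE_MISS_USD_PER_M


# Hypothetical hit ratios for workloads with increasingly stable prompts.
for ratio in (0.0, 0.5, 0.9, 0.99):
    print(f"hit ratio {ratio:.0%}: ${blended_input_price(ratio):.6f} per million input tokens")

# How many times cheaper a cached token is than an uncached one.
print(f"miss/hit price ratio: {CACHE_MISS_USD_PER_M / CACHE_HIT_USD_PER_M:.0f}x")
```

At the quoted prices a cached input token costs about 1/120th of an uncached one, so a workload that hits cache 90% of the time pays roughly a tenth of the list input price.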

Disclaimer: The information on this page may have been obtained from third parties and does not necessarily reflect the views or opinions of BitKan. This content is provided for general informational purposes only, without any representation or warranty of any kind, nor shall it be construed as financial or investment advice. BitKan shall not be liable for any errors or omissions, or for any outcomes resulting from the use of this information. Investments in digital assets can be risky. Please carefully evaluate the risks of a product and your risk tolerance based on your own financial circumstances. Products mentioned in this article may not be available in your region.
