logo
  • menu
  • Markets
  • ETFs
  • Live
  • Spot
  • Futures
  • Learn
  • Sign In
  • Sign Up
  • Downloads
  • English
  • |
  • USD
  • |
Sign Up
Crypto PricesLearnLatest NewsDownloadsMarketsSpotAnnouncements
Home/
Latest News/
Live

Google's DiffusionGemma AI Hits 1,000 Tokens Per Second—And It's Free

By Decrypt
Jun 11, 2026
4.2 
★
★
★
★
★
★
★
★
★
★
 107 User Rating
Share

Google says so themselves. This is a speed model, not a quality upgrade.

What this actually does

Every LLM you've used is a typewriter. One token at a time with each word dependent on the last. That's how autoregressive architectures work.

The side effect is bidirectional attention—every token can see every other token while being generated, which is impossible in autoregressive models (they cannot see the future, what is going to be encoded). That makes it unusually good at tasks where the end of the answer constrains the beginning: code infilling, structured output, constraint-heavy problems, etc. Google fine-tuned a version to solve Sudoku as a demo. The base model got roughly 0% of puzzles right.

The fine-tuned version hit 80%.

But none of that was open-weight, and none of it came with day-zero support in vLLM, Hugging Face Transformers, and Unsloth. DiffusionGemma is the first major open release from a tier-one lab.

There's also a historical irony worth noting. Image generators started as diffusion models (hence the name Stable Diffusion) and are now moving toward autoregressive architectures for better quality. Language models started as autoregressive and are now experimenting with diffusion for speed.

Why it’s a pain to run… for now

The problem: DiffusionGemma needs a specific drafter to run locally via MLX—Apple's machine learning framework for Apple Silicon. That module doesn't exist in any public version of mlx-lm, in any open pull request, or in LM Studio's bundled runtime.

We tried running DiffusionGemma with Hermes through NVIDIA NIM. The model loaded, but then: "agent init failed: Model google/diffusiongemma-26b-a4b-it has a context window of 8,192 tokens, which is below the minimum 64,000 required by Hermes Agent."

 

To be precise: DiffusionGemma's actual context window is 256K tokens. The 8,192 figure was Nvidia messing things up by default, not the model's architectural limit.

In practice, getting it configured correctly for agentic use requires manual work that most everyday users haven't figured out yet, and Hermes Agent simply won't initialize without it. Parallel speed means nothing if the agent can't boot.

Hopefully, in the next few days, the community will produce better resources to run these models.

Who this is actually for

For researchers, bidirectional generation opens territory that autoregressive models simply can't reach—protein sequences, mathematical graphs, anything where position N depends on position N+50. That's not a small thing.

On a machine with a capable discrete GPU, 1,000 tokens per second is real.

Disclaimer: The information on this page may have been obtained from third parties and does not necessarily reflect the views or opinions of BitKan. This content is provided for general informational purposes only, without any representation or warranty of any kind, nor shall it be construed as financial or investment advice. BitKan shall not be liable for any errors or omissions, or for any outcomes resulting from the use of this information. Investments in digital assets can be risky. Please carefully evaluate the risks of a product and your risk tolerance based on your own financial circumstances. Products mentioned in this article may not be available in your region.

Latest News

Industry

Cryptocurrency

Airdrop

Markets

  • VerifiedX Launches Bitcoin Sidechain for Native DeFi Privacy

    VerifiedX Launches Bitcoin Sidechain for Native DeFi Privacy

    VerifiedX has officially introduced a decentralized "reliever chain" designed to bring programmable, privacy-preserving functionality to the Bitcoin network.
    Martha Grizzard
    May 18, 2026
  • Japan’s SBI and Rakuten Plan Crypto Trusts as Rules Finalize

    Japan’s SBI and Rakuten Plan Crypto Trusts as Rules Finalize

    SBI Securities and Rakuten Securities have officially announced plans to introduce cryptocurrency investment trusts to their massive retail user bases.
    Craig Green
    May 18, 2026
  • Senate Advances CLARITY Act: A New Era for U.S. Crypto Oversight

    Senate Advances CLARITY Act: A New Era for U.S. Crypto Oversight

    The Senate Banking Committee advanced the CLARITY Act on May 14, 2026 to establish a comprehensive federal framework for the digital asset industry.
    May 15, 2026
  • TRC20-USDT Circulation Soars to 89.3 Billion Record on TRON

    TRC20-USDT Circulation Soars to 89.3 Billion Record on TRON

    The circulation of TRC20-USDT has officially ascended to a historic peak of 89.3 billion tokens, fundamentally expanding the liquidity threshold of the decentralized financial landscape.
    Hallie Gill
    May 12, 2026
  • 21Shares Debuts First Canton Network ETF (TCAN) on Nasdaq

    21Shares Debuts First Canton Network ETF (TCAN) on Nasdaq

    The TCAN ETF provides the first U.S.-listed gateway to Canton Coin (CC), the native utility token of the Canton Network.
    Martha Grizzard
    May 8, 2026
View more data 
BTCBTC(BTC)
$0
--(Last 24h)
SpotFutures

Top

View more
  1. 1S&P 500 Reclaims 200-Day Moving Average, Bitcoin Gains
  2. 2Trump Softens His Stance on Reciprocal Tariffs, US Stocks and Crypto Markets Rise
  3. 3Vitalik Buterin : The current price of ETH has not been affected by the merger event
  4. 4Vibhu Norby : Solana Spaces store to bring 100K people to Solana per month
  5. 5CZ: compared with the record high nine months ago, the current situation of the industry is much better

Top Gainers

View more
DeepNode
DeepNodeDN

$0.8901

+145.05%
UNC
UNCUNC

$0.001763

+104.05%
Audiera
AudieraBEAT

$8.5516

+56.68%
Hamster Kombat
Hamster KombatHMSTR

$0.000276

+39.82%
Yei Finance
Yei FinanceCLO

$0.1493

+36.39%

Top Trending

View more
Alephium
AlephiumALPH

$0.0336

-2.30%
Curve DAO
Curve DAOCRV

$0.2488

+21.66%
Velvet
VelvetVELVET

$0.7902

+110.50%
Livepeer
LivepeerLPT

$1.7310

-0.63%
Audiera
AudieraBEAT

$8.5494

+56.64%

Recently added

View more
Jotchua
JotchuaJOTCHUA

$0.003797

-39.27%
Kinetiq
KinetiqKNTQ

$0.2112

-4.13%
Citrea
CitreaCTR

$0.0129

-3.50%
Solstice
SolsticeSLX

$0.1774

-8.32%
Nexus
NexusNEX

$0.00000332

+12.47%

Learn

View more
  1. 1What is the MSX X Card? Understanding the New Crypto Card
  2. 2How Does The SpaceX IPO Impact Crypto? Are Traders Selling Bitcoin for SpaceX?
  3. 3What is Bitwise Hyperliquid ETF? How Does BHYP Work?
  4. 4What is PaperTrade on HyperEVM? Is Zero Funding Real?
  5. 5What Is Circle Arc? How Does the New USDC Blockchain Work?
About Us
  • About BitKan
  • Contact Us
  • Announcements
  • VIP Program
  • BitKan Ambassador
  • Institutional Services
Products
  • Spot
  • Futures
  • Crypto Prices
  • Learn
  • News
  • Markets
  • How to Buy Crypto
  • BTC to USD Calculator
  • Reward
Help
  • Help Center
  • Email Us
  • Live Chat
  • Download APP
  • Listing Application
  • Buy Bitcoin
  • Buy Ethereum
  • Buy Dogecoin
  • Buy Altcoins
Terms
  • Terms of Use
  • Privacy Policy
  • Trading Rules
  • Fee
K-Site
English
About Us
+
  • About BitKan
  • Contact Us
  • Announcements
  • VIP Program
  • BitKan Ambassador
  • Institutional Services
Products
+
  • Spot
  • Futures
  • Crypto Prices
  • Learn
  • News
  • Markets
  • How to Buy Crypto
  • BTC to USD Calculator
  • Reward
Help
+
  • Help Center
  • Email Us
  • Live Chat
  • Download APP
  • Listing Application
  • Buy Bitcoin
  • Buy Ethereum
  • Buy Dogecoin
  • Buy Altcoins
Terms
+
  • Terms of Use
  • Privacy Policy
  • Trading Rules
  • Fee
K-Site
+
  • Twitter
  • Facebook
  • Telegram
  • YouTube
  • Instagram
  • Medium
  • Linkedin
@2012-2026 BITKAN.com