Tether's Medical AI Runs on Your Phone and Outperforms Models 16x Its Size

May 8, 2026

4.3

★

279 User Rating

The headline number: a tiny 1.7 billion-parameter model capable of beating Google's MedGemma-4B on medical benchmarks despite being less than half its size. On HealthBench Hard—OpenAI's benchmark that evaluates AI on realistic, multi-turn clinical conversations graded by 262 physicians—Tether says its 1.7 billion-parameter model outscores MedGemma-27B, a model nearly sixteen times larger.

Parameters are all the configurations and values that a model learns during trading. The more the parameters, the better the model should be, in theory.

Source: Tether

The test suite spans MedQA-USMLE, which measures clinical knowledge using US medical licensing exam-style questions scored as percentage accuracy, all the way to AfriMedQA, which tests performance specifically for underserved African healthcare contexts.

Tether CEO Paolo Ardoino credited the gains to efficiency rather than scale. "With QVAC MedPsy, our focus was improving efficiency at the model level, rather than scaling up size," he said in a statement. "Our 4 billion model exceeded results from models nearly seven times its size, while using up to three times fewer tokens per response."

That token efficiency is the other headline. The 4B model averages around 909 tokens per response versus 2,953 for comparable systems—a 3.2x reduction. Fewer tokens means lower compute cost, faster responses, and crucially, the ability to run locally without a cloud backend.

"You can run medical reasoning where the data already exists, inside a hospital system or on a device, without moving sensitive information through the cloud or waiting on external processing," Ardoino said.

The models ship as quantized GGUF files—1.2 GB for the 1.7 billion-parameter model and 2.6 GB for the 4 billion—with compressed versions retaining most benchmark performance while fitting on standard consumer hardware. That means a hospital system, rural clinic, or individual clinician could run the model entirely on-device, keeping patient records out of third-party cloud infrastructure and away from HIPAA exposure.

GrvtGRVT	$0.2688 +437.66%
Koma InuKOMA	$0.0240 +84.95%
Kekius MaximusKEKIUS	$0.005282 +61.63%
UnipegUPEG	$568.920 +56.45%
AXTAXTIB	$58.6000 +43.94%

StrategyMSTR	$92.7200 -4.38%
Sei NetworkSEI	$0.0417 -0.57%
MomentumMMT	$0.2406 +15.45%
Giggle FundGIGGLE	$36.9300 +30.82%
EnsoENSO	$0.8980 +5.77%

GrvtGRVT	$0.2687 +437.46%
Direxion Semiconductor Bear 3X ETFSOXSB	$52.3300 -16.75%
VanEck Semiconductor ETFSMHB	$547.600 +4.01%
PayPalPYPLB	$57.4800 -0.43%
Goldman SachsGSB	$1,035.01 +3.81%

Tether's Medical AI Runs on Your Phone and Outperforms Models 16x Its Size

Latest News

Industry

Cryptocurrency

Airdrop

Markets

Brazil’s CVM Launches 60-Day Sprint to Tokenize Securities

Hyperliquid Enables Permissionless Markets With HIP-4 Plan

DTCC Launches Live Tokenized Asset Trading for Wall Street

South Korea Updates Asset Law to Include Cryptocurrency

New SEC Crypto Rule to Cut Red Tape for Startup Fundraising

Top

Top Gainers

Top Trending

Recently added

Learn