Related News

Palo Alto Networks (PANW) Q2 2026 earnings

Palo Alto Networks beat Wall Street’s fiscal second-quarter estimates after the bell on Tuesday but shares fell 6% on disappointing guidance. Here’s how the company did versus LSEG estimates: Earnings

Google’s I/O developer conference to be held May 19 and 20

Google CEO Sundar Pichai addresses the crowd during Google’s annual I/O developers conference in Mountain View, California on May 20, 2025. Camille Cohen | AFP | Getty Images Alphabet announced

Anthropic releases Claude Sonnet 4.6, the new default for free and pro

SAN FRANCISCO, CALIFORNIA – SEPTEMBER 04: Anthropic Co-founder and CEO Dario Amodei speaks at the “How AI Will Transform Business in the Next 18 Months” panel during INBOUND 2025 Powered

Palantir moving headquarters from Denver to Miami

Alex Karp, Palantir CEO, joins CNBC’s ‘Squawk on the Street’ on June 5, 2025. CNBC Palantir is relocating its headquarters to Miami from Denver, the company announced Tuesday in a

Amazon has lost $450 billion in value during historic losing streak

Andy Jassy, CEO of Amazon, speaking with CNBC at the World Economic Forum in Davos, Switzerland on Jan. 20, 2026. CNBC Amazon shares whipsawed on Tuesday, as the stock attempted

Viral article warns of looming impacts of artificial intelligence

Matt Shumer joins “CBS Mornings” to discuss his now viral article, “Something Big Is Happening.” He writes that AI’s “capability for massive disruption could be here by the end of

Mixture of experts: The method behind DeepSeek’s frugal success |

Word Count: 710 | Estimated Reading Time: 4 minutes

China’s DeepSeek has pulled off an AI miracle—building a top-tier artificial intelligence model while spending far less than its American rivals. At a time when AI giants are burning billions on GPUs and power-hungry data centers, this start-up has figured out a way to do more with less.
The secret? A mix of smart engineering, a clever neural network design, and some good old-fashioned mathematical efficiency.
Big AI, Small Budget
Most AI firms stack their data centers with thousands of GPUs—Meta’s latest AI model reportedly ran on 16,000 specialized chips, each costing around $40,000. DeepSeek? Just 2,000. Their total compute cost? A mere $6 million, almost a tenth of what Meta is rumored to have spent.
The ‘Mixture of Experts’ Trick
The key to DeepSeek’s frugal success? A method called “mixture of experts.” Traditional AI models try to learn everything in one giant neural network. That’s like stuffing all knowledge into a single brain—inefficient and power-hungry.
DeepSeek, instead, split the system into specialized mini-networks—one for poetry, one for coding, another for biology, and so on. Each “expert” focused on its domain, while a “generalist” network acted as a bridge, coordinating them.
Think of it like a newsroom: specialist reporters cover specific beats, while an editor connects the dots.
The Decimal Game
If that wasn’t enough, DeepSeek also squeezed efficiency out of pure mathematics. AI models rely on mind-boggling amounts of number crunching, typically using 16-bit precision. DeepSeek? They slashed it to 8 bits—halving memory use and speeding up calculations.
Losing precision sounds risky, right? Not really. Just like rounding π to 3.14 works for most practical uses, trimming decimals didn’t hurt the AI’s performance. And when needed, DeepSeek stretched the final results back to 32-bit accuracy—giving them the best of both worlds.
Why Didn’t Others Do It?
AI giants like OpenAI and Google’s DeepMind have the brains and the budget, so why didn’t they crack this code first? Simple: risk.
Building AI models is expensive, and experimenting with new techniques can burn millions with no guarantee of success. DeepSeek took that gamble—and it paid off.
Now that they’ve published their findings, the industry is taking note. AI development just got a whole lot cheaper. The question is—who will be the next to follow suit?