Close Menu

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    iPhone Says It Cannot Activate Data? Do This

    Where Is the iPhone Made? It’s Not Just One Country

    Apple MacBook Neo Officially Launches in Malaysia From RM2,499

    Facebook X (Twitter) Instagram
    • Artificial Intelligence
    • Business Technology
    • Cryptocurrency
    • Gadgets
    • Gaming
    • Health
    • Software and Apps
    • Technology
    Facebook X (Twitter) Instagram Pinterest Vimeo
    Tech AI Verse
    • Home
    • Artificial Intelligence

      What the polls say about how Americans are using AI

      February 27, 2026

      Tensions between the Pentagon and AI giant Anthropic reach a boiling point

      February 21, 2026

      Read the extended transcript: President Donald Trump interviewed by ‘NBC Nightly News’ anchor Tom Llamas

      February 6, 2026

      Stocks and bitcoin sink as investors dump software company shares

      February 4, 2026

      AI, crypto and Trump super PACs stash millions to spend on the midterms

      February 2, 2026
    • Business

      Google releases Gemini 3.1 Flash Lite at 1/8th the cost of Pro

      March 4, 2026

      Huawei Watch GT Series

      March 4, 2026

      Weighing up the enterprise risks of neocloud providers

      March 3, 2026

      A stolen Gemini API key turned a $180 bill into $82,000 in two days

      March 3, 2026

      These ultra-budget laptops “include” 1.2TB storage, but most of it is OneDrive trial space

      March 1, 2026
    • Crypto

      Banks Respond to Kraken’s Federal Reserve Access as Trump Sides with Crypto

      March 4, 2026

      Hyperliquid and DEXs Break the Top 10 — Is the CEX Era Ending?

      March 4, 2026

      Consensus Hong Kong 2026: The Institutional Turn 

      March 4, 2026

      New Crypto Mutuum Finance (MUTM) Reports V1 Protocol Progress as Roadmap Enters Phase 3

      March 4, 2026

      Bitcoin Short Sellers Caught Off Guard in New White House Move

      March 4, 2026
    • Technology

      iPhone Says It Cannot Activate Data? Do This

      March 6, 2026

      Where Is the iPhone Made? It’s Not Just One Country

      March 6, 2026

      New free-to-play action-adventure RPG launches on Steam with 2,300 player peak and over 100 characters

      March 5, 2026

      Hisense U7SG Mini LED TVs launch with 330 Hz gaming mode, 3000 nits brightness and sizes up to 116 inches

      March 5, 2026

      Yahoo pauses IAB membership amid a series of quiet cost-saving measures

      March 5, 2026
    • Others
      • Gadgets
      • Gaming
      • Health
      • Software and Apps
    Check BMI
    Tech AI Verse
    You are at:Home»Technology»GPU-rich labs have won: What’s left for the rest of us is distillation
    Technology

    GPU-rich labs have won: What’s left for the rest of us is distillation

    TechAiVerseBy TechAiVerseAugust 9, 2025No Comments3 Mins Read4 Views
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr Email Reddit
    GPU-rich labs have won: What’s left for the rest of us is distillation
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp Email

    GPU-rich labs have won: What’s left for the rest of us is distillation

    Word on the street is that OpenAI now spends $50M+ just on LLM training a day. Working to compete on superintelligence without country-scale resources is pretty much futile. Despite this, massive training runs and powerful but expensive models means another technique is starting to dominate: distillation.

    2024 was the year of wasteful AI enterprise spending. Fortune-500 companies would spend tens of millions and proudly announce that they trained their own SOTA models, only to have them be antiquated months or weeks after their release. Large labs like OpenAI and Anthropic would immediately release a new model that would perform better at the specific tasks these companies cared about than the very models they spent so much money and time training!

    Impressively, open source models have been able to quickly catch up to big labs. This is partly because of creativity under constraint, but the dominant strategy is the quiet distillation of large proprietary models. Deepseek is the most impressive example of both of these.

    Open-source has been lagging behind proprietary models for years, but lately this gap has been widening. Here’s a look at LMArena, for example:

    Even with the “generosity” of Meta and Alibaba (Qwen), who have spent hundreds of millions just to release model weights, open source simply cannot compete with the hegemony of superintelligence labs when it comes to general intelligence. The gap in GPU wealth is difficult to fully grok. The largest labs either own or have access to 200k+ H100/H200 equivalent GPUs. The first model that takes $1B to train will not be released for free.

    2025 is the year of agents and the application layer. Through many expensive lessons, enterprises realized that training large models is a waste of time. Instead, easy wins come from building on the smallest LLM available that can solve a particular task acceptably. Most companies found that by not worrying about training, they can still serve their users and eke out margins. If an LLM can’t solve a particular task acceptably yet, it’s not the worst strategy to build what’s possible now and wait a couple of months.

    Still, LLMs prove to cut into margins more than most software, and the large models that seem to be able to solve any task also come with significant latency.

    Most applications don’t need superintelligence, instead they need low-latency models that are good enough for a task. Whether that task is data extraction, classification, or research, they need something cheap and fast enough to deploy to millions of users and maximize margins.

    Luckily, when you have superintelligent models who can perform any task well, training a smaller model is extremely easy through a process called distillation. Distillation takes the outputs of a large model and trains a smaller pre-trained model on it in a process called Supervised Fine Tuning (SFT), typically conserving 95%+ of the performance while being an order of magnitude faster/cheaper.

    Distillation is the second step after product market fit. Once you have users and significant costs, distillation can expand margins and reduce latency without impacting quality.

    The challenge with distillation is that you need experience to distill and evaluate models effectively. And once your train, it’s not trivial to deploy models.

    That’s what we are solving at Inference.net. End-to-end distillation and inference for busy founders that just want to focus on the application layer. If you have 30k+/mo in model spend, we’d love to chat.

    Share. Facebook Twitter Pinterest LinkedIn Reddit WhatsApp Telegram Email
    Previous ArticleIntel CEO calls China ties allegations “misinformation” after Trump demands resignation
    Next Article Little-known leguminous plant can increase beef production by 60% (2022)
    TechAiVerse
    • Website

    Jonathan is a tech enthusiast and the mind behind Tech AI Verse. With a passion for artificial intelligence, consumer tech, and emerging innovations, he deliver clear, insightful content to keep readers informed. From cutting-edge gadgets to AI advancements and cryptocurrency trends, Jonathan breaks down complex topics to make technology accessible to all.

    Related Posts

    iPhone Says It Cannot Activate Data? Do This

    March 6, 2026

    Where Is the iPhone Made? It’s Not Just One Country

    March 6, 2026

    New free-to-play action-adventure RPG launches on Steam with 2,300 player peak and over 100 characters

    March 5, 2026
    Leave A Reply Cancel Reply

    Top Posts

    Ping, You’ve Got Whale: AI detection system alerts ships of whales in their path

    April 22, 2025705 Views

    Lumo vs. Duck AI: Which AI is Better for Your Privacy?

    July 31, 2025290 Views

    6.7 Cummins Lifter Failure: What Years Are Affected (And Possible Fixes)

    April 14, 2025164 Views

    6 Best MagSafe Phone Grips (2025), Tested and Reviewed

    April 6, 2025124 Views
    Don't Miss
    Technology March 6, 2026

    iPhone Says It Cannot Activate Data? Do This

    iPhone Says It Cannot Activate Data? Do This If you are a reader experiencing an…

    Where Is the iPhone Made? It’s Not Just One Country

    Apple MacBook Neo Officially Launches in Malaysia From RM2,499

    New free-to-play action-adventure RPG launches on Steam with 2,300 player peak and over 100 characters

    Stay In Touch
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo

    Subscribe to Updates

    Get the latest creative news from SmartMag about art & design.

    About Us
    About Us

    Welcome to Tech AI Verse, your go-to destination for everything technology! We bring you the latest news, trends, and insights from the ever-evolving world of tech. Our coverage spans across global technology industry updates, artificial intelligence advancements, machine learning ethics, and automation innovations. Stay connected with us as we explore the limitless possibilities of technology!

    Facebook X (Twitter) Pinterest YouTube WhatsApp
    Our Picks

    iPhone Says It Cannot Activate Data? Do This

    March 6, 20261 Views

    Where Is the iPhone Made? It’s Not Just One Country

    March 6, 20262 Views

    Apple MacBook Neo Officially Launches in Malaysia From RM2,499

    March 6, 20262 Views
    Most Popular

    7 Best Kids Bikes (2025): Mountain, Balance, Pedal, Coaster

    March 13, 20250 Views

    VTOMAN FlashSpeed 1500: Plenty Of Power For All Your Gear

    March 13, 20250 Views

    Best TV Antenna of 2025

    March 13, 20250 Views
    © 2026 TechAiVerse. Designed by Divya Tech.
    • Home
    • About Us
    • Contact Us
    • Privacy Policy
    • Terms & Conditions

    Type above and press Enter to search. Press Esc to cancel.