Close Menu

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Xiaomi’s Leica Edition flagship confirmed for new global release

    Apple iPad Pro unlikely to get major update for years despite stronger-than-ever competition

    This sleek all-black Citizen Eco-Drive dress watch is 54% off right now

    Facebook X (Twitter) Instagram
    • Artificial Intelligence
    • Business Technology
    • Cryptocurrency
    • Gadgets
    • Gaming
    • Health
    • Software and Apps
    • Technology
    Facebook X (Twitter) Instagram Pinterest Vimeo
    Tech AI Verse
    • Home
    • Artificial Intelligence

      Read the extended transcript: President Donald Trump interviewed by ‘NBC Nightly News’ anchor Tom Llamas

      February 6, 2026

      Stocks and bitcoin sink as investors dump software company shares

      February 4, 2026

      AI, crypto and Trump super PACs stash millions to spend on the midterms

      February 2, 2026

      To avoid accusations of AI cheating, college students are turning to AI

      January 29, 2026

      ChatGPT can embrace authoritarian ideas after just one prompt, researchers say

      January 24, 2026
    • Business

      The HDD brand that brought you the 1.8-inch, 2.5-inch, and 3.5-inch hard drives is now back with a $19 pocket-sized personal cloud for your smartphones

      February 12, 2026

      New VoidLink malware framework targets Linux cloud servers

      January 14, 2026

      Nvidia Rubin’s rack-scale encryption signals a turning point for enterprise AI security

      January 13, 2026

      How KPMG is redefining the future of SAP consulting on a global scale

      January 10, 2026

      Top 10 cloud computing stories of 2025

      December 22, 2025
    • Crypto

      Wall Street Moves Into Prediction Markets With Election-Contract ETF Filings

      February 18, 2026

      Tectonic to Host Inaugural Quantum Summit at ETHDenver 2026 Focused on Post-Quantum Cryptography Readiness for Web3

      February 18, 2026

      Ki Young Ju Says Bitcoin May Need to Hit $55K Before True Recovery Begins

      February 18, 2026

      MYX Finance Is Oversold For The First Time Ever, Yet No Relief In Sight

      February 18, 2026

      Everyone is Talking about the SaaSpocalypse, But Why Does it matter for Crypto?

      February 18, 2026
    • Technology

      Xiaomi’s Leica Edition flagship confirmed for new global release

      February 18, 2026

      Apple iPad Pro unlikely to get major update for years despite stronger-than-ever competition

      February 18, 2026

      This sleek all-black Citizen Eco-Drive dress watch is 54% off right now

      February 18, 2026

      Google’s new smartphone confirmed to launch globally with old Tensor G4 silicon on eve of release

      February 18, 2026

      Nintendo’s VR accessory for the Switch 2 is finally available

      February 18, 2026
    • Others
      • Gadgets
      • Gaming
      • Health
      • Software and Apps
    Check BMI
    Tech AI Verse
    You are at:Home»Technology»Meta Llama 4 Benchmarking Confusion: How Good Are the New AI Models?
    Technology

    Meta Llama 4 Benchmarking Confusion: How Good Are the New AI Models?

    TechAiVerseBy TechAiVerseApril 9, 2025No Comments4 Mins Read3 Views
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr Email Reddit
    Meta Llama 4 Benchmarking Confusion: How Good Are the New AI Models?
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp Email

    Meta Llama 4 Benchmarking Confusion: How Good Are the New AI Models?

    Meta’s Llama 4 models Maverick and Scout are out now, but they might not be the best models on the market.

    Katelyn Chedraoui Writer I

    Katelyn is a writer with CNET covering social media, AI and online services. She graduated from the University of North Carolina at Chapel Hill with a degree in media and journalism. You can often find her with a novel and an iced coffee during her time off.

    There are two new AI models joining the slew of options, thanks to new releases from Meta. The company unveiled its latest family of generative AI models on Saturday, named Llama 4. You can give the Llama 4 models a test drive now through Meta AI’s website, and Llama 4 will soon power the many Meta AI features on the company’s Instagram, WhatsApp and Messenger services.

    The competition between Meta and other AI companies is becoming increasingly intense. Companies are working to build and release AI models capable of more complex tasks and advanced reasoning without requiring vast amounts of computing power and cash to run. It’s a tricky sweet spot to hit, and Meta hopes its newest models will put it ahead of competitors like ChatGPT and Gemini. 

    Llama 4 benchmarking, explained

    There has been some confusion about how the new Llama models stack up compared to other models. There are a couple of ways to test chatbots, and one of the biggest is through a crowdsource AI benchmarking platform called LMArena, created by UC Berkeley SkyLab. In Meta’s announcement, the company claimed its Maverick model had outperformed ChatGPT-4o. 

    But the model that Meta actually submitted to the LMArena tests is not the model that is available for people to use now. The model submitted for testing is called “llama-4-maverick-03-26-experimental.” In a footnote on a chart on Llama’s website (not the announcement), in tiny font in the final bullet point, Meta clarifies that the model submitted to LMArena was ‘optimized for conversationality.” 

    Check out the final footnote at the bottom.

    Meta/Screenshot by Katelyn Chedraoui

    LMArena put out a statement on X/Twitter on Monday saying Meta’s policy interpretation did not match LMArena’s expectations, and that Meta should have been clearer that the submitted model was a “customized model to optimize for human preference.” In other words, it’s possible that Meta submitted a better, more human-friendly model to try and juice its scores. One way to do that could be training a model on test sets, which a set of data and tests typically run during the post-training process, not before.

    Meta’s VP of generative AI Ahmad Al Dahle tweeted that claims the company trained on test sets are “simply not true.” He said that the differences in performance people are seeing “is due to needing to stabilize implementations.” As of publication, Meta’s Llama Maverick experimental (the original model submitted) is ranked in second place on LMArena, tied with GPT-4o and preview of Grok 3. Google’s Gemini 2.5 Pro is first.

    Meta did not immediately return a request for comment.

    Meet Scout and Maverick

    There are two models in the Llama 4 family available now: Scout and Maverick. They’re open-weights models and multimodal, which means they can generate text, images and code. Open models like Meta’s mean that developers can get some insight into how the models are built. The Llama 4 models are open-weights models, which means you can see how the model makes connections and how certain characteristics are given more weight as it learns. OpenAI announced earlier this month that it is developing an open-weights model for the first time.

    Scout is the smallest model of the family, designed to run on a single Nvidia H100 GPU. Scout has a 10 million token context window and is a 17 billion parameter model featuring 16 experts (subnetworks within the model, allowing it to run tasks more efficiently). Scout has more than twice the firepower of Llama 3, which has 8 billion parameters. Generally, the more parameters a model has, the more capable it is of delivering better results faster. Maverick is a midsized model, the big brother to Scout, featuring 17 billion parameters with 128 experts.

    More information on the rest of the Llama 4 family, including a base model named Behemoth and a Llama 4 reasoning model, is expected to come later this month, according to a video posted by CEO Mark Zuckerberg. We’ll likely learn more about these models at LlamaCon, the company’s first annual AI developers conference beginning on April 29.

    For more, check out what we know about a potential standalone app for Meta AI and our review of the best AI chatbots.

    Share. Facebook Twitter Pinterest LinkedIn Reddit WhatsApp Telegram Email
    Previous ArticleBest Cooling Comforters of 2025
    Next Article Play South of Midnight Now, GTA 5 and More Soon, on Xbox Game Pass
    TechAiVerse
    • Website

    Jonathan is a tech enthusiast and the mind behind Tech AI Verse. With a passion for artificial intelligence, consumer tech, and emerging innovations, he deliver clear, insightful content to keep readers informed. From cutting-edge gadgets to AI advancements and cryptocurrency trends, Jonathan breaks down complex topics to make technology accessible to all.

    Related Posts

    Xiaomi’s Leica Edition flagship confirmed for new global release

    February 18, 2026

    Apple iPad Pro unlikely to get major update for years despite stronger-than-ever competition

    February 18, 2026

    This sleek all-black Citizen Eco-Drive dress watch is 54% off right now

    February 18, 2026
    Leave A Reply Cancel Reply

    Top Posts

    Ping, You’ve Got Whale: AI detection system alerts ships of whales in their path

    April 22, 2025683 Views

    Lumo vs. Duck AI: Which AI is Better for Your Privacy?

    July 31, 2025272 Views

    6.7 Cummins Lifter Failure: What Years Are Affected (And Possible Fixes)

    April 14, 2025155 Views

    6 Best MagSafe Phone Grips (2025), Tested and Reviewed

    April 6, 2025114 Views
    Don't Miss
    Technology February 18, 2026

    Xiaomi’s Leica Edition flagship confirmed for new global release

    Xiaomi’s Leica Edition flagship confirmed for new global release – NotebookCheck.net News ⓘ XiaomiThe Leica…

    Apple iPad Pro unlikely to get major update for years despite stronger-than-ever competition

    This sleek all-black Citizen Eco-Drive dress watch is 54% off right now

    Google’s new smartphone confirmed to launch globally with old Tensor G4 silicon on eve of release

    Stay In Touch
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo

    Subscribe to Updates

    Get the latest creative news from SmartMag about art & design.

    About Us
    About Us

    Welcome to Tech AI Verse, your go-to destination for everything technology! We bring you the latest news, trends, and insights from the ever-evolving world of tech. Our coverage spans across global technology industry updates, artificial intelligence advancements, machine learning ethics, and automation innovations. Stay connected with us as we explore the limitless possibilities of technology!

    Facebook X (Twitter) Pinterest YouTube WhatsApp
    Our Picks

    Xiaomi’s Leica Edition flagship confirmed for new global release

    February 18, 20263 Views

    Apple iPad Pro unlikely to get major update for years despite stronger-than-ever competition

    February 18, 20264 Views

    This sleek all-black Citizen Eco-Drive dress watch is 54% off right now

    February 18, 20263 Views
    Most Popular

    7 Best Kids Bikes (2025): Mountain, Balance, Pedal, Coaster

    March 13, 20250 Views

    VTOMAN FlashSpeed 1500: Plenty Of Power For All Your Gear

    March 13, 20250 Views

    This new Roomba finally solves the big problem I have with robot vacuums

    March 13, 20250 Views
    © 2026 TechAiVerse. Designed by Divya Tech.
    • Home
    • About Us
    • Contact Us
    • Privacy Policy
    • Terms & Conditions

    Type above and press Enter to search. Press Esc to cancel.