    Tech AI Verse

    Gemini 3 Flash is smart — but when it doesn’t know, it makes stuff up anyway

    By TechAiVerse | December 23, 2025 | 5 Mins Read

    • Gemini 3 Flash often invents answers instead of admitting when it doesn’t know something
    • The problem arises with factual or high‑stakes questions
    • But it still tests as the most accurate and capable AI model

    Gemini 3 Flash is fast and clever. But if you ask it something it doesn’t actually know – something obscure or tricky or just outside its training – it will almost always try to bluff its way through, according to a recent evaluation from the independent testing group Artificial Analysis.

    Gemini 3 Flash scored 91% on the “hallucination rate” portion of the AA-Omniscience benchmark. That means that when it didn’t have the answer, it almost always gave one anyway, and that answer was fictional.

    AI chatbots making things up has been an issue since they first debuted. Knowing when to stop and say “I don’t know” is just as important as knowing how to answer in the first place. Currently, Google’s Gemini 3 Flash doesn’t do that very well. That’s what the test is for: seeing whether a model can differentiate actual knowledge from a guess.

    Lest the number distract from reality, it should be noted that Gemini’s high hallucination rate doesn’t mean 91% of its total answers are false. Instead, it means that in situations where the correct answer would be “I don’t know,” it fabricated an answer 91% of the time. That’s a subtle distinction, but one with real-world implications, especially as Gemini is integrated into more products like Google Search.
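
    To make that distinction concrete, here is a minimal Python sketch. It assumes the rate is computed the way the paragraph above describes it: fabricated answers divided by all questions the model did not actually know. The outcome labels and the tallies are invented for illustration and are not Gemini’s real results or Artificial Analysis’s code.

```python
from collections import Counter


def hallucination_rate(outcomes):
    """Share of not-known questions where the model answered anyway.

    `outcomes` holds one hypothetical label per question:
    "correct", "incorrect" (a fabricated answer), or "abstained".
    """
    counts = Counter(outcomes)
    not_known = counts["incorrect"] + counts["abstained"]
    if not_known == 0:
        return 0.0
    return counts["incorrect"] / not_known


def overall_error_rate(outcomes):
    """Share of ALL questions that received a wrong answer."""
    counts = Counter(outcomes)
    total = sum(counts.values())
    return counts["incorrect"] / total if total else 0.0


# Made-up tallies, chosen only to show how the two numbers differ.
outcomes = ["correct"] * 500 + ["incorrect"] * 455 + ["abstained"] * 45

print(f"hallucination rate: {hallucination_rate(outcomes):.1%}")
print(f"overall error rate: {overall_error_rate(outcomes):.1%}")
```

    With these invented tallies, the model fabricates an answer on 91% of the questions it did not know, while fewer than half of its total answers are wrong.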

    Ok, it’s not only me. Gemini 3 Flash has a 91% hallucination rate on the Artificial Analysis Omniscience Hallucination Rate benchmark!? Can you actually use this for anything serious? I wonder if the reason Anthropic models are so good at coding is that they hallucinate much… https://t.co/b3CZbX9pHw pic.twitter.com/uZnF8KKZD4 (December 18, 2025)

    This result doesn’t diminish the power and utility of Gemini 3. The model remains the highest-performing in general-purpose tests and ranks alongside, or even ahead of, the latest versions of ChatGPT and Claude. It just errs on the side of confidence when it should be modest.

    The overconfidence in answering crops up with Gemini’s rivals as well. What makes Gemini’s number stand out is how often it happens in these uncertainty scenarios, where there’s simply no correct answer in the training data or no definitive public source to point to.

    Hallucination Honesty

    Part of the issue is that generative AI models are largely word-prediction tools, and predicting the next word is not the same as evaluating truth. That means the default behavior is to produce another word, even when saying “I don’t know” would be more honest.

    OpenAI has started addressing this and getting its models to recognize what they don’t know and say so clearly. It’s a tough thing to train, because reward models don’t typically value a blank response over a confident (but wrong) one. Still, OpenAI has made it a goal for the development of future models.
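
    The incentive problem can be sketched with a toy scoring rule. This is a simplified illustration, not OpenAI’s or Google’s actual training setup: the function names, the penalty values, and the assumption that abstaining scores zero are all hypothetical.

```python
def expected_score(p_correct, wrong_penalty):
    """Expected score for answering: +1 if right, -wrong_penalty if wrong."""
    return p_correct * 1.0 - (1.0 - p_correct) * wrong_penalty


def should_answer(p_correct, wrong_penalty, abstain_score=0.0):
    """Answer only when guessing is expected to beat saying "I don't know"."""
    return expected_score(p_correct, wrong_penalty) > abstain_score


# With no penalty for wrong answers, even a 10% guess beats abstaining.
print(should_answer(0.1, wrong_penalty=0.0))  # True

# Once a wrong answer costs as much as a right one earns,
# a 10% guess is no longer worth making, but a 60% one still is.
print(should_answer(0.1, wrong_penalty=1.0))  # False
print(should_answer(0.6, wrong_penalty=1.0))  # True
```

    Under the first rule, a model optimized for score has no reason ever to abstain, which is the behavior the hallucination-rate test is designed to expose.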

    And Gemini does usually cite sources when it can. But even then, it doesn’t always pause when it should. That wouldn’t matter much if Gemini were just a research model, but as Gemini becomes the voice behind many Google features, being confidently wrong could affect quite a lot.

    There’s also a design choice here. Many users expect their AI assistant to respond quickly and smoothly. Saying “I’m not sure” or “Let me check on that” might feel clunky in a chatbot context. But it’s probably better than being misled. Generative AI still isn’t always reliable, so double-checking any AI response is a good idea.


    Eric Hal Schwartz is a freelance writer for TechRadar with more than 15 years of experience covering the intersection of the world and technology. For the last five years, he served as head writer for Voicebot.ai and was on the leading edge of reporting on generative AI and large language models. He’s since become an expert on the products of generative AI models, such as OpenAI’s ChatGPT, Anthropic’s Claude, Google Gemini, and every other synthetic media tool. His experience runs the gamut of media, including print, digital, broadcast, and live events. Now, he’s continuing to tell the stories people want and need to hear about the rapidly evolving AI space and its impact on their lives. Eric is based in New York City.
