    Google finds AI chatbots are only 69% accurate… at best

    By TechAiVerse, December 16, 2025

    AI chatbots still get one in three answers wrong


    Solen Feyissa / Unsplash

    Google has published a blunt assessment of how reliable today’s AI chatbots really are, and the numbers are not flattering. Using its newly introduced FACTS Benchmark Suite, the company found that even the best AI models struggle to break past a 70% factual accuracy rate. The top performer, Gemini 3 Pro, reached 69% overall accuracy, while other leading systems from OpenAI, Anthropic, and xAI scored even lower. The takeaway is simple and uncomfortable. These chatbots still get roughly one out of every three answers wrong, even when they sound confident doing it.

    The benchmark matters because most existing AI tests focus on whether a model can complete a task, not whether the information it produces is actually true. For industries like finance, healthcare, and law, that gap can be costly. A fluent response that sounds confident but contains errors can do real damage, especially when users assume the chatbot knows what it is talking about.

    What Google’s accuracy test reveals

    The FACTS Benchmark Suite was built by Google’s FACTS team with Kaggle to directly test factual accuracy across four real-world use cases. One test measures parametric knowledge, which checks whether a model can answer fact-based questions using only what it learned during training. Another evaluates search performance, testing how well models use web tools to retrieve accurate information. A third focuses on grounding, meaning whether the model sticks to a provided document without adding false details. The fourth examines multimodal understanding, such as reading charts, diagrams, and images correctly.
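
To make the four-part structure concrete, here is a minimal Python sketch of how per-test accuracies could be rolled up into a single headline number. It assumes the overall score is an unweighted mean of the four tests, which the article does not state. The per-test numbers and the function names (`overall_score`, `weakest_area`, `error_rate`) are hypothetical placeholders, not figures or code from Google's report.

```python
from statistics import mean

# Hypothetical per-test accuracies (fraction of answers judged correct).
# Illustrative placeholders only; these are NOT results from the FACTS report.
example_scores = {
    "parametric": 0.75,
    "search": 0.80,
    "grounding": 0.70,
    "multimodal": 0.45,
}


def overall_score(per_test: dict[str, float]) -> float:
    """Combine per-test accuracies with an unweighted mean (an assumed aggregation)."""
    if not per_test:
        raise ValueError("need at least one test score")
    return mean(per_test.values())


def weakest_area(per_test: dict[str, float]) -> str:
    """Return the name of the test with the lowest accuracy."""
    return min(per_test, key=per_test.get)


def error_rate(accuracy: float) -> float:
    """Share of answers that are wrong, given an accuracy between 0 and 1."""
    return 1.0 - accuracy


if __name__ == "__main__":
    total = overall_score(example_scores)
    print(f"overall accuracy: {total:.1%}")
    print(f"overall error rate: {error_rate(total):.1%}")
    print(f"weakest area: {weakest_area(example_scores)}")
```

Under this assumption, a single weak area such as multimodal understanding drags the headline number down even when the other three tests score well.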

    The results show sharp differences between models. Gemini 3 Pro led the leaderboard with a 69% FACTS score, followed by Gemini 2.5 Pro and OpenAI’s GPT-5 at nearly 62%. Claude Opus 4.5 landed at ~51%, while Grok 4 scored ~54%. Multimodal tasks were the weakest area across the board, with accuracy often below 50%. This matters because these tasks involve reading charts, diagrams, or images, where a chatbot could confidently misread a sales graph or pull the wrong number from a document, leading to mistakes that are easy to miss but hard to undo.

    The takeaway isn’t that chatbots are useless, but that blind trust is risky. Google’s own data suggests AI is improving, yet it still needs verification, guardrails, and human oversight before it can be treated as a reliable source of truth.

    Manisha likes to cover technology that is a part of everyday life, from smartphones & apps to gaming & streaming…

    Jonathan is a tech enthusiast and the mind behind Tech AI Verse. With a passion for artificial intelligence, consumer tech, and emerging innovations, he delivers clear, insightful content to keep readers informed. From cutting-edge gadgets to AI advancements and cryptocurrency trends, Jonathan breaks down complex topics to make technology accessible to all.
