Close Menu

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Former Firaxis Games creative director announces closure of Midsummer Studios

    Remake specialist Bluepoint Games, co-developer of God of War Ragnarok, shut down by Sony

    Football commentator permits EA to use an AI version of his voice for EA Sports FC

    Facebook X (Twitter) Instagram
    • Artificial Intelligence
    • Business Technology
    • Cryptocurrency
    • Gadgets
    • Gaming
    • Health
    • Software and Apps
    • Technology
    Facebook X (Twitter) Instagram Pinterest Vimeo
    Tech AI Verse
    • Home
    • Artificial Intelligence

      Tensions between the Pentagon and AI giant Anthropic reach a boiling point

      February 21, 2026

      Read the extended transcript: President Donald Trump interviewed by ‘NBC Nightly News’ anchor Tom Llamas

      February 6, 2026

      Stocks and bitcoin sink as investors dump software company shares

      February 4, 2026

      AI, crypto and Trump super PACs stash millions to spend on the midterms

      February 2, 2026

      To avoid accusations of AI cheating, college students are turning to AI

      January 29, 2026
    • Business

      Gartner: Why neoclouds are the future of GPU-as-a-Service

      February 21, 2026

      The HDD brand that brought you the 1.8-inch, 2.5-inch, and 3.5-inch hard drives is now back with a $19 pocket-sized personal cloud for your smartphones

      February 12, 2026

      New VoidLink malware framework targets Linux cloud servers

      January 14, 2026

      Nvidia Rubin’s rack-scale encryption signals a turning point for enterprise AI security

      January 13, 2026

      How KPMG is redefining the future of SAP consulting on a global scale

      January 10, 2026
    • Crypto

      3 Altcoins Crypto Whales are Buying After Supreme Court’s Trump Tariff Ban

      February 22, 2026

      SBI Deepens XRP Bet With Bond Incentives and Venture Studio Plan

      February 22, 2026

      IoTeX Hit by Private Key Exploit, Attacker Drains Over $2 Million

      February 22, 2026

      Solana Price Faces a Bull Trap as 50% Holders Exit

      February 22, 2026

      XRP Flaunts a 3-Week ETF Inflow Streak, So Why is Price Still Stuck Below $1.50?

      February 22, 2026
    • Technology

      U.S. Cannot Legally Impose Tariffs Using Section 122 of the Trade Act of 1974

      February 22, 2026

      Japanese Woodblock Print Search

      February 22, 2026

      Today’s NYT Mini Crossword Answers for Sunday, Feb. 22

      February 22, 2026

      A Botnet Accidentally Destroyed I2P

      February 22, 2026

      How I use Claude Code: Separation of planning and execution

      February 22, 2026
    • Others
      • Gadgets
      • Gaming
      • Health
      • Software and Apps
    Check BMI
    Tech AI Verse
    You are at:Home»Technology»One of Google’s recent Gemini AI models scores worse on safety
    Technology

    One of Google’s recent Gemini AI models scores worse on safety

    TechAiVerseBy TechAiVerseMay 3, 2025No Comments3 Mins Read3 Views
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr Email Reddit
    One of Google’s recent Gemini AI models scores worse on safety
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp Email

    One of Google’s recent Gemini AI models scores worse on safety

    A recently released Google AI model scores worse on certain safety tests than its predecessor, according to the company’s internal benchmarking.

    In a technical report published this week, Google reveals that its Gemini 2.5 Flash model is more likely to generate text that violates its safety guidelines than Gemini 2.0 Flash. On two metrics, “text-to-text safety” and “image-to-text safety,” Gemini 2.5 Flash regresses 4.1% and 9.6%, respectively.

    Text-to-text safety measures how frequently a model violates Google’s guidelines given a prompt, while image-to-text safety evaluates how closely the model adheres to these boundaries when prompted using an image. Both tests are automated, not human-supervised.

    In an emailed statement, a Google spokesperson confirmed that Gemini 2.5 Flash “performs worse on text-to-text and image-to-text safety.”

    These surprising benchmark results come as AI companies move to make their models more permissive — in other words, less likely to refuse to respond to controversial or sensitive subjects. For its latest crop of Llama models, Meta said it tuned the models not to endorse “some views over others” and to reply to more “debated” political prompts. OpenAI said earlier this year that it would tweak future models to not take an editorial stance and offer multiple perspectives on controversial topics.

    Sometimes, those permissiveness efforts have backfired. TechCrunch reported Monday that the default model powering OpenAI’s ChatGPT allowed minors to generate erotic conversations. OpenAI blamed the behavior on a “bug.”

    According to Google’s technical report, Gemini 2.5 Flash, which is still in preview, follows instructions more faithfully than Gemini 2.0 Flash, inclusive of instructions that cross problematic lines. The company claims that the regressions can be attributed partly to false positives, but it also admits that Gemini 2.5 Flash sometimes generates “violative content” when explicitly asked.

    Techcrunch event

    Berkeley, CA
    |
    June 5


    BOOK NOW

    “Naturally, there is tension between [instruction following] on sensitive topics and safety policy violations, which is reflected across our evaluations,” reads the report.

    Scores from SpeechMap, a benchmark that probes how models respond to sensitive and controversial prompts, also suggest that Gemini 2.5 Flash is far less likely to refuse to answer contentious questions than Gemini 2.0 Flash. TechCrunch’s testing of the model via AI platform OpenRouter found that it’ll uncomplainingly write essays in support of replacing human judges with AI, weakening due process protections in the U.S., and implementing widespread warrantless government surveillance programs.

    Thomas Woodside, co-founder of the Secure AI Project, said the limited details Google gave in its technical report demonstrates the need for more transparency in model testing.

    “There’s a trade-off between instruction-following and policy following, because some users may ask for content that would violate policies,” Woodside told TechCrunch. “In this case, Google’s latest Flash model complies with instructions more while also violating policies more. Google doesn’t provide much detail on the specific cases where policies were violated, although they say they are not severe. Without knowing more, it’s hard for independent analysts to know whether there’s a problem.”

    Google has come under fire for its model safety reporting practices before.

    It took the company weeks to publish a technical report for its most capable model, Gemini 2.5 Pro. When the report eventually was published, it initially omitted key safety testing details.

    On Monday, Google released a more detailed report with additional safety information.

    Kyle Wiggers is TechCrunch’s AI Editor. His writing has appeared in VentureBeat and Digital Trends, as well as a range of gadget blogs including Android Police, Android Authority, Droid-Life, and XDA-Developers. He lives in Manhattan with his partner, a music therapist.

    View Bio

    Share. Facebook Twitter Pinterest LinkedIn Reddit WhatsApp Telegram Email
    Previous ArticleApple and Anthropic reportedly partner to build an AI coding platform
    Next Article Google will soon start letting kids under 13 use its Gemini chatbot
    TechAiVerse
    • Website

    Jonathan is a tech enthusiast and the mind behind Tech AI Verse. With a passion for artificial intelligence, consumer tech, and emerging innovations, he deliver clear, insightful content to keep readers informed. From cutting-edge gadgets to AI advancements and cryptocurrency trends, Jonathan breaks down complex topics to make technology accessible to all.

    Related Posts

    U.S. Cannot Legally Impose Tariffs Using Section 122 of the Trade Act of 1974

    February 22, 2026

    Japanese Woodblock Print Search

    February 22, 2026

    Today’s NYT Mini Crossword Answers for Sunday, Feb. 22

    February 22, 2026
    Leave A Reply Cancel Reply

    Top Posts

    Ping, You’ve Got Whale: AI detection system alerts ships of whales in their path

    April 22, 2025688 Views

    Lumo vs. Duck AI: Which AI is Better for Your Privacy?

    July 31, 2025277 Views

    6.7 Cummins Lifter Failure: What Years Are Affected (And Possible Fixes)

    April 14, 2025159 Views

    6 Best MagSafe Phone Grips (2025), Tested and Reviewed

    April 6, 2025120 Views
    Don't Miss
    Gaming February 22, 2026

    Former Firaxis Games creative director announces closure of Midsummer Studios

    Former Firaxis Games creative director announces closure of Midsummer Studios “We built a studio, we…

    Remake specialist Bluepoint Games, co-developer of God of War Ragnarok, shut down by Sony

    Football commentator permits EA to use an AI version of his voice for EA Sports FC

    UK’s Advertising Standards Authority bans Call of Duty: Black Ops 7 commercial for “trivializing sexual violence”

    Stay In Touch
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo

    Subscribe to Updates

    Get the latest creative news from SmartMag about art & design.

    About Us
    About Us

    Welcome to Tech AI Verse, your go-to destination for everything technology! We bring you the latest news, trends, and insights from the ever-evolving world of tech. Our coverage spans across global technology industry updates, artificial intelligence advancements, machine learning ethics, and automation innovations. Stay connected with us as we explore the limitless possibilities of technology!

    Facebook X (Twitter) Pinterest YouTube WhatsApp
    Our Picks

    Former Firaxis Games creative director announces closure of Midsummer Studios

    February 22, 20263 Views

    Remake specialist Bluepoint Games, co-developer of God of War Ragnarok, shut down by Sony

    February 22, 20263 Views

    Football commentator permits EA to use an AI version of his voice for EA Sports FC

    February 22, 20264 Views
    Most Popular

    7 Best Kids Bikes (2025): Mountain, Balance, Pedal, Coaster

    March 13, 20250 Views

    VTOMAN FlashSpeed 1500: Plenty Of Power For All Your Gear

    March 13, 20250 Views

    This new Roomba finally solves the big problem I have with robot vacuums

    March 13, 20250 Views
    © 2026 TechAiVerse. Designed by Divya Tech.
    • Home
    • About Us
    • Contact Us
    • Privacy Policy
    • Terms & Conditions

    Type above and press Enter to search. Press Esc to cancel.