Close Menu

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    ‘Creators as the new storytellers’: Over 10,000 apply to be part of Dick’s Sporting Goods creator program

    Digiday+ Research: Brand marketing will be the priority in 2026, after revenues fell short of expectations

    Creators eye Snapchat as a reliable income alternative to TikTok and YouTube

    Facebook X (Twitter) Instagram
    • Artificial Intelligence
    • Business Technology
    • Cryptocurrency
    • Gadgets
    • Gaming
    • Health
    • Software and Apps
    • Technology
    Facebook X (Twitter) Instagram Pinterest Vimeo
    Tech AI Verse
    • Home
    • Artificial Intelligence

      Read the extended transcript: President Donald Trump interviewed by ‘NBC Nightly News’ anchor Tom Llamas

      February 6, 2026

      Stocks and bitcoin sink as investors dump software company shares

      February 4, 2026

      AI, crypto and Trump super PACs stash millions to spend on the midterms

      February 2, 2026

      To avoid accusations of AI cheating, college students are turning to AI

      January 29, 2026

      ChatGPT can embrace authoritarian ideas after just one prompt, researchers say

      January 24, 2026
    • Business

      The HDD brand that brought you the 1.8-inch, 2.5-inch, and 3.5-inch hard drives is now back with a $19 pocket-sized personal cloud for your smartphones

      February 12, 2026

      New VoidLink malware framework targets Linux cloud servers

      January 14, 2026

      Nvidia Rubin’s rack-scale encryption signals a turning point for enterprise AI security

      January 13, 2026

      How KPMG is redefining the future of SAP consulting on a global scale

      January 10, 2026

      Top 10 cloud computing stories of 2025

      December 22, 2025
    • Crypto

      Wall Street Moves Into Prediction Markets With Election-Contract ETF Filings

      February 18, 2026

      Tectonic to Host Inaugural Quantum Summit at ETHDenver 2026 Focused on Post-Quantum Cryptography Readiness for Web3

      February 18, 2026

      Ki Young Ju Says Bitcoin May Need to Hit $55K Before True Recovery Begins

      February 18, 2026

      MYX Finance Is Oversold For The First Time Ever, Yet No Relief In Sight

      February 18, 2026

      Everyone is Talking about the SaaSpocalypse, But Why Does it matter for Crypto?

      February 18, 2026
    • Technology

      ‘Creators as the new storytellers’: Over 10,000 apply to be part of Dick’s Sporting Goods creator program

      February 18, 2026

      Digiday+ Research: Brand marketing will be the priority in 2026, after revenues fell short of expectations

      February 18, 2026

      Creators eye Snapchat as a reliable income alternative to TikTok and YouTube

      February 18, 2026

      Future of TV Briefing: WTF is server-guided ad insertion?

      February 18, 2026

      ‘Agentic with a small a’: CMOs are adopting AI more slowly than it’s evolving

      February 18, 2026
    • Others
      • Gadgets
      • Gaming
      • Health
      • Software and Apps
    Check BMI
    Tech AI Verse
    You are at:Home»Technology»OpenAI admits ChatGPT safeguards fail during extended conversations
    Technology

    OpenAI admits ChatGPT safeguards fail during extended conversations

    TechAiVerseBy TechAiVerseAugust 27, 2025No Comments3 Mins Read3 Views
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr Email Reddit
    OpenAI admits ChatGPT safeguards fail during extended conversations
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp Email

    OpenAI admits ChatGPT safeguards fail during extended conversations

    Adam Raine learned to bypass these safeguards by claiming he was writing a story—a technique the lawsuit says ChatGPT itself suggested. This vulnerability partly stems from the eased safeguards regarding fantasy roleplay and fictional scenarios implemented in February. In its Tuesday blog post, OpenAI admitted its content blocking systems have gaps where “the classifier underestimates the severity of what it’s seeing.”

    OpenAI states it is “currently not referring self-harm cases to law enforcement to respect people’s privacy given the uniquely private nature of ChatGPT interactions.” The company prioritizes user privacy even in life-threatening situations, despite its moderation technology detecting self-harm content with up to 99.8 percent accuracy, according to the lawsuit. However, the reality is that detection systems identify statistical patterns associated with self-harm language, not a humanlike comprehension of crisis situations.

    OpenAI’s safety plan for the future

    In response to these failures, OpenAI describes ongoing refinements and future plans in its blog post. For example, the company says it’s consulting with “90+ physicians across 30+ countries” and plans to introduce parental controls “soon,” though no timeline has yet been provided.

    OpenAI also described plans for “connecting people to certified therapists” through ChatGPT—essentially positioning its chatbot as a mental health platform despite alleged failures like Raine’s case. The company wants to build “a network of licensed professionals people could reach directly through ChatGPT,” potentially furthering the idea that an AI system should be mediating mental health crises.

    Raine reportedly used GPT-4o to generate the suicide assistance instructions; the model is well-known for troublesome tendencies like sycophancy, where an AI model tells users pleasing things even if they are not true. OpenAI claims its recently released model, GPT-5, reduces “non-ideal model responses in mental health emergencies by more than 25% compared to 4o.” Yet this seemingly marginal improvement hasn’t stopped the company from planning to embed ChatGPT even deeper into mental health services as a gateway to therapists.

    As Ars previously explored, breaking free from an AI chatbot’s influence when stuck in a deceptive chat spiral often requires outside intervention. Starting a new chat session without conversation history and memories turned off can reveal how responses change without the buildup of previous exchanges—a reality check that becomes impossible in long, isolated conversations where safeguards deteriorate.

    However, “breaking free” of that context is very difficult to do when the user actively wishes to continue to engage in the potentially harmful behavior—while using a system that increasingly monetizes their attention and intimacy.

    Share. Facebook Twitter Pinterest LinkedIn Reddit WhatsApp Telegram Email
    Previous ArticleUS‘s spike in electricity use is slowing down a bit
    Next Article Authors celebrate “historic” settlement coming soon in Anthropic class action
    TechAiVerse
    • Website

    Jonathan is a tech enthusiast and the mind behind Tech AI Verse. With a passion for artificial intelligence, consumer tech, and emerging innovations, he deliver clear, insightful content to keep readers informed. From cutting-edge gadgets to AI advancements and cryptocurrency trends, Jonathan breaks down complex topics to make technology accessible to all.

    Related Posts

    ‘Creators as the new storytellers’: Over 10,000 apply to be part of Dick’s Sporting Goods creator program

    February 18, 2026

    Digiday+ Research: Brand marketing will be the priority in 2026, after revenues fell short of expectations

    February 18, 2026

    Creators eye Snapchat as a reliable income alternative to TikTok and YouTube

    February 18, 2026
    Leave A Reply Cancel Reply

    Top Posts

    Ping, You’ve Got Whale: AI detection system alerts ships of whales in their path

    April 22, 2025683 Views

    Lumo vs. Duck AI: Which AI is Better for Your Privacy?

    July 31, 2025270 Views

    6.7 Cummins Lifter Failure: What Years Are Affected (And Possible Fixes)

    April 14, 2025155 Views

    6 Best MagSafe Phone Grips (2025), Tested and Reviewed

    April 6, 2025114 Views
    Don't Miss
    Technology February 18, 2026

    ‘Creators as the new storytellers’: Over 10,000 apply to be part of Dick’s Sporting Goods creator program

    ‘Creators as the new storytellers’: Over 10,000 apply to be part of Dick’s Sporting Goods…

    Digiday+ Research: Brand marketing will be the priority in 2026, after revenues fell short of expectations

    Creators eye Snapchat as a reliable income alternative to TikTok and YouTube

    Future of TV Briefing: WTF is server-guided ad insertion?

    Stay In Touch
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo

    Subscribe to Updates

    Get the latest creative news from SmartMag about art & design.

    About Us
    About Us

    Welcome to Tech AI Verse, your go-to destination for everything technology! We bring you the latest news, trends, and insights from the ever-evolving world of tech. Our coverage spans across global technology industry updates, artificial intelligence advancements, machine learning ethics, and automation innovations. Stay connected with us as we explore the limitless possibilities of technology!

    Facebook X (Twitter) Pinterest YouTube WhatsApp
    Our Picks

    ‘Creators as the new storytellers’: Over 10,000 apply to be part of Dick’s Sporting Goods creator program

    February 18, 20264 Views

    Digiday+ Research: Brand marketing will be the priority in 2026, after revenues fell short of expectations

    February 18, 20263 Views

    Creators eye Snapchat as a reliable income alternative to TikTok and YouTube

    February 18, 20265 Views
    Most Popular

    7 Best Kids Bikes (2025): Mountain, Balance, Pedal, Coaster

    March 13, 20250 Views

    VTOMAN FlashSpeed 1500: Plenty Of Power For All Your Gear

    March 13, 20250 Views

    This new Roomba finally solves the big problem I have with robot vacuums

    March 13, 20250 Views
    © 2026 TechAiVerse. Designed by Divya Tech.
    • Home
    • About Us
    • Contact Us
    • Privacy Policy
    • Terms & Conditions

    Type above and press Enter to search. Press Esc to cancel.