Close Menu

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    OpenAI debuts GPT‑5.1-Codex-Max coding model and it already completed a 24-hour task internally

    The Google Search of AI agents? Fetch launches ASI:One and Business tier for new era of non-human web

    OpenCV founders launch AI video startup to take on OpenAI and Google

    Facebook X (Twitter) Instagram
    • Artificial Intelligence
    • Business Technology
    • Cryptocurrency
    • Gadgets
    • Gaming
    • Health
    • Software and Apps
    • Technology
    Facebook X (Twitter) Instagram Pinterest Vimeo
    Tech AI Verse
    • Home
    • Artificial Intelligence

      Insurance companies are trying to avoid big payouts by making AI safer

      November 19, 2025

      State and local opposition to new data centers is gaining steam, study shows

      November 15, 2025

      Amazon to lay off 14,000 corporate employees

      October 29, 2025

      Elon Musk launches Grokipedia as an alternative to ‘woke’ Wikipedia

      October 29, 2025

      Fears of an AI bubble are growing, but some on Wall Street aren’t worried just yet

      October 18, 2025
    • Business

      Windows 11 gets new Cloud Rebuild, Point-in-Time Restore tools

      November 18, 2025

      Government faces questions about why US AWS outage disrupted UK tax office and banking firms

      October 23, 2025

      Amazon’s AWS outage knocked services like Alexa, Snapchat, Fortnite, Venmo and more offline

      October 21, 2025

      SAP ECC customers bet on composable ERP to avoid upgrading

      October 18, 2025

      Revenue generated by neoclouds expected to exceed $23bn in 2025, predicts Synergy

      October 15, 2025
    • Crypto

      Nvidia Posts $57B Record Revenue with Bitcoin Rebounding Above $91K

      November 20, 2025

      3 Reasons Why A Cardano Price Rebound Looks Likely

      November 20, 2025

      BitMine (BMNR) Stock Bounces As Q4 Results Near — Is the Price Preparing Another Early Move?

      November 20, 2025

      Fed Minutes Reveal December Rate Cut on a Knife’s Edge, Bitcoin Slips Below $89,000

      November 20, 2025

      TRUMP Price Holds Above $7, Even As Epstein Files Release Approved

      November 20, 2025
    • Technology

      OpenAI debuts GPT‑5.1-Codex-Max coding model and it already completed a 24-hour task internally

      November 20, 2025

      The Google Search of AI agents? Fetch launches ASI:One and Business tier for new era of non-human web

      November 20, 2025

      OpenCV founders launch AI video startup to take on OpenAI and Google

      November 20, 2025

      VentureBeat launches “Beyond the Pilot” — a new podcast series exploring how enterprise AI gets real

      November 20, 2025

      Meta’s DreamGym framework trains AI agents in a simulated world to cut reinforcement learning costs

      November 20, 2025
    • Others
      • Gadgets
      • Gaming
      • Health
      • Software and Apps
    Check BMI
    Tech AI Verse
    You are at:Home»Technology»Cloudflare contrite after worst outage since 2019
    Technology

    Cloudflare contrite after worst outage since 2019

    TechAiVerseBy TechAiVerseNovember 20, 2025No Comments6 Mins Read0 Views
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr Email Reddit
    Cloudflare contrite after worst outage since 2019
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp Email

    Cloudflare contrite after worst outage since 2019

    Cloudflare CEO Matthew Prince apologises for the firm’s worst outage in years and shares details of how a change to database system permissions caused a cascading effect that brought down some of the web’s biggest names

    By

    • Alex Scroxton,
      Security Editor

    Published: 19 Nov 2025 17:04

    Cloudflare co-founder and CEO Matthew Prince has described the Tuesday 18 November hiccup that disrupted global internet traffic for hours as the organisation’s worst outage since 2019, saying that the traffic management giant has not experienced an issue that has caused the majority of core traffic to stop flowing through its network in more than six years.

    “An outage like today is unacceptable. We’ve architected our systems to be highly resilient to failure to ensure traffic will always continue to flow. When we’ve had outages in the past, it’s always led to us building new, more resilient systems,” said Prince. “On behalf of the entire team at Cloudflare, I would like to apologise for the pain we caused the internet today.”

    The Cloudflare outage began at 11.20am UTC (6.20am EST) on Tuesday when its network began experiencing significant failures to deliver core traffic, which manifested to ordinary web users as an error page indicating a Cloudflare network failure when they tried to access a customer site. The issue was triggered not by a cyber attack or malicious activity, but a minor change affecting a file used by Cloudflare’s Bot Management security system.

    Cloudflare Bot Management includes a machine learning model that generates bot “scores” for any request crossing the network – these scores are used by customers to allow or disallow bots from accessing their sites. It relies on a feature configuration file that the model uses to predict whether a request is automated or not, and because the bot landscape is so dynamic, it is refreshed and pushed live every few minutes specifically so that Cloudflare can react to new bots and attacks.

    The outage originated from a change to database system permissions that caused said database to output multiple entries into the feature configuration file. The file rapidly increased in size and was unfortunately propagated to all the machines comprising Cloudflare’s network. These machines – which route traffic across the network – were supposed to read the file to update the Bot Management system but because their software has a limit on the size of the feature file, it failed when the larger-than-expected feature file showed up, causing the machines to crash.

    DDoS confusion

    Prince said Cloudflare’s tech teams at first suspected they were seeing a hyperscale distributed-denial-of-service (DDoS) attack because of two factors. First, Cloudflare’s own status page, which is hosted off its infrastructure with no dependencies, coincidentally went down. Second, at the beginning of the outage period, Cloudflare saw brief periods of apparent system recovery.

    This was not, however, the result of threat actor activity – rather, it was happening because the feature file was being generated every five minutes by a query running on a ClickHouse database cluster, which was itself in the process of being updated to improve permissions management.

    The dodgy file was therefore only generated if the query ran on an updated part of the cluster, so every five minutes there was a chance of either normal or abnormal feature files being generated and propagated.

    “This fluctuation made it unclear what was happening as the entire system would recover and then fail again as sometimes good, sometimes bad configuration files were distributed to our network,” said Prince. “Initially, this led us to believe this might be caused by an attack. Eventually, every ClickHouse node was generating the bad configuration file and the fluctuation stabilised in the failing state.”

    These errors continued until the tech team was able to identify the issue and resolve it by stopping the generation and propagation of the bad feature file, manually inserting a “known good” file into the distribution queue, and then turning the core proxy off and on again. This done, things started to return to normal from 2.30pm onwards, and the number of baseline errors on Cloudflare’s network returned to normal about two-and-a-half hours later.

    Risk and resilience

    Although Cloudflare was not itself attacked by a threat actor, the outage is still a serious cyber risk issue with lessons to be learned not just at Cloudflare, but among all organisations, whether or not they are customers. It has exposed a deeper, systemic risk in that too much of the internet’s infrastructure rests on only a few shoulders.

    Ryan Polk, policy director at US-based non-profit the Internet Society, said that market concentration among content delivery networks (CDNs) had steadily increased since 2020: “CDNs offer clear advantages – they improve reliability, reduce latency and lower transit demand. However, when too much internet traffic is concentrated within a few providers, these networks can become single points of failure that disrupt access to large parts of the internet.

    “Organisations should assess the resilience of the services they rely on and examine their supply chains. Which systems and providers are critical to their operations? Where do single points of failure exist? Companies should explore ways to diversify, such as using multiple cloud, CDN or authentication providers to reduce risk and improve overall resilience.”

    Martin Greenfield, CEO at Quod Orbis, a continuous monitoring platform, added: “When a single auto-generated configuration file can take major parts of the web offline, that’s not purely a Cloudflare issue but a fragility problem that has become baked into how organisations build their security stacks.

    “Automation makes security scalable, but when automated configuration propagates instantly across a global network, it also scales failure. What’s missing in most organisations, and was clearly missing here, is automated assurance that validates those configurations before they go live. Automation without assurance is fragility at scale and relying on one vendor can’t stand up for an effective resilience strategy.”

    For its part, Prince said Cloudflare will be taking steps to lessen the chances of such an issue cropping up again in the future. These include hardening the ingestion of Cloudflare-generated configuration files in the same way it would do for user-generated inputs, enabling global kill-switches for features, working to eliminate the ability for core dumps or error reports to overwhelm system resources, and reviewing failure modes for error conditions across all of its core proxy modules.

    Read more on Network security management


    • Cloudflare outage disrupts public web services

      By: Alex Scroxton


    • Starlink traffic continues to skyrocket

      By: Joe O’Halloran


    • Cloudflare discloses breach related to stolen Okta data

      By: Rob Wright


    • Cloudflare eyes GenAI workloads with Workers AI

      By: Aaron Tan

    Share. Facebook Twitter Pinterest LinkedIn Reddit WhatsApp Telegram Email
    Previous ArticleEU sets out plans to cut red tape on digital
    Next Article Meta’s DreamGym framework trains AI agents in a simulated world to cut reinforcement learning costs
    TechAiVerse
    • Website

    Jonathan is a tech enthusiast and the mind behind Tech AI Verse. With a passion for artificial intelligence, consumer tech, and emerging innovations, he deliver clear, insightful content to keep readers informed. From cutting-edge gadgets to AI advancements and cryptocurrency trends, Jonathan breaks down complex topics to make technology accessible to all.

    Related Posts

    OpenAI debuts GPT‑5.1-Codex-Max coding model and it already completed a 24-hour task internally

    November 20, 2025

    The Google Search of AI agents? Fetch launches ASI:One and Business tier for new era of non-human web

    November 20, 2025

    OpenCV founders launch AI video startup to take on OpenAI and Google

    November 20, 2025
    Leave A Reply Cancel Reply

    Top Posts

    Ping, You’ve Got Whale: AI detection system alerts ships of whales in their path

    April 22, 2025410 Views

    Lumo vs. Duck AI: Which AI is Better for Your Privacy?

    July 31, 2025109 Views

    6.7 Cummins Lifter Failure: What Years Are Affected (And Possible Fixes)

    April 14, 202575 Views

    Is Libby Compatible With Kobo E-Readers?

    March 31, 202555 Views
    Don't Miss
    Technology November 20, 2025

    OpenAI debuts GPT‑5.1-Codex-Max coding model and it already completed a 24-hour task internally

    OpenAI debuts GPT‑5.1-Codex-Max coding model and it already completed a 24-hour task internally Vercel Security…

    The Google Search of AI agents? Fetch launches ASI:One and Business tier for new era of non-human web

    OpenCV founders launch AI video startup to take on OpenAI and Google

    VentureBeat launches “Beyond the Pilot” — a new podcast series exploring how enterprise AI gets real

    Stay In Touch
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo

    Subscribe to Updates

    Get the latest creative news from SmartMag about art & design.

    About Us
    About Us

    Welcome to Tech AI Verse, your go-to destination for everything technology! We bring you the latest news, trends, and insights from the ever-evolving world of tech. Our coverage spans across global technology industry updates, artificial intelligence advancements, machine learning ethics, and automation innovations. Stay connected with us as we explore the limitless possibilities of technology!

    Facebook X (Twitter) Pinterest YouTube WhatsApp
    Our Picks

    OpenAI debuts GPT‑5.1-Codex-Max coding model and it already completed a 24-hour task internally

    November 20, 20251 Views

    The Google Search of AI agents? Fetch launches ASI:One and Business tier for new era of non-human web

    November 20, 20251 Views

    OpenCV founders launch AI video startup to take on OpenAI and Google

    November 20, 20250 Views
    Most Popular

    Xiaomi 15 Ultra Officially Launched in China, Malaysia launch to follow after global event

    March 12, 20250 Views

    Apple thinks people won’t use MagSafe on iPhone 16e

    March 12, 20250 Views

    French Apex Legends voice cast refuses contracts over “unacceptable” AI clause

    March 12, 20250 Views
    © 2025 TechAiVerse. Designed by Divya Tech.
    • Home
    • About Us
    • Contact Us
    • Privacy Policy
    • Terms & Conditions

    Type above and press Enter to search. Press Esc to cancel.