Close Menu

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    5 New Electronics You Can Buy At Costco In 2026

    5 Of The Most Frustrating Engine Layouts Ever Designed

    4 Big Changes Coming To ChatGPT In 2026

    Facebook X (Twitter) Instagram
    • Artificial Intelligence
    • Business Technology
    • Cryptocurrency
    • Gadgets
    • Gaming
    • Health
    • Software and Apps
    • Technology
    Facebook X (Twitter) Instagram Pinterest Vimeo
    Tech AI Verse
    • Home
    • Artificial Intelligence

      AI, crypto and Trump super PACs stash millions to spend on the midterms

      February 2, 2026

      To avoid accusations of AI cheating, college students are turning to AI

      January 29, 2026

      ChatGPT can embrace authoritarian ideas after just one prompt, researchers say

      January 24, 2026

      Ashley St. Clair, the mother of one of Elon Musk’s children, sues xAI over Grok sexual images

      January 17, 2026

      Anthropic joins OpenAI’s push into health care with new Claude tools

      January 12, 2026
    • Business

      New VoidLink malware framework targets Linux cloud servers

      January 14, 2026

      Nvidia Rubin’s rack-scale encryption signals a turning point for enterprise AI security

      January 13, 2026

      How KPMG is redefining the future of SAP consulting on a global scale

      January 10, 2026

      Top 10 cloud computing stories of 2025

      December 22, 2025

      Saudia Arabia’s STC commits to five-year network upgrade programme with Ericsson

      December 18, 2025
    • Crypto

      XRP Advances 3% After Ripple Achieves Major Regulatory Breakthrough in Europe

      February 3, 2026

      BitMEX Launches the Grand Ascent Campaign Featuring a 100,000 USDT Prize Pool

      February 3, 2026

      At $76K, Strategy’s Average Cost Meets Bitcoin’s Current Price

      February 3, 2026

      Solana Rebounds After Sell-Off as Big Money Returns — Why $120 Matters Next

      February 3, 2026

      Clarity Act Loses Clarity Over Trump’s UAE Crypto Deal

      February 3, 2026
    • Technology

      5 New Electronics You Can Buy At Costco In 2026

      February 4, 2026

      5 Of The Most Frustrating Engine Layouts Ever Designed

      February 4, 2026

      4 Big Changes Coming To ChatGPT In 2026

      February 4, 2026

      Changing These 5 Settings Will Instantly Make Your Monitor Look Better

      February 4, 2026

      3 Apps You Should Be Using Alongside ChatGPT

      February 4, 2026
    • Others
      • Gadgets
      • Gaming
      • Health
      • Software and Apps
    Check BMI
    Tech AI Verse
    You are at:Home»Technology»The Coming Need for Formal Specification
    Technology

    The Coming Need for Formal Specification

    TechAiVerseBy TechAiVerseDecember 13, 2025No Comments7 Mins Read0 Views
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr Email Reddit
    The Coming Need for Formal Specification
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp Email

    The Coming Need for Formal Specification

    In late 2022, I had a conversation with a senior engineer on the coming problem
    of “what to do when AI is writing most of the code”. His opinion, which I found
    striking at the time, was that engineers would transition from writing mostly
    “implementation” code, to mostly writing tests and specifications.

    I remember thinking at the time that this was prescient. With three years of
    hindsight, it seems like things are trending in a different direction. I
    thought that the reason that testing and specifications would be useful was
    that AI agents would be struggling to “grok” coding for quite some time, and
    that you’d need to have robust specifications such that they could stumble
    toward correctness.

    In reality, AI written tests were one of the first tasks I felt comfortable
    delegating. Unit tests are squarely in-distribution for what the models have
    seen on all public open source code. There’s a lot of unit tests in open source
    code, and they follow predictable patterns. I’d expect that the variance of
    implementation code – and the requirement for out-of-distribution patterns –
    is much higher than testing code. The result is that models are now quite good
    at translating English descriptions into quite crisp test cases.1

    System Design

    There exists a higher level problem of holistic system behavior verification,
    though. Let’s take a quick diversion into systems design to see why.

    System design happens on multiple scales. You want systems to be robust – both
    in their runtime, and their ability to iteratively evolve. This nudges towards
    decomposing systems into distinct components, each of which can be internally
    complicated but exposes a firm interface boundary that allows you to abstract
    over this internal complexity.

    If we design things well, we can swap out parts of our system without disrupting
    other parts or harming the top-level description of what the system does. We can
    also perform top-down changes iteratively – adding new components, and retiring
    old ones, at each level of description of the system.

    This all requires careful thinking of how to build these interfaces and
    component boundaries in such a way that (1) there is a clean boundary between
    components and (2) that stringing all the components together actually produces
    the desired top-level behavior.

    To do this effectively, we require maps of various levels of description of the
    system’s
    territory. My
    conjecture is that code is not a good map for this territory.

    To be clear, I’ve found a lot of value in throwing out system diagram maps and
    looking directly at the code territory when debugging issues. However,
    code-level reasoning is often not the best level of abstraction to use for
    reasoning about systems. This is for a similar reason that “modeling all the
    individual molecules of a car” is not a great way to estimate that car’s braking
    distance.

    LLMs have increasingly longer context windows, so one could naively say “just
    throw all the code in the context and have it work it out”. Perhaps. But this is
    still just clearly not the most efficient way to reason about large-scale
    systems.

    Formal Verification

    The promise of formal verification is that we can construct provably composable
    maps which still match the ground-level territory. Formal verification of code
    allows you to specify a system using mathematical proofs, and then exhaustively
    prove that a system is correct. As an analogy: unit tests are like running an
    experiment. Each passing test is an assertion that, for the conditions checked,
    the code is correct. There could still exist some untested input that would
    demonstrate incorrect behavior. You only need one negative test to show the code
    is incorrect, but only a provably exhaustive set of inputs would be sufficient
    to show the code is fully correct. Writing a formal verification of a program is
    more like writing a proof. Writing a self-consistent proof is sufficient to show
    that the properties you’ve proven always hold.

    I saw Martin Kleppmann’s
    “Prediction: AI will make formal verification go mainstream”
    right after I posted
    “The Decline of the Software Drafter?”,
    which became the inspiration for this post. Kleppmann’s argument is that, just
    as the cost of generating code is coming down, so too will the cost of formal
    verification of code:

    For example, as of 2009, the formally verified seL4 microkernel consisted of
    8,700 lines of C code, but proving it correct required 20 person-years and
    200,000 lines of Isabelle code – or 23 lines of proof and half a person-day
    for every single line of implementation. Moreover, there are maybe a few
    hundred people in the world (wild guess) who know how to write such proofs,
    since it requires a lot of arcane knowledge about the proof system.

    …

    If formal verification becomes vastly cheaper, then we can afford to verify
    much more software. But on top of that, AI also creates a need to formally
    verify more software: rather than having humans review AI-generated code, I’d
    much rather have the AI prove to me that the code it has generated is correct.
    If it can do that, I’ll take AI-generated code over handcrafted code (with all
    its artisanal bugs) any day!

    I’ve long been interested in formal verification tools like TLA+ and Rocq (née
    Coq). I haven’t (yet) been able to justify to myself spending all that much time
    on them. I think that’s changing: the cost of writing code is
    coming down dramatically.
    The cost of reviewing
    and maintaining it is also coming down, but at a slower rate. I agree with
    Kleppmann that we need systematic tooling for dealing with this mismatch.

    Wishcasting a future world, I
    would be excited to see something like:

    • One starts with a high-level system specification, in English.
    • This specification is spun out into multiple TLA+ models at various levels
      of component specificity.
    • These models would allow us to determine the components that are
      load-bearing for system correctness.
    • The most critical set of load-bearing components are implemented with a
      corresponding formal verification proof, in something like
      Rocq.
    • The rest of the system components are still audited by an LLM to ensure they
      correctly match the behavior of their associated component in the TLA+ spec.

    The biggest concern to me related to formal verification is the following two
    excerpts, first from Kleppmann, and then from Hillel Wayne, a notable proponent
    of TLA+:

    There are maybe a few hundred people in the world (wild guess) who know how to
    write such proofs, since it requires a lot of arcane knowledge about the proof
    system. –
    Martin Kleppmann

    TLA+ is one of the more popular formal specification languages and you can
    probably fit every TLA+ expert in the world in a large schoolbus. –
    Hillel Wayne

    For formal verification to be useful in practice, at least some of the arcane
    knowledge of its internals will need to be broadly disseminated. Reviewing an
    AI-generated formal spec of a problem won’t be useful if you don’t have enough
    knowledge of the proof system to poke holes in what the AI came up with.

    I’d argue that undergraduate Computer Science programs should allocate some of
    their curriculum to formal verification. After all, students should have more
    time on their hands as they delegate implementation of their homework to AI
    agents.

    Share. Facebook Twitter Pinterest LinkedIn Reddit WhatsApp Telegram Email
    Previous ArticleOliver Sacks fabricated key details in his books
    Next Article Google Removes Sci-Hub Domains from U.S. Search Results Due to Dated Court Order
    TechAiVerse
    • Website

    Jonathan is a tech enthusiast and the mind behind Tech AI Verse. With a passion for artificial intelligence, consumer tech, and emerging innovations, he deliver clear, insightful content to keep readers informed. From cutting-edge gadgets to AI advancements and cryptocurrency trends, Jonathan breaks down complex topics to make technology accessible to all.

    Related Posts

    5 New Electronics You Can Buy At Costco In 2026

    February 4, 2026

    5 Of The Most Frustrating Engine Layouts Ever Designed

    February 4, 2026

    4 Big Changes Coming To ChatGPT In 2026

    February 4, 2026
    Leave A Reply Cancel Reply

    Top Posts

    Ping, You’ve Got Whale: AI detection system alerts ships of whales in their path

    April 22, 2025651 Views

    Lumo vs. Duck AI: Which AI is Better for Your Privacy?

    July 31, 2025245 Views

    6.7 Cummins Lifter Failure: What Years Are Affected (And Possible Fixes)

    April 14, 2025145 Views

    6 Best MagSafe Phone Grips (2025), Tested and Reviewed

    April 6, 2025111 Views
    Don't Miss
    Technology February 4, 2026

    5 New Electronics You Can Buy At Costco In 2026

    5 New Electronics You Can Buy At Costco In 2026 What’s the last thing you…

    5 Of The Most Frustrating Engine Layouts Ever Designed

    4 Big Changes Coming To ChatGPT In 2026

    Changing These 5 Settings Will Instantly Make Your Monitor Look Better

    Stay In Touch
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo

    Subscribe to Updates

    Get the latest creative news from SmartMag about art & design.

    About Us
    About Us

    Welcome to Tech AI Verse, your go-to destination for everything technology! We bring you the latest news, trends, and insights from the ever-evolving world of tech. Our coverage spans across global technology industry updates, artificial intelligence advancements, machine learning ethics, and automation innovations. Stay connected with us as we explore the limitless possibilities of technology!

    Facebook X (Twitter) Pinterest YouTube WhatsApp
    Our Picks

    5 New Electronics You Can Buy At Costco In 2026

    February 4, 20260 Views

    5 Of The Most Frustrating Engine Layouts Ever Designed

    February 4, 20260 Views

    4 Big Changes Coming To ChatGPT In 2026

    February 4, 20260 Views
    Most Popular

    A Team of Female Founders Is Launching Cloud Security Tech That Could Overhaul AI Protection

    March 12, 20250 Views

    7 Best Kids Bikes (2025): Mountain, Balance, Pedal, Coaster

    March 13, 20250 Views

    VTOMAN FlashSpeed 1500: Plenty Of Power For All Your Gear

    March 13, 20250 Views
    © 2026 TechAiVerse. Designed by Divya Tech.
    • Home
    • About Us
    • Contact Us
    • Privacy Policy
    • Terms & Conditions

    Type above and press Enter to search. Press Esc to cancel.