Close Menu

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Xiaomi 17 Ultra Officially Launched: Leica 1-Inch Camera, Snapdragon 8 Elite Gen 5 and 6,800mAh Battery Push Flagship Limits

    I failed at spotting AI slop videos. Can you do better?

    I can’t believe I’m saying this, but I would buy DLC for Windows 11

    Facebook X (Twitter) Instagram
    • Artificial Intelligence
    • Business Technology
    • Cryptocurrency
    • Gadgets
    • Gaming
    • Health
    • Software and Apps
    • Technology
    Facebook X (Twitter) Instagram Pinterest Vimeo
    Tech AI Verse
    • Home
    • Artificial Intelligence

      AI has become the norm for students. Teachers are playing catch-up.

      December 23, 2025

      Trump signs executive order seeking to ban states from regulating AI companies

      December 13, 2025

      Apple’s AI chief abruptly steps down

      December 3, 2025

      The issue that’s scrambling both parties: From the Politics Desk

      December 3, 2025

      More of Silicon Valley is building on free Chinese AI

      December 1, 2025
    • Business

      Top 10 cloud computing stories of 2025

      December 22, 2025

      Saudia Arabia’s STC commits to five-year network upgrade programme with Ericsson

      December 18, 2025

      Zeroday Cloud hacking event awards $320,0000 for 11 zero days

      December 18, 2025

      Amazon: Ongoing cryptomining campaign uses hacked AWS accounts

      December 18, 2025

      Want to back up your iPhone securely without paying the Apple tax? There’s a hack for that, but it isn’t for everyone… yet

      December 16, 2025
    • Crypto

      Yield Basis (YB) Gains 17% After Securing Upbit Listing

      December 26, 2025

      The Biggest Options Expiry Ever—What $27 Billion Means for Bitcoin and Ethereum

      December 26, 2025

      TRON Network Hits Record User Growth as TRX Price Faces Worst Q4 Decline

      December 26, 2025

      4chan Trader Who Nailed Bitcoin’s October All-Time High Calls $250,000 in 2026

      December 26, 2025

      Ethereum ETFs Bleed for 2 Weeks, But This Key Level Retest Could Flip the Script

      December 26, 2025
    • Technology

      I failed at spotting AI slop videos. Can you do better?

      December 26, 2025

      I can’t believe I’m saying this, but I would buy DLC for Windows 11

      December 26, 2025

      Use Microsoft Excel data types to save unnecessary typing

      December 26, 2025

      A lifetime license for this PDF editor is now only $25

      December 26, 2025

      Office + Windows 11 Pro for $40 is the best add-on to your Christmas PC

      December 26, 2025
    • Others
      • Gadgets
      • Gaming
      • Health
      • Software and Apps
    Check BMI
    Tech AI Verse
    You are at:Home»Technology»New approach to agent reliability, AgentSpec, forces agents to follow rules
    Technology

    New approach to agent reliability, AgentSpec, forces agents to follow rules

    TechAiVerseBy TechAiVerseMarch 29, 2025No Comments4 Mins Read0 Views
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr Email Reddit
    New approach to agent reliability, AgentSpec, forces agents to follow rules
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp Email

    New approach to agent reliability, AgentSpec, forces agents to follow rules

    March 28, 2025 1:05 PM

    Credit: VentureBeat, generated with Midjourney

    Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More


    AI agents have safety and reliability problems. Although agents would allow enterprises to automate more steps in their workflows, they can take unintended actions while executing a task, are not very flexible and are difficult to control.

    Organizations have already raised the alarm about unreliable agents, worried that once deployed, agents might forget to follow instructions. 

    OpenAI even admitted that ensuring agent reliability would involve working with outside developers, so it opened up its Agents SDK to help solve this issue. 

    However, Singapore Management University (SMU) researchers have developed a new approach to solving agent reliability.

    AgentSpec is a domain-specific framework that lets users “define structured rules that incorporate triggers, predicates and enforcement mechanisms.” The researchers said AgentSpec will make agents work only within the parameters that users want.

    Guiding LLM-based agents with a new approach

    AgentSpec is not a new large language model (LLM) but rather an approach to guide LLM-based AI agents. The researchers believe AgentSpec can be used for agents in enterprise settings and self-driving applications.   

    The first AgentSpec tests integrated on LangChain frameworks, but the researchers said they designed it to be framework-agnostic, meaning it can also run on AutoGen and Apollo ecosystems. 

    Experiments using AgentSpec showed it prevented “over 90% of unsafe code executions, ensures full compliance in autonomous driving law-violation scenarios, eliminates hazardous actions in embodied agent tasks and operates with millisecond-level overhead.” LLM-generated AgentSpec rules, which used OpenAI’s o1, also had a strong performance and enforced 87% of risky code and prevented “law-breaking in 5 out of 8 scenarios.”

    Current methods are a little lacking

    AgentSpec is not the only method for helping developers give agents more control and reliability. Other approaches include ToolEmu and GuardAgent. The startup Galileo launched Agentic Evaluations, a way to ensure agents work as intended.

    The open-source platform H2O.ai uses predictive models to improve the accuracy of agents used by companies in finance, healthcare, telecommunications and government. 

    The AgentSpec said researchers said current approaches to mitigate risks, like ToolEmu, effectively identify risks. They noted that “these methods lack interpretability and offer no mechanism for safety enforcement, making them susceptible to adversarial manipulation.” 

    Using AgentSpec

    AgentSpec works as a runtime enforcement layer for agents. It intercepts the agent’s behavior while executing tasks and adds safety rules set by humans or generated by prompts.

    Since AgentSpec is a custom domain-specific language, users must define the safety rules. There are three components to this: the first is the trigger, which lays out when to activate the rule; the second is to check to add conditions; and the third is enforce, which enforces actions to take if the rule is violated. 

    AgentSpec is built on LangChain, though, as previously stated, the researchers said AgentSpec can also be integrated into other frameworks like AutoGen or the autonomous vehicle software stack Apollo. 

    These frameworks orchestrate the steps agents need to take by taking in the user input, creating an execution plan, observing the result, and then deciding if the action was completed and, if not, planning the next step. AgentSpec adds rule enforcement into this flow. 

    “Before an action is executed, AgentSpec evaluates predefined constraints to ensure compliance, modifying the agent’s behavior when necessary. Specifically, AgentSpec hooks into three key decision points: before an action is executed (AgentAction), after an action produces an observation (AgentStep), and when the agent completes its task (AgentFinish). These points provide a structured way to intervene without altering the core logic of the agent,” the paper states. 

    More reliable agents

    Approaches like AgentSpec underscore the need for reliable agents for enterprise use. As organizations begin to plan their agentic strategy, tech decision leaders also look at ways to ensure reliability. 

    For many, agents will eventually autonomously and proactively do tasks for users. The idea of ambient agents, where AI agents and apps continuously run in the background and trigger themselves to execute actions, would require agents that do not stray from their path and accidentally introduce non-safe actions. 

    If ambient agents are where agentic AI will go in the future, expect more methods like AgentSpec to proliferate as companies seek to make AI agents continuously reliable. 

    Daily insights on business use cases with VB Daily

    If you want to impress your boss, VB Daily has you covered. We give you the inside scoop on what companies are doing with generative AI, from regulatory shifts to practical deployments, so you can share insights for maximum ROI.

    Read our Privacy Policy

    Thanks for subscribing. Check out more VB newsletters here.

    An error occured.

    Share. Facebook Twitter Pinterest LinkedIn Reddit WhatsApp Telegram Email
    Previous ArticleCountering nation-state cyber espionage: A CISO field guide
    Next Article SAG-AFTRA union creates deal for students and game jam devs to work with acting talent
    TechAiVerse
    • Website

    Jonathan is a tech enthusiast and the mind behind Tech AI Verse. With a passion for artificial intelligence, consumer tech, and emerging innovations, he deliver clear, insightful content to keep readers informed. From cutting-edge gadgets to AI advancements and cryptocurrency trends, Jonathan breaks down complex topics to make technology accessible to all.

    Related Posts

    I failed at spotting AI slop videos. Can you do better?

    December 26, 2025

    I can’t believe I’m saying this, but I would buy DLC for Windows 11

    December 26, 2025

    Use Microsoft Excel data types to save unnecessary typing

    December 26, 2025
    Leave A Reply Cancel Reply

    Top Posts

    Ping, You’ve Got Whale: AI detection system alerts ships of whales in their path

    April 22, 2025541 Views

    Lumo vs. Duck AI: Which AI is Better for Your Privacy?

    July 31, 2025191 Views

    6.7 Cummins Lifter Failure: What Years Are Affected (And Possible Fixes)

    April 14, 202594 Views

    6 Best MagSafe Phone Grips (2025), Tested and Reviewed

    April 6, 202586 Views
    Don't Miss
    Gadgets December 26, 2025

    Xiaomi 17 Ultra Officially Launched: Leica 1-Inch Camera, Snapdragon 8 Elite Gen 5 and 6,800mAh Battery Push Flagship Limits

    Xiaomi 17 Ultra Officially Launched: Leica 1-Inch Camera, Snapdragon 8 Elite Gen 5 and 6,800mAh…

    I failed at spotting AI slop videos. Can you do better?

    I can’t believe I’m saying this, but I would buy DLC for Windows 11

    Use Microsoft Excel data types to save unnecessary typing

    Stay In Touch
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo

    Subscribe to Updates

    Get the latest creative news from SmartMag about art & design.

    About Us
    About Us

    Welcome to Tech AI Verse, your go-to destination for everything technology! We bring you the latest news, trends, and insights from the ever-evolving world of tech. Our coverage spans across global technology industry updates, artificial intelligence advancements, machine learning ethics, and automation innovations. Stay connected with us as we explore the limitless possibilities of technology!

    Facebook X (Twitter) Pinterest YouTube WhatsApp
    Our Picks

    Xiaomi 17 Ultra Officially Launched: Leica 1-Inch Camera, Snapdragon 8 Elite Gen 5 and 6,800mAh Battery Push Flagship Limits

    December 26, 20250 Views

    I failed at spotting AI slop videos. Can you do better?

    December 26, 20252 Views

    I can’t believe I’m saying this, but I would buy DLC for Windows 11

    December 26, 20252 Views
    Most Popular

    What to Know and Where to Find Apple Intelligence Summaries on iPhone

    March 12, 20250 Views

    A Team of Female Founders Is Launching Cloud Security Tech That Could Overhaul AI Protection

    March 12, 20250 Views

    Senua’s Saga: Hellblade 2 leads BAFTA Game Awards 2025 nominations

    March 12, 20250 Views
    © 2025 TechAiVerse. Designed by Divya Tech.
    • Home
    • About Us
    • Contact Us
    • Privacy Policy
    • Terms & Conditions

    Type above and press Enter to search. Press Esc to cancel.