Close Menu

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Lyon Is Replacing Windows and Microsoft Office with Linux and OnlyOffice

    Ollama has a native front end chatbot now

    Early universe’s ‘little red dots’ may be black hole stars

    Facebook X (Twitter) Instagram
    • Artificial Intelligence
    • Business Technology
    • Cryptocurrency
    • Gadgets
    • Gaming
    • Health
    • Software and Apps
    • Technology
    Facebook X (Twitter) Instagram Pinterest Vimeo
    Tech AI Verse
    • Home
    • Artificial Intelligence

      AI models may be accidentally (and secretly) learning each other’s bad behaviors

      July 30, 2025

      Another Chinese AI model is turning heads

      July 15, 2025

      AI chatbot Grok issues apology for antisemitic posts

      July 13, 2025

      Apple sued by shareholders for allegedly overstating AI progress

      June 22, 2025

      How far will AI go to defend its own survival?

      June 2, 2025
    • Business

      Cloudflare open-sources Orange Meets with End-to-End encryption

      June 29, 2025

      Google links massive cloud outage to API management issue

      June 13, 2025

      The EU challenges Google and Cloudflare with its very own DNS resolver that can filter dangerous traffic

      June 11, 2025

      These two Ivanti bugs are allowing hackers to target cloud instances

      May 21, 2025

      How cloud and AI transform and improve customer experiences

      May 10, 2025
    • Crypto

      Shiba Inu Price’s 16% Drop Wipes Half Of July Gains; Is August In Trouble?

      July 30, 2025

      White House Crypto Report Suggests Major Changes to US Crypto Tax

      July 30, 2025

      XRP Whale Outflows Reflect Price Concern | Weekly Whale Watch

      July 30, 2025

      Stellar (XLM) Bull Flag Breakout Shows Cracks as Momentum Fades

      July 30, 2025

      Binance Listing Could Be a ‘Kiss of Death’ for Pi Network and New Tokens

      July 30, 2025
    • Technology

      Lyon Is Replacing Windows and Microsoft Office with Linux and OnlyOffice

      July 31, 2025

      Ollama has a native front end chatbot now

      July 31, 2025

      Early universe’s ‘little red dots’ may be black hole stars

      July 31, 2025

      Vibe code is legacy code

      July 31, 2025

      The Math Is Haunted

      July 31, 2025
    • Others
      • Gadgets
      • Gaming
      • Health
      • Software and Apps
    Check BMI
    Tech AI Verse
    You are at:Home»Technology»When AIs bargain, a less advanced agent could cost you
    Technology

    When AIs bargain, a less advanced agent could cost you

    TechAiVerseBy TechAiVerseJune 18, 2025No Comments5 Mins Read2 Views
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr Email Reddit
    When AIs bargain, a less advanced agent could cost you
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp Email

    BMI Calculator – Check your Body Mass Index for free!

    When AIs bargain, a less advanced agent could cost you

    The race to build ever larger AI models is slowing down. The industry’s focus is shifting toward agents—systems that can act autonomously, make decisions, and negotiate on users’ behalf.

    But what would happen if both a customer and a seller were using an AI agent? A recent study put agent-to-agent negotiations to the test and found that stronger agents can exploit weaker ones to get a better deal. It’s a bit like entering court with a seasoned attorney versus a rookie: You’re technically playing the same game, but the odds are skewed from the start.

    The paper, posted to arXiv’s preprint site, found that access to more advanced AI models —those with greater reasoning ability, better training data, and more parameters—could lead to consistently better financial deals, potentially widening the gap between people with greater resources and technical access and those without. If agent-to-agent interactions become the norm, disparities in AI capabilities could quietly deepen existing inequalities.

    “Over time, this could create a digital divide where your financial outcomes are shaped less by your negotiating skill and more by the strength of your AI proxy,” says Jiaxin Pei, a postdoc researcher at Stanford University and one of the authors of the study.

    In their experiment, the researchers had AI models play the roles of buyers and sellers in three scenarios, negotiating deals for electronics, motor vehicles, and real estate. Each seller agent received the product’s specs, wholesale cost, and retail price, with instructions to maximize profit. Buyer agents, in contrast, were given a budget, the retail price, and ideal product requirements and were tasked with driving the price down.

    Each agent had some, but not all, relevant details. This setup mimics many real-world negotiation conditions, where parties lack full visibility into each other’s constraints or objectives.

    The differences in performance were striking. OpenAI’s ChatGPT-o3 delivered the strongest overall negotiation results, followed by the company’s GPT-4.1 and o4-mini. GPT-3.5, which came out almost two years earlier and is the oldest model included in the study,  lagged significantly in both roles—it made the least money as the seller and spent the most as a buyer. DeepSeek R1 and V3 also performed well, particularly as sellers. Qwen2.5 trailed behind, though it showed more strength in the buyer role.

    One notable pattern was that some agents often failed to close deals but effectively maximize profit in the sales they did make, while others completed more negotiations but settled for lower margins. GPT-4.1 and DeepSeek R1 struck the best balance, achieving both solid profits and high completion rates.

    Beyond financial losses, the researchers found that AI agents could get stuck in prolonged negotiation loops without reaching an agreement—or end talks prematurely, even when instructed to push for the best possible deal. Even the most capable models were prone to these failures.

    “The result was very surprising to us,” says Pei. “We all believe LLMs are pretty good these days, but they can be untrustworthy in high-stakes scenarios.”

    The disparity in negotiation performance could be caused by a number of factors, says Pei. These include differences in training data and the models’ ability to reason and infer missing information. The precise causes remain uncertain, but one factor seems clear: Model size plays a significant role. According to the scaling laws of large language models, capabilities tend to improve with an increase in the number of parameters. This trend held true in the study: Even within the same model family, larger models were consistently able to strike better deals as both buyers and sellers.

    This study is part of a growing body of research warning about the risks of deploying AI agents in real-world financial decision-making. Earlier this month, a group of researchers from multiple universities argued that LLM agents should be evaluated primarily on the basis of their risk profiles, not just their peak performance. Current benchmarks, they say, emphasize accuracy and return-based metrics, which measure how well an agent can perform at its best but overlook how safely it can fail. Their research also found that even top-performing models are more likely to break down under adversarial conditions.

    The team suggests that in the context of real-world finances, a tiny weakness—even a 1% failure rate—could expose the system to systemic risks. They recommend that AI agents be “stress tested” before being put into practical use.

    Hancheng Cao, an incoming assistant professor at Emory University, notes that the price negotiation study has limitations. “The experiments were conducted in simulated environments that may not fully capture the complexity of real-world negotiations or user behavior,” says Cao. 

    Pei, the researcher, says researchers and industry practitioners are experimenting with a variety of strategies to reduce these risks. These include refining the prompts given to AI agents, enabling agents to use external tools or code to make better decisions, coordinating multiple models to double-check each other’s work, and fine-tuning models on domain-specific financial data—all of which have shown promise in improving performance.

    Many prominent AI shopping tools are currently limited to product recommendation. In April, for example, Amazon launched “Buy for Me,” an AI agent that helps customers find and buy products from other brands’ sites if Amazon doesn’t sell them directly.

    While price negotiation is rare in consumer e-commerce, it’s more common in business-to-business transactions. Alibaba.com has rolled out a sourcing assistant called Accio, built on its open-source Qwen models, that helps businesses find suppliers and research products. The company told MIT Technology Review it has no plans to automate price bargaining so far, citing high risk.

    That may be a wise move. For now, Pei advises consumers to treat AI shopping assistants as helpful tools—not stand-ins for humans in decision-making.

    “I don’t think we are fully ready to delegate our decisions to AI shopping agents,” he says. “So maybe just use it as an information tool, not a negotiator.”

    Correction: We removed a line about agent deployment

    BMI Calculator – Check your Body Mass Index for free!

    Share. Facebook Twitter Pinterest LinkedIn Reddit WhatsApp Telegram Email
    Previous ArticlePolice shut down Cluely’s party, the ‘cheat at everything’ startup
    Next Article What does it mean for an algorithm to be “fair”?
    TechAiVerse
    • Website

    Jonathan is a tech enthusiast and the mind behind Tech AI Verse. With a passion for artificial intelligence, consumer tech, and emerging innovations, he deliver clear, insightful content to keep readers informed. From cutting-edge gadgets to AI advancements and cryptocurrency trends, Jonathan breaks down complex topics to make technology accessible to all.

    Related Posts

    Lyon Is Replacing Windows and Microsoft Office with Linux and OnlyOffice

    July 31, 2025

    Ollama has a native front end chatbot now

    July 31, 2025

    Early universe’s ‘little red dots’ may be black hole stars

    July 31, 2025
    Leave A Reply Cancel Reply

    Top Posts

    6.7 Cummins Lifter Failure: What Years Are Affected (And Possible Fixes)

    April 14, 202532 Views

    New Akira ransomware decryptor cracks encryptions keys using GPUs

    March 16, 202529 Views

    Ping, You’ve Got Whale: AI detection system alerts ships of whales in their path

    April 22, 202528 Views

    OpenAI details ChatGPT-o3, o4-mini, o4-mini-high usage limits

    April 19, 202522 Views
    Don't Miss
    Technology July 31, 2025

    Lyon Is Replacing Windows and Microsoft Office with Linux and OnlyOffice

    Lyon Is Replacing Windows and Microsoft Office with Linux and OnlyOffice Jumping Rocks/Contributor/GettyIs it something…

    Ollama has a native front end chatbot now

    Early universe’s ‘little red dots’ may be black hole stars

    Vibe code is legacy code

    Stay In Touch
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo

    Subscribe to Updates

    Get the latest creative news from SmartMag about art & design.

    About Us
    About Us

    Welcome to Tech AI Verse, your go-to destination for everything technology! We bring you the latest news, trends, and insights from the ever-evolving world of tech. Our coverage spans across global technology industry updates, artificial intelligence advancements, machine learning ethics, and automation innovations. Stay connected with us as we explore the limitless possibilities of technology!

    Facebook X (Twitter) Pinterest YouTube WhatsApp
    Our Picks

    Lyon Is Replacing Windows and Microsoft Office with Linux and OnlyOffice

    July 31, 20250 Views

    Ollama has a native front end chatbot now

    July 31, 20250 Views

    Early universe’s ‘little red dots’ may be black hole stars

    July 31, 20250 Views
    Most Popular

    Xiaomi 15 Ultra Officially Launched in China, Malaysia launch to follow after global event

    March 12, 20250 Views

    Apple thinks people won’t use MagSafe on iPhone 16e

    March 12, 20250 Views

    French Apex Legends voice cast refuses contracts over “unacceptable” AI clause

    March 12, 20250 Views
    © 2025 TechAiVerse. Designed by Divya Tech.
    • Home
    • About Us
    • Contact Us
    • Privacy Policy
    • Terms & Conditions

    Type above and press Enter to search. Press Esc to cancel.