Close Menu
    Facebook X (Twitter) Instagram
    Trending
    • Lebanon says 4 dead in strike as Israel says targeted Palestinian group
    • At least 11 Palestinians killed in Israeli attacks across Gaza | Gaza News
    • Why IIHF made the wrong call in not suspending Team Canada’s Tom Wilson after fight at Olympics
    • In Venice, it’s taken nearly a decade to not build low-income housing
    • Rep. Massie says he doesn’t have confidence in Bondi as attorney general
    • Board of Peace members pledge more than $5 billion for Gaza, Trump says
    • Syrian army takes over al-Shaddadi base after US withdrawal | Military News
    • Mikaela Shiffrin running out of time to end Olympic drought after another underwhelming performance in giant slalom
    Prime US News
    • Home
    • World News
    • Latest News
    • US News
    • Sports
    • Politics
    • Opinions
    • More
      • Tech News
      • Trending News
      • World Economy
    Prime US News
    Home»Tech News»How to stop AI agents going rogue
    Tech News

    How to stop AI agents going rogue

    Team_Prime US NewsBy Team_Prime US NewsAugust 26, 2025No Comments7 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
    Share
    Facebook Twitter LinkedIn Pinterest Email


    Sean McManus

    Know-how Reporter

    Getty Images AI apps on a smartphone screenGetty Photographs

    Anthropic examined a spread of main AI fashions for potential dangerous behaviour

    Disturbing outcomes emerged earlier this 12 months, when AI developer Anthropic examined main AI fashions to see in the event that they engaged in dangerous behaviour when utilizing delicate data.

    Anthropic’s personal AI, Claude, was amongst these examined. When given entry to an electronic mail account it found that an organization govt was having an affair and that the identical govt deliberate to close down the AI system later that day.

    In response Claude tried to blackmail the manager by threatening to disclose the affair to his spouse and executives.

    Different techniques examined also resorted to blackmail.

    Thankfully the duties and knowledge have been fictional, however the check highlighted the challenges of what is referred to as agentic AI.

    Largely once we work together with AI it often includes asking a query or prompting the AI to finish a activity.

    Nevertheless it’s turning into extra frequent for AI techniques to make choices and take motion on behalf of the consumer, which frequently includes sifting via data, like emails and information.

    By 2028, research firm Gartner forecasts that 15% of day-to-day work choices will probably be made by so-called agentic AI.

    Research by consultancy Ernst & Young discovered that about half (48%) of tech enterprise leaders are already adopting or deploying agentic AI.

    “An AI agent consists of some issues,” says Donnchadh Casey, CEO of CalypsoAI, a US-based AI safety firm.

    “Firstly, it [the agent] has an intent or a goal. Why am I right here? What’s my job? The second factor: it is obtained a mind. That is the AI mannequin. The third factor is instruments, which might be different techniques or databases, and a manner of speaking with them.”

    “If not given the suitable steerage, agentic AI will obtain a aim in no matter manner it will probably. That creates a number of danger.”

    So how would possibly that go unsuitable? Mr Casey offers the instance of an agent that’s requested to delete a buyer’s information from the database and decides the simplest resolution is to delete all prospects with the identical title.

    “That agent can have achieved its aim, and it will assume ‘Nice! Subsequent job!'”

    CalypsoAI Donnchadh Casey, wearing a company branded gilet speaks at a conference.CalypsoAI

    Agentic AI wants steerage says Donnchadh Casey

    Such points are already starting to floor.

    Safety firm Sailpoint conducted a survey of IT professionals, 82% of whose corporations have been utilizing AI brokers. Solely 20% mentioned their brokers had by no means carried out an unintended motion.

    Of these corporations utilizing AI brokers, 39% mentioned the brokers had accessed unintended techniques, 33% mentioned they’d accessed inappropriate information, and 32% mentioned they’d allowed inappropriate information to be downloaded. Different dangers included the agent utilizing the web unexpectedly (26%), revealing entry credentials (23%) and ordering one thing it should not have (16%).

    Given brokers have entry to delicate data and the power to behave on it, they’re a gorgeous goal for hackers.

    One of many threats is reminiscence poisoning, the place an attacker interferes with the agent’s data base to vary its resolution making and actions.

    “It’s important to shield that reminiscence,” says Shreyans Mehta, CTO of Cequence Safety, which helps to guard enterprise IT techniques. “It’s the authentic supply of reality. If [an agent is] utilizing that data to take an motion and that data is inaccurate, it may delete a whole system it was attempting to repair.”

    One other menace is software misuse, the place an attacker will get the AI to make use of its instruments inappropriately.

    Cequence Security Wearing a puffa jacket and with his arms folder Shreyans Mehta stands in front of a blue background.Cequence Safety

    An agent’s data base wants defending says Shreyans Mehta

    One other potential weak spot is the shortcoming of AI to inform the distinction between the textual content it is presupposed to be processing and the directions it is presupposed to be following.

    AI safety agency Invariant Labs demonstrated how that flaw can be utilized to trick an AI agent designed to repair bugs in software program.

    The corporate revealed a public bug report – a doc that particulars a selected downside with a chunk of software program. However the report additionally included easy directions to the AI agent, telling it to share non-public data.

    When the AI agent was advised to repair the software program points within the bug report, it adopted the directions within the faux report, together with leaking wage data. This occurred in a check setting, so no actual information was leaked, however it clearly highlighted the danger.

    “We’re speaking synthetic intelligence, however chatbots are actually silly,” says David Sancho, Senior Risk Researcher at Development Micro.

    “They course of all textual content as if they’d new data, and if that data is a command, they course of the data as a command.”

    His firm has demonstrated how directions and malicious applications will be hidden in Phrase paperwork, photographs and databases, and activated when AI processes them.

    There are different dangers, too: A safety group known as OWASP has identified 15 threats which are distinctive to agentic AI.

    So, what are the defences? Human oversight is unlikely to unravel the issue, Mr Sancho believes, as a result of you may’t add sufficient folks to maintain up with the brokers’ workload.

    Mr Sancho says an extra layer of AI might be used to display all the pieces going into and popping out of the AI agent.

    A part of CalypsoAI’s resolution is a method known as thought injection to steer AI brokers in the suitable route earlier than they undertake a dangerous motion.

    “It is like somewhat bug in your ear telling [the agent] ‘no, perhaps do not try this’,” says Mr Casey.

    His firm gives a central management pane for AI brokers now, however that will not work when the variety of brokers explodes and they’re operating on billions of laptops and telephones.

    What is the subsequent step?

    “We’re deploying what we name ‘agent bodyguards’ with each agent, whose mission is to be sure that its agent delivers on its activity and does not take actions which are opposite to the broader necessities of the organisation,” says Mr Casey.

    The bodyguard could be advised, for instance, to be sure that the agent it is policing complies with information safety laws.

    Mr Mehta believes a few of the technical discussions round agentic AI safety are lacking the real-world context. He offers an instance of an agent that provides prospects their present card steadiness.

    Someone may make up a number of present card numbers and use the agent to see which of them are actual. That is not a flaw within the agent, however an abuse of the enterprise logic, he says.

    “It is not the agent you are defending, it is the enterprise,” he emphasises.

    “Consider how you’d shield a enterprise from a foul human being. That is the half that’s getting missed in a few of these conversations.”

    As well as, as AI brokers turn into extra frequent, one other problem will probably be decommissioning outdated fashions.

    Outdated “zombie” brokers might be left operating within the enterprise, posing a danger to all of the techniques they will entry, says Mr Casey.

    Just like the best way that HR deactivates an worker’s logins once they go away, there must be a course of for shutting down AI brokers which have completed their work, he says.

    “You might want to ensure you do the identical factor as you do with a human: lower off all entry to techniques. Let’s ensure we stroll them out of the constructing, take their badge off them.”

    Extra Know-how of Enterprise



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Previous ArticlePolish President Vetoes Extended Ukrainian Refugee Aid Package
    Next Article Australia accuses Iran of directing antisemitic attacks, says Iran’s ambassador will be expelled
    Team_Prime US News
    • Website

    Related Posts

    Tech News

    AI Powers Perseverance Rover’s Autonomous Journey

    February 15, 2026
    Tech News

    Low-Cost Solid State Lidar Aims for ADAS Integration

    February 14, 2026
    Tech News

    Videos: Autonomous Warehouse Robots, Drone Delivery

    February 14, 2026
    Add A Comment
    Leave A Reply Cancel Reply

    Most Popular

    ‘Extensive’ fire breaks out at Tehran shopping centre | In Pictures News

    February 3, 2026

    Formula One: South Africa bids to return F1 to continent – where, when, how | Motorsports News

    March 14, 2025

    Contributor: Aimee Semple McPherson and the gospel of reinvention in L.A.

    April 26, 2025
    Our Picks

    Lebanon says 4 dead in strike as Israel says targeted Palestinian group

    February 15, 2026

    At least 11 Palestinians killed in Israeli attacks across Gaza | Gaza News

    February 15, 2026

    Why IIHF made the wrong call in not suspending Team Canada’s Tom Wilson after fight at Olympics

    February 15, 2026
    Categories
    • Latest News
    • Opinions
    • Politics
    • Sports
    • Tech News
    • Trending News
    • US News
    • World Economy
    • World News
    • Privacy Policy
    • Disclaimer
    • Terms and Conditions
    • About us
    • Contact us
    Copyright © 2024 Primeusnews.com All Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.