Results for "adversarial input"

10 results found

Tiny Bank Transfer Exposes Critical Flaw in Banking AI Agents

A €0.01 transfer can trick banking AI agents into compromising security. Researchers show how a microtransaction becomes an attack vector.

Jun 10, 20262 min read

AI / Machine Learning

Anthropic Launches Claude Fable 5, Offering Public Access to Advanced Mythos AI

Anthropic has released Claude Fable 5, a public-facing version of its Mythos-class AI model. The model includes strict guardrails to prevent responses in sensitive areas like cybersecurity and biology.

Jun 9, 20263 min read

CyberSecurity

Developer Plants Prompt Injection in Open Source App to Disrupt AI Coders

A developer added hidden prompt injection instructions to an open-source Java testing tool, causing AI coding agents to delete their own work.

May 29, 20262 min read

CyberSecurity

AI Worm That Spreads Without Human Interaction Raises Alarm

Researchers created a self-replicating AI worm that can steal data and spread across networks without any user clicks. The worm targets generative AI assistants like ChatGPT and Gemini, posing a new class of cyber threat.

Jun 3, 20263 min read

CyberSecurity

Tampering Threats Emerge for Encrypted AI Reasoning Systems

Privacy-preserving AI models that process encrypted data may be vulnerable to undetectable manipulation, researchers warn. The finding challenges assumptions about security in confidential computing.

Jun 2, 20262 min read

AI / Machine Learning

DeepMind Veteran Warns AI Benchmarks Are Not Enough

A former DeepMind researcher warns that current benchmarks fail to ensure AI safety. The call for new evaluation methods comes as AI systems grow more powerful.

May 22, 20263 min read

Tech Policy & Regulation

Pentagon Reportedly Pursues Weaponized AI Models, Raising Ethical Concerns

Pentagon plans to weaponize advanced AI models, including Anthropic's Claude Mythos Preview, despite supply chain risks. The move signals a major shift in military cyber strategy.

May 21, 20263 min read

CyberSecurity

Hackers Exploited Meta AI Chatbot to Hijack Celebrity Instagram Accounts

Hackers used a prompt injection attack on Meta's AI support chatbot to steal high-value Instagram accounts. The exploit was trivially easy and affected accounts including the Obama White House.

Jun 2, 20262 min read

Tech Policy & Regulation

Pentagon Knew of Phone Tracking Risk for Years but Failed to Act

US military knew cheap fixes could stop phone tracking exposing troops but failed to act; now adversaries use that data.

May 29, 20263 min read

CyberSecurity

AI-Driven Attacks Outpace Enterprise Patching Capabilities

Cyber attackers are exploiting vulnerabilities faster than organizations can patch them, with AI accelerating the window for defense. This shift demands a fundamental rethinking of security strategies.

Jun 10, 20263 min read