Results for "adversarial prompts"
7 results found

AI Worm That Spreads Without Human Interaction Raises Alarm
Researchers created a self-replicating AI worm that can steal data and spread across networks without any user clicks. The worm targets generative AI assistants like ChatGPT and Gemini, posing a new class of cyber threat.

Developer Plants Prompt Injection in Open Source App to Disrupt AI Coders
A developer added hidden prompt injection instructions to an open-source Java testing tool, causing AI coding agents to delete their own work.

Hackers Exploited Meta AI Chatbot to Hijack Celebrity Instagram Accounts
Hackers used a prompt injection attack on Meta's AI support chatbot to steal high-value Instagram accounts. The exploit was trivially easy, and the affected accounts included the Obama White House account.

DeepMind Veteran Warns AI Benchmarks Are Not Enough
A former DeepMind researcher warns that current benchmarks fail to ensure AI safety. The call for new evaluation methods comes as AI systems grow more powerful.

Pentagon Reportedly Pursues Weaponized AI Models, Raising Ethical Concerns
The Pentagon reportedly plans to weaponize advanced AI models, including Anthropic's Claude Mythos Preview, despite supply chain risks. The move signals a major shift in military cyber strategy.

Tampering Threats Emerge for Encrypted AI Reasoning Systems
Privacy-preserving AI models that process encrypted data may be vulnerable to undetectable manipulation, researchers warn. The finding challenges assumptions about security in confidential computing.

Pentagon Knew of Phone Tracking Risk for Years but Failed to Act
The US military knew for years that cheap fixes could stop the phone tracking that exposes troops, but it failed to act. Adversaries now use that data.