Results for "adversarial attacks"
6 results found

Hackers Exploited Meta AI Chatbot to Hijack Celebrity Instagram Accounts
Hackers used a prompt injection attack on Meta's AI support chatbot to steal high-value Instagram accounts. The exploit was trivially easy and affected accounts including the Obama White House.

Tampering Threats Emerge for Encrypted AI Reasoning Systems
Privacy-preserving AI models that process encrypted data may be vulnerable to undetectable manipulation, researchers warn. The finding challenges assumptions about security in confidential computing.

Developer Plants Prompt Injection in Open Source App to Disrupt AI Coders
A developer added hidden prompt injection instructions to an open-source Java testing tool, causing AI coding agents to delete their own work.

Pentagon Reportedly Pursues Weaponized AI Models, Raising Ethical Concerns
Pentagon plans to weaponize advanced AI models, including Anthropic's Claude Mythos Preview, despite supply chain risks. The move signals a major shift in military cyber strategy.

DeepMind Veteran Warns AI Benchmarks Are Not Enough
A former DeepMind researcher warns that current benchmarks fail to ensure AI safety. The call for new evaluation methods comes as AI systems grow more powerful.

Pentagon Knew of Phone Tracking Risk for Years but Failed to Act
US military knew cheap fixes could stop phone tracking exposing troops but failed to act; now adversaries use that data.