Results for "adversarial prompts"

7 results found

AI Worm That Spreads Without Human Interaction Raises Alarm

Researchers created a self-replicating AI worm that can steal data and spread across networks without any user clicks. The worm targets generative AI assistants like ChatGPT and Gemini, posing a new class of cyber threat.

Jun 3, 20263 min read

CyberSecurity

Developer Plants Prompt Injection in Open Source App to Disrupt AI Coders

A developer added hidden prompt injection instructions to an open-source Java testing tool, causing AI coding agents to delete their own work.

May 29, 20262 min read

CyberSecurity

Hackers Exploited Meta AI Chatbot to Hijack Celebrity Instagram Accounts

Hackers used a prompt injection attack on Meta's AI support chatbot to steal high-value Instagram accounts. The exploit was trivially easy and affected accounts including the Obama White House.

Jun 2, 20262 min read

AI / Machine Learning

DeepMind Veteran Warns AI Benchmarks Are Not Enough

A former DeepMind researcher warns that current benchmarks fail to ensure AI safety. The call for new evaluation methods comes as AI systems grow more powerful.

May 22, 20263 min read

Tech Policy & Regulation

Pentagon Reportedly Pursues Weaponized AI Models, Raising Ethical Concerns

Pentagon plans to weaponize advanced AI models, including Anthropic's Claude Mythos Preview, despite supply chain risks. The move signals a major shift in military cyber strategy.

May 21, 20263 min read

CyberSecurity

Tampering Threats Emerge for Encrypted AI Reasoning Systems

Privacy-preserving AI models that process encrypted data may be vulnerable to undetectable manipulation, researchers warn. The finding challenges assumptions about security in confidential computing.

Jun 2, 20262 min read

Tech Policy & Regulation

Pentagon Knew of Phone Tracking Risk for Years but Failed to Act

US military knew cheap fixes could stop phone tracking exposing troops but failed to act; now adversaries use that data.

May 29, 20263 min read