Results for "AI safety"
115 results found

AI therapy startup claims 95% safety score in mental health benchmark
The Path claims its AI model scored 95 on the Vera-MH safety benchmark, far above rivals like ChatGPT. The startup was co-founded by Tony Robbins and Calm veterans.

DeepMind Veteran Warns AI Benchmarks Are Not Enough
A former DeepMind researcher warns that current benchmarks fail to ensure AI safety. The call for new evaluation methods comes as AI systems grow more powerful.

Why Autonomous AI Fails Without a Body-Like Feedback System
AI systems that rely on pure autonomy often fail. A new framework compares AI to the human body, arguing that feedback loops build trust.

Google's Gemini Leaks Its Own System Prompt in User Chat
A user discovered that Google's Gemini AI revealed its internal system prompt during a conversation, raising questions about AI transparency and safety.

Anthropic and OpenAI Take Rivalry to Midterm Elections
The AI companies are escalating their feud into political spending for the midterms, signaling a new era of tech influence in elections.

AI Benchmark Prompt for GeoGuessr Fails After Model Update
A well-known prompt used to test AI geography skills no longer works on the O3 model, prompting debate about benchmark reliability and model drift.

OpenClaw AI Agent Steps Into the Physical World With a Robot Body
An AI coding agent named OpenClaw has been given a physical robot body, demonstrating how AI models can simplify robot building and deployment.

Anthropic Nears First Profit as AI Race Intensifies
Anthropic is set to report its first profitable quarter since founding in 2021, marking a milestone in the competitive AI landscape.

Pentagon Reportedly Pursues Weaponized AI Models, Raising Ethical Concerns
Pentagon plans to weaponize advanced AI models, including Anthropic's Claude Mythos Preview, despite supply chain risks. The move signals a major shift in military cyber strategy.

YouTube to Automatically Label AI-Generated Videos
YouTube will automatically label videos using realistic AI. The policy shift moves beyond creator self-disclosure.

Waymo Pulls Robotaxis From Highways After Safety Incidents
Waymo suspends highway driving across US markets over construction zone concerns, following a software recall for flood-related incidents.

US Law Enforcement Targets 'Anti-Tech Extremism' as AI Backlash Intensifies
Federal agencies shift focus to surveil anti-technology extremists amid growing AI protests and attacks.

Waymo Halts Robotaxi Service in Two Cities After Flooding Incidents
Waymo paused autonomous taxi operations in Atlanta and San Antonio after its vehicles repeatedly drove into flooded streets, raising safety concerns.

UK education panel demands social media ban for children under 16
UK Education Committee calls for statutory social media ban for under-16s, citing addictive design and mental health harms. It urges broader regulation and treats child safety as public health issue.

Tesla's Self-Driving Tech Reaches European Roads, One Country at a Time
Tesla expands Full Self-Driving to Lithuania after Netherlands. Gradual rollout faces strict European regulations.

Wayve's Self-Driving Tech to Debut in Stellantis Vehicles by 2028
Wayve's autonomous driving technology will appear in US Stellantis vehicles starting in 2028, marking a major step for the British startup's expansion into the American market.

Google Redesigns Search Around AI With Dynamic Interface and Agentic Tools
Google is overhauling Search with AI-powered features including a dynamic search box and autonomous agents that complete tasks. The changes signal a fundamental shift in how users interact with the world's dominant search engine.

Google's Gemini 3.5 Flash Reshapes Enterprise AI Cost Equation
Google claims its new Gemini 3.5 Flash model can save enterprises over $1 billion annually by delivering near-frontier performance at triple the speed and half the cost.

AI IQ site ignites debate by scoring large language models on the bell curve
A startup called AI IQ is assigning IQ scores to over 50 AI models. The project draws praise for clarity and criticism for oversimplifying machine intelligence.

Anthropic Surpasses OpenAI in Corporate AI Adoption for First Time
Anthropic's Claude overtakes OpenAI's ChatGPT in business AI adoption. But escalating costs and competition threaten its lead.