Results for "safety benchmark"
25 results found

AI therapy startup claims 95% safety score in mental health benchmark
The Path claims its AI model scored 95 on the Vera-MH safety benchmark, far above rivals like ChatGPT. The startup was co-founded by Tony Robbins and Calm veterans.

DeepMind Veteran Warns AI Benchmarks Are Not Enough
A former DeepMind researcher warns that current benchmarks fail to ensure AI safety. The call for new evaluation methods comes as AI systems grow more powerful.

AI Benchmark Prompt for GeoGuessr Fails After Model Update
A well-known prompt used to test AI geography skills no longer works on the O3 model, prompting debate about benchmark reliability and model drift.

Waymo Pulls Robotaxis From Highways After Safety Incidents
Waymo suspends highway driving across US markets over construction zone concerns, following a software recall for flood-related incidents.

Why Modular Home Security Systems Are Gaining Traction in 2026
Modular home security systems offer flexibility and no long-term contracts. This trend is reshaping how homeowners approach safety and smart home integration.

Google's Gemini Leaks Its Own System Prompt in User Chat
A user discovered that Google's Gemini AI revealed its internal system prompt during a conversation, raising questions about AI transparency and safety.

Waymo Halts Robotaxi Service in Two Cities After Flooding Incidents
Waymo paused autonomous taxi operations in Atlanta and San Antonio after its vehicles repeatedly drove into flooded streets, raising safety concerns.

UK education panel demands social media ban for children under 16
UK Education Committee calls for statutory social media ban for under-16s, citing addictive design and mental health harms. It urges broader regulation and treats child safety as public health issue.

Anthropic and OpenAI Take Rivalry to Midterm Elections
The AI companies are escalating their feud into political spending for the midterms, signaling a new era of tech influence in elections.

Adaptive Driving Beam Headlights Finally Reach US Roads
Audi's new Q9 SUV brings adaptive beam headlights to America, offering better illumination without glare after years of regulatory delays.

Samsung's Fainting Detection Puts Apple on Notice
Samsung has introduced a fainting detection feature that could redefine smartwatch health monitoring. Apple Watch users may soon face a tough choice.

OpenClaw AI Agent Steps Into the Physical World With a Robot Body
An AI coding agent named OpenClaw has been given a physical robot body, demonstrating how AI models can simplify robot building and deployment.

Tesla's Self-Driving Tech Reaches European Roads, One Country at a Time
Tesla expands Full Self-Driving to Lithuania after Netherlands. Gradual rollout faces strict European regulations.

Arlington Startup Raises $42M to Replace Ship Tracking With Smarter Sensors
A Virginia-based startup has raised $42 million to develop a networked sensor system for ships, aiming to surpass current AIS technology with a real-time 'hive mind' approach.

FBI seeks real-time access to nationwide license plate camera network
The FBI issued a request for proposals for nationwide license plate reader data in near real time. The contract would cover 75% of US locations and enable tracking of vehicles.

Pentagon Reportedly Pursues Weaponized AI Models, Raising Ethical Concerns
Pentagon plans to weaponize advanced AI models, including Anthropic's Claude Mythos Preview, despite supply chain risks. The move signals a major shift in military cyber strategy.

Anthropic Nears First Profit as AI Race Intensifies
Anthropic is set to report its first profitable quarter since founding in 2021, marking a milestone in the competitive AI landscape.

SpaceX Files for IPO, Plans NYSE Listing Under Ticker SPCX
SpaceX has officially filed for an initial public offering, planning to list on the NYSE under the ticker SPCX. The filing reveals unprecedented financial details about the private space company.

Wayve's Self-Driving Tech to Debut in Stellantis Vehicles by 2028
Wayve's autonomous driving technology will appear in US Stellantis vehicles starting in 2028, marking a major step for the British startup's expansion into the American market.

AMOS Malware Emerges as Major Threat to macOS Users
A stealthy infostealer called AMOS is spreading on macOS through deceptive ads and social engineering. Security experts warn it marks a shift in mainstream malware targeting Apple devices.