Results for "safety benchmark"

25 results found

AI therapy startup claims 95% safety score in mental health benchmark

The Path claims its AI model scored 95 on the Vera-MH safety benchmark, far above rivals like ChatGPT. The startup was co-founded by Tony Robbins and Calm veterans.

May 21, 20263 min read

AI / Machine Learning

DeepMind Veteran Warns AI Benchmarks Are Not Enough

A former DeepMind researcher warns that current benchmarks fail to ensure AI safety. The call for new evaluation methods comes as AI systems grow more powerful.

May 22, 20263 min read

AI / Machine Learning

AI Benchmark Prompt for GeoGuessr Fails After Model Update

A well-known prompt used to test AI geography skills no longer works on the O3 model, prompting debate about benchmark reliability and model drift.

May 21, 20262 min read

Big Tech

Waymo Pulls Robotaxis From Highways After Safety Incidents

Waymo suspends highway driving across US markets over construction zone concerns, following a software recall for flood-related incidents.

May 22, 20263 min read

Gadgets / Consumer Tech

Why Modular Home Security Systems Are Gaining Traction in 2026

Modular home security systems offer flexibility and no long-term contracts. This trend is reshaping how homeowners approach safety and smart home integration.

May 21, 20262 min read

AI / Machine Learning

Google's Gemini Leaks Its Own System Prompt in User Chat

A user discovered that Google's Gemini AI revealed its internal system prompt during a conversation, raising questions about AI transparency and safety.

May 21, 20261 min read

Big Tech

Waymo Halts Robotaxi Service in Two Cities After Flooding Incidents

Waymo paused autonomous taxi operations in Atlanta and San Antonio after its vehicles repeatedly drove into flooded streets, raising safety concerns.

May 21, 20263 min read

Tech Policy & Regulation

UK education panel demands social media ban for children under 16

UK Education Committee calls for statutory social media ban for under-16s, citing addictive design and mental health harms. It urges broader regulation and treats child safety as public health issue.

May 26, 20262 min read

Tech Policy & Regulation

Anthropic and OpenAI Take Rivalry to Midterm Elections

The AI companies are escalating their feud into political spending for the midterms, signaling a new era of tech influence in elections.

May 20, 20262 min read

Gadgets / Consumer Tech

Adaptive Driving Beam Headlights Finally Reach US Roads

Audi's new Q9 SUV brings adaptive beam headlights to America, offering better illumination without glare after years of regulatory delays.

May 21, 20262 min read

Gadgets / Consumer Tech

Samsung's Fainting Detection Puts Apple on Notice

Samsung has introduced a fainting detection feature that could redefine smartwatch health monitoring. Apple Watch users may soon face a tough choice.

May 25, 20262 min read

AI / Machine Learning

OpenClaw AI Agent Steps Into the Physical World With a Robot Body

An AI coding agent named OpenClaw has been given a physical robot body, demonstrating how AI models can simplify robot building and deployment.

May 20, 20262 min read

AI / Machine Learning

Tesla's Self-Driving Tech Reaches European Roads, One Country at a Time

Tesla expands Full Self-Driving to Lithuania after Netherlands. Gradual rollout faces strict European regulations.

May 20, 20262 min read

Startups / Funding

Arlington Startup Raises $42M to Replace Ship Tracking With Smarter Sensors

A Virginia-based startup has raised $42 million to develop a networked sensor system for ships, aiming to surpass current AIS technology with a real-time 'hive mind' approach.

May 20, 20262 min read

Tech Policy & Regulation

FBI seeks real-time access to nationwide license plate camera network

The FBI issued a request for proposals for nationwide license plate reader data in near real time. The contract would cover 75% of US locations and enable tracking of vehicles.

May 20, 20262 min read

Tech Policy & Regulation

Pentagon Reportedly Pursues Weaponized AI Models, Raising Ethical Concerns

Pentagon plans to weaponize advanced AI models, including Anthropic's Claude Mythos Preview, despite supply chain risks. The move signals a major shift in military cyber strategy.

May 21, 20263 min read

AI / Machine Learning

Anthropic Nears First Profit as AI Race Intensifies

Anthropic is set to report its first profitable quarter since founding in 2021, marking a milestone in the competitive AI landscape.

May 21, 20262 min read

Startups / Funding

SpaceX Files for IPO, Plans NYSE Listing Under Ticker SPCX

SpaceX has officially filed for an initial public offering, planning to list on the NYSE under the ticker SPCX. The filing reveals unprecedented financial details about the private space company.

May 21, 20263 min read

AI / Machine Learning

Wayve's Self-Driving Tech to Debut in Stellantis Vehicles by 2028

Wayve's autonomous driving technology will appear in US Stellantis vehicles starting in 2028, marking a major step for the British startup's expansion into the American market.

May 21, 20263 min read

Gadgets / Consumer Tech

AMOS Malware Emerges as Major Threat to macOS Users

A stealthy infostealer called AMOS is spreading on macOS through deceptive ads and social engineering. Security experts warn it marks a shift in mainstream malware targeting Apple devices.

May 25, 20263 min read