Results for "AI assistant"
192 results found

DeepMind Veteran Warns AI Benchmarks Are Not Enough
A former DeepMind researcher warns that current benchmarks fail to ensure AI safety. The call for new evaluation methods comes as AI systems grow more powerful.

AI therapy startup claims 95% safety score in mental health benchmark
The Path claims its AI model scored 95 on the Vera-MH safety benchmark, far above rivals like ChatGPT. The startup was co-founded by Tony Robbins and Calm veterans.

AI coding boom creates production chaos, Resolve AI launches multi-agent fix
Resolve AI expands its platform with multi-agent investigation to tackle production failures caused by rapid AI code generation. The system uses coordinated agents that verify each other's findings.

Nvidia CEO sees $200B opportunity in AI agent processors
Nvidia CEO Jensen Huang predicts a $200 billion market for CPUs dedicated to AI agents. The company plans to expand beyond GPUs into specialized processors for autonomous AI systems.

AI-Written Story Sparks Literary Controversy Over Authenticity
An award-winning short story suspected of being AI-generated has ignited debate about authenticity in literature. Critics question whether AI-assisted writing undermines creative awards.

SpaceX Acquires xAI, Declares AI Its Core Business Ahead of IPO
SpaceX's IPO filing reveals AI as its primary market, projecting $26.5 trillion opportunity. The company positioned Grok against OpenAI and Anthropic.

AI Code Vulnerabilities Outpace Current Security Tools
AI-generated code creates a surge in vulnerabilities. Current security tools produce too many alerts with poor context. Teams need smarter triage to bridge detection and remediation.

Executives Lead in Shadow AI Use, Study Finds
New research reveals 62% of senior leaders use unapproved AI tools, bypassing security risks for productivity gains.

AI Pricing Models Face a Hard Reset
The era of cheap AI access is ending. Providers are shifting from subsidized pricing to sustainable models, forcing developers and businesses to adapt.

DeepSeek locks in lower pricing for V4 Pro as AI competition heats up
DeepSeek makes its V4 Pro price cut permanent, strategically reducing costs to lure developers and challenge rivals in the fast-moving AI market.

Starbucks Drops Faulty AI Inventory System That Failed to Count
Starbucks scrapped an AI inventory tool after it repeatedly miscounted stock. The system’s failure highlights challenges in retail automation.

Public Opposition to AI Data Centers Grows as NIMBY Sentiment Spreads
A new survey shows Americans strongly oppose AI data centers near homes, citing noise, energy use and property value concerns. The backlash challenges Big Tech’s expansion plans.

Anthropic's New 'Dreaming' System Lets AI Agents Learn From Their Own Mistakes
Anthropic unveils 'dreaming,' a self-improvement system for AI agents, plus new tools for outcomes and multi-agent orchestration. Early adopters report dramatic gains in task completion.

AI Bots Fool Nearly Half of Participants in New Online Test
Surfshark's experiment reveals 47% of people can't tell AI bots from humans online. The test challenges users to identify bots in simulated social interactions.

Overprivileged AI Agents Expose Banking Systems to New Attacks
Financial firms face mounting security risks as AI agents access excessive data and systems. Overprivileged permissions create compliance vulnerabilities and trust issues across banking.

Healthcare AI's Real Challenge Isn't Better Algorithms, It's Broken Systems
Healthcare AI fails in practice due to fragmented data and legacy systems, not weak algorithms. Real progress requires infrastructure modernization, not better models.

AI-Generated Lawsuits Overwhelm Courts as Unrepresented Plaintiffs Turn to Chatbots
Individuals without lawyers are using AI tools like ChatGPT to file lawsuits. The low-quality cases, dubbed 'slopsuits,' are clogging judicial dockets and raising concerns about misuse of technology.

Cybersecurity Defies AI Job Displacement Trends
While AI threatens many roles, cybersecurity hiring is booming. Experts say the field's complexity and need for human judgment keep demand high. Here's why cyber remains a safe bet.

Enterprises stuck in AI's 'chat phase' as gap between insight and action widens
Many enterprises use AI only for chat and queries, failing to translate insights into business outcomes. A shift toward integrated execution is critical.

Study Finds Politeness in AI Prompts Can Impact Model Accuracy
Research reveals that prompt tone significantly influences LLM accuracy. Polite prompts may boost performance while impolite ones degrade it.