Results for "AI coding benchmarks"
104 results found

AI-Written Story Sparks Literary Controversy Over Authenticity
An award-winning short story suspected of being AI-generated has ignited debate about authenticity in literature. Critics question whether AI-assisted writing undermines creative awards.

SpaceX Acquires xAI, Declares AI Its Core Business Ahead of IPO
SpaceX's IPO filing reveals AI as its primary market, projecting $26.5 trillion opportunity. The company positioned Grok against OpenAI and Anthropic.

AI Code Vulnerabilities Outpace Current Security Tools
AI-generated code creates a surge in vulnerabilities. Current security tools produce too many alerts with poor context. Teams need smarter triage to bridge detection and remediation.

Executives Lead in Shadow AI Use, Study Finds
New research reveals 62% of senior leaders use unapproved AI tools, bypassing security risks for productivity gains.

Google's AI Agents Signal End of Traditional Search as We Know It
Google is redefining search by letting AI agents proactively find information without user prompting. This shift could fundamentally change how we interact with the internet.

Wozniak Tells Students: Your Brain Beats AI
Apple co-founder Steve Wozniak drew cheers from students by reminding them they possess actual intelligence, not artificial.

Starbucks Drops Faulty AI Inventory System That Failed to Count
Starbucks scrapped an AI inventory tool after it repeatedly miscounted stock. The system’s failure highlights challenges in retail automation.

Public Opposition to AI Data Centers Grows as NIMBY Sentiment Spreads
A new survey shows Americans strongly oppose AI data centers near homes, citing noise, energy use and property value concerns. The backlash challenges Big Tech’s expansion plans.

Anthropic's New 'Dreaming' System Lets AI Agents Learn From Their Own Mistakes
Anthropic unveils 'dreaming,' a self-improvement system for AI agents, plus new tools for outcomes and multi-agent orchestration. Early adopters report dramatic gains in task completion.

AI Bots Fool Nearly Half of Participants in New Online Test
Surfshark's experiment reveals 47% of people can't tell AI bots from humans online. The test challenges users to identify bots in simulated social interactions.

Overprivileged AI Agents Expose Banking Systems to New Attacks
Financial firms face mounting security risks as AI agents access excessive data and systems. Overprivileged permissions create compliance vulnerabilities and trust issues across banking.

AI Agents Burn Cash: Microsoft, Meta, Amazon Face Token Crisis
Agentic AI consumes up to 1000x more tokens than standard AI, causing budgets to explode. Tech giants are now pulling back as employee 'tokenmaxxing' backfires.

Healthcare AI's Real Challenge Isn't Better Algorithms, It's Broken Systems
Healthcare AI fails in practice due to fragmented data and legacy systems, not weak algorithms. Real progress requires infrastructure modernization, not better models.

AI-Generated Lawsuits Overwhelm Courts as Unrepresented Plaintiffs Turn to Chatbots
Individuals without lawyers are using AI tools like ChatGPT to file lawsuits. The low-quality cases, dubbed 'slopsuits,' are clogging judicial dockets and raising concerns about misuse of technology.

Lawyers Face Sanctions for Using AI-Generated Fake Citations in Facebook Defamation Case
A dismissed defamation lawsuit against Facebook users backfires as lawyers may face sanctions for submitting fake AI-generated citations to support their arguments.

China Robotics Investment Hits Record as Embodied AI Startups Attract Billions
China-based robotics startups raised $5.6 billion through mid-2026, matching the 2021 peak. Embodied AI companies drive the surge, with several startups reaching billion-dollar valuations.

Secretive AI Startup Hark Raises $700M at $6 Billion Valuation
Hark, Brett Adcock's stealth AI startup, raised a massive $700M Series A, valuing the 'universal' interface company at $6 billion.

Google Deploys AI for New Search Ad Formats
Google announced AI-powered ad formats for search at I/O. The move uses generative AI to create more dynamic ads, impacting advertisers and users.

Google's AI Assistants Demand More Personal Data, Raising Trust Questions
Google unveiled always-on AI agents at I/O 2026, but their functionality depends on accessing users' personal data, sparking renewed trust concerns.

Pentagon Reportedly Pursues Weaponized AI Models, Raising Ethical Concerns
Pentagon plans to weaponize advanced AI models, including Anthropic's Claude Mythos Preview, despite supply chain risks. The move signals a major shift in military cyber strategy.