Results for "AI inference"
192 results found

AI IQ site ignites debate by scoring large language models on the bell curve
A startup called AI IQ is assigning IQ scores to over 50 AI models. The project draws praise for clarity and criticism for oversimplifying machine intelligence.

Anthropic Surpasses OpenAI in Corporate AI Adoption for First Time
Anthropic's Claude overtakes OpenAI's ChatGPT in business AI adoption. But escalating costs and competition threaten its lead.

Why Autonomous AI Fails Without a Body-Like Feedback System
AI systems that rely on pure autonomy often fail. A new framework compares AI to the human body, arguing that feedback loops build trust.

Salesforce Turns Slackbot Into a Full AI Agent for the Enterprise
Salesforce rebuilt Slackbot from a simple notification tool into an AI agent that searches data, drafts documents and takes actions, intensifying workplace AI competition.

AI Coding Benchmarks Overlook Long-Term Code Health Risks
Current AI coding benchmarks measure one-shot performance but ignore quality erosion from repeated edits. This oversight could lead to unmaintainable codebases at scale.

AI Outpaces Human Patching, Making Vulnerability Windows Obsolete
AI-powered bug detection finds vulnerabilities faster than humans can patch. The industry shifts from reactive patching to building resilient software from the start.

Anthropic Nears First Profit as AI Race Intensifies
Anthropic is set to report its first profitable quarter since founding in 2021, marking a milestone in the competitive AI landscape.

AI-Generated Content Floods Social Media Platforms
AI-generated content floods social media platforms challenging moderation systems and raising concerns about online authenticity.

AI Benchmark Prompt for GeoGuessr Fails After Model Update
A well-known prompt used to test AI geography skills no longer works on the O3 model, prompting debate about benchmark reliability and model drift.

iOS 27 Siri update brings agentic AI capabilities through accessibility features
Apple's iOS 27 introduces advanced AI voice controls that make Siri more intuitive and proactive, hinting at future agentic AI powers.

AI Critics Call Training Data Practices 'Unauthorized Plagiarism at Scale'
A rising number of critics argue generative AI systems rely on unauthorized copying of copyrighted work, amounting to plagiarism at unprecedented scale. The debate intensifies as lawsuits mount and regulators weigh new rules for training data.

The Case Against AI Skepticism Is Weaker Than You Think
A growing backlash against AI is not just noise. It reflects real concerns about control, labor and culture that the tech industry ignores at its peril.

Anthropic Pledges $15 Billion a Year to SpaceX for AI Compute
Anthropic will pay $15 billion annually to SpaceX for access to its Colossus AI data centers through 2029, per SpaceX's IPO filing.

Workers Shift to Unauthorized AI as Corporate Policies Lag
A new study reveals most employees use unapproved AI tools at work despite known risks, citing poor organizational support.

AI-Driven Cyber Discovery Pushes UK Banks Toward Systemic Risk
UK banks face new systemic cyber risks as AI accelerates vulnerability discovery, threatening financial stability.

Google’s Gemini Voice Push Redefines How We Talk to AI
Google is leaning into voice interaction with Gemini, encouraging users to speak naturally. The shift capitalizes on voice dictation’s popularity and aims to make AI conversations feel human.

New AI Architecture Separates Prompts and Reasoning Into Parallel Streams
Researchers propose Multi-Stream LLMs, splitting prompts, thinking and I/O into parallel processes to boost efficiency and reduce latency.

DeepMind Veteran Warns AI Benchmarks Are Not Enough
A former DeepMind researcher warns that current benchmarks fail to ensure AI safety. The call for new evaluation methods comes as AI systems grow more powerful.

AI therapy startup claims 95% safety score in mental health benchmark
The Path claims its AI model scored 95 on the Vera-MH safety benchmark, far above rivals like ChatGPT. The startup was co-founded by Tony Robbins and Calm veterans.

AI coding boom creates production chaos, Resolve AI launches multi-agent fix
Resolve AI expands its platform with multi-agent investigation to tackle production failures caused by rapid AI code generation. The system uses coordinated agents that verify each other's findings.