Results for "model drift"
125 results found

AI Benchmark Prompt for GeoGuessr Fails After Model Update
A well-known prompt used to test AI geography skills no longer works on the O3 model, prompting debate about benchmark reliability and model drift.

Star Citizen Hits $1 Billion Crowdfunding Milestone, Still in Early Access
Star Citizen has raised $1 billion from backers but remains in early access after nine years of development, sparking debate about crowdfunding risks.

Salesforce Turns Slackbot Into a Full AI Agent for the Enterprise
Salesforce rebuilt Slackbot from a simple notification tool into an AI agent that searches data, drafts documents and takes actions, intensifying workplace AI competition.

Google's AI Assistants Demand More Personal Data, Raising Trust Questions
Google unveiled always-on AI agents at I/O 2026, but their functionality depends on accessing users' personal data, sparking renewed trust concerns.

Pentagon Reportedly Pursues Weaponized AI Models, Raising Ethical Concerns
Pentagon plans to weaponize advanced AI models, including Anthropic's Claude Mythos Preview, despite supply chain risks. The move signals a major shift in military cyber strategy.

AI-Generated Lawsuits Overwhelm Courts as Unrepresented Plaintiffs Turn to Chatbots
Individuals without lawyers are using AI tools like ChatGPT to file lawsuits. The low-quality cases, dubbed 'slopsuits,' are clogging judicial dockets and raising concerns about misuse of technology.

Grok's Government Adoption Lags, Undermining xAI's Growth Story
Grok appears in only 3 of 400+ government AI use cases per Reuters. The low adoption undercuts xAI's growth story tied to a potential massive SpaceX IPO.

Anthropic and OpenAI Take Rivalry to Midterm Elections
The AI companies are escalating their feud into political spending for the midterms, signaling a new era of tech influence in elections.

Lawyers Face Sanctions for Using AI-Generated Fake Citations in Facebook Defamation Case
A dismissed defamation lawsuit against Facebook users backfires as lawyers may face sanctions for submitting fake AI-generated citations to support their arguments.

Open-source coding model NousCoder-14B matches big rivals in just 4 days
An open-source AI coding model trained in four days matches proprietary systems, highlighting the rapid progress of open-source alternatives in AI-assisted software development.

Cerebras wafer-scale chip runs trillion-parameter model 7x faster than GPU clouds
Cerebras claims its wafer-scale chip runs a trillion-parameter AI model nearly seven times faster than GPU-based clouds, challenging Nvidia's dominance in inference.

Study Finds Politeness in AI Prompts Can Impact Model Accuracy
Research reveals that prompt tone significantly influences LLM accuracy. Polite prompts may boost performance while impolite ones degrade it.

AI therapy startup claims 95% safety score in mental health benchmark
The Path claims its AI model scored 95 on the Vera-MH safety benchmark, far above rivals like ChatGPT. The startup was co-founded by Tony Robbins and Calm veterans.

Google's Gemini 3.5 Flash Reshapes Enterprise AI Cost Equation
Google claims its new Gemini 3.5 Flash model can save enterprises over $1 billion annually by delivering near-frontier performance at triple the speed and half the cost.

Lumix L10's High Price Opens Door for Budget Camera Alternative
Panasonic's new Lumix L10 impresses but carries a steep price. An overlooked model offers comparable features at half the cost, shifting value in mirrorless cameras.

Flipper Devices Targets Network Tinkerers With New Linux Gadget
Flipper Devices announces a Linux-powered networking gadget for hackers and hobbyists. The base model will cost under $350.

Anker Soundcore Launches Two New Premium Earbuds With AI Translation and Dolby Atmos
Anker's Soundcore released two new premium earbuds. Testing shows the cheaper model is the better buy for most users despite the higher-end version having superior sound and features.

OpenRouter Hits $1.3B Valuation After $113M Series B Round
OpenRouter raised $113M in Series B funding led by CapitalG, more than doubling its valuation to $1.3B. Usage surged 5x in six months, signaling the rise of multi-AI-model platforms.

Sony's Most Expensive Wireless Headphones Deliver Stunning Audio but Carry Big Trade-Offs
Sony's flagship wireless headphones offer top-tier audio but suffer from comfort issues and a high price. Are they worth the premium?

Samsung Reportedly Planning Smaller Galaxy S27 Pro with Flagship Specs
Samsung may launch a Galaxy S27 Pro next year, offering Ultra-level features in a compact body. Report hints at a smaller screen but identical core specs.