Results for "model drift"

125 results found

AI Benchmark Prompt for GeoGuessr Fails After Model Update

A well-known prompt used to test AI geography skills no longer works on the O3 model, prompting debate about benchmark reliability and model drift.

May 21, 20262 min read

Startups / Funding

Star Citizen Hits $1 Billion Crowdfunding Milestone, Still in Early Access

Star Citizen has raised $1 billion from backers but remains in early access after nine years of development, sparking debate about crowdfunding risks.

May 25, 20263 min read

AI / Machine Learning

Salesforce Turns Slackbot Into a Full AI Agent for the Enterprise

Salesforce rebuilt Slackbot from a simple notification tool into an AI agent that searches data, drafts documents and takes actions, intensifying workplace AI competition.

May 19, 20262 min read

Big Tech

Google's AI Assistants Demand More Personal Data, Raising Trust Questions

Google unveiled always-on AI agents at I/O 2026, but their functionality depends on accessing users' personal data, sparking renewed trust concerns.

May 20, 20262 min read

Tech Policy & Regulation

Pentagon Reportedly Pursues Weaponized AI Models, Raising Ethical Concerns

Pentagon plans to weaponize advanced AI models, including Anthropic's Claude Mythos Preview, despite supply chain risks. The move signals a major shift in military cyber strategy.

May 21, 20263 min read

Tech Policy & Regulation

AI-Generated Lawsuits Overwhelm Courts as Unrepresented Plaintiffs Turn to Chatbots

Individuals without lawyers are using AI tools like ChatGPT to file lawsuits. The low-quality cases, dubbed 'slopsuits,' are clogging judicial dockets and raising concerns about misuse of technology.

May 26, 20263 min read

AI / Machine Learning

Grok's Government Adoption Lags, Undermining xAI's Growth Story

Grok appears in only 3 of 400+ government AI use cases per Reuters. The low adoption undercuts xAI's growth story tied to a potential massive SpaceX IPO.

May 22, 20262 min read

Tech Policy & Regulation

Anthropic and OpenAI Take Rivalry to Midterm Elections

The AI companies are escalating their feud into political spending for the midterms, signaling a new era of tech influence in elections.

May 20, 20262 min read

Tech Policy & Regulation

Lawyers Face Sanctions for Using AI-Generated Fake Citations in Facebook Defamation Case

A dismissed defamation lawsuit against Facebook users backfires as lawyers may face sanctions for submitting fake AI-generated citations to support their arguments.

May 20, 20262 min read

AI / Machine Learning

Open-source coding model NousCoder-14B matches big rivals in just 4 days

An open-source AI coding model trained in four days matches proprietary systems, highlighting the rapid progress of open-source alternatives in AI-assisted software development.

May 19, 20262 min read

AI / Machine Learning

Cerebras wafer-scale chip runs trillion-parameter model 7x faster than GPU clouds

Cerebras claims its wafer-scale chip runs a trillion-parameter AI model nearly seven times faster than GPU-based clouds, challenging Nvidia's dominance in inference.

May 20, 20263 min read

AI / Machine Learning

Study Finds Politeness in AI Prompts Can Impact Model Accuracy

Research reveals that prompt tone significantly influences LLM accuracy. Polite prompts may boost performance while impolite ones degrade it.

May 27, 20262 min read

AI / Machine Learning

AI therapy startup claims 95% safety score in mental health benchmark

The Path claims its AI model scored 95 on the Vera-MH safety benchmark, far above rivals like ChatGPT. The startup was co-founded by Tony Robbins and Calm veterans.

May 21, 20263 min read

AI / Machine Learning

Google's Gemini 3.5 Flash Reshapes Enterprise AI Cost Equation

Google claims its new Gemini 3.5 Flash model can save enterprises over $1 billion annually by delivering near-frontier performance at triple the speed and half the cost.

May 20, 20262 min read

Gadgets / Consumer Tech

Lumix L10's High Price Opens Door for Budget Camera Alternative

Panasonic's new Lumix L10 impresses but carries a steep price. An overlooked model offers comparable features at half the cost, shifting value in mirrorless cameras.

May 21, 20262 min read

Gadgets / Consumer Tech

Flipper Devices Targets Network Tinkerers With New Linux Gadget

Flipper Devices announces a Linux-powered networking gadget for hackers and hobbyists. The base model will cost under $350.

May 21, 20262 min read

Gadgets / Consumer Tech

Anker Soundcore Launches Two New Premium Earbuds With AI Translation and Dolby Atmos

Anker's Soundcore released two new premium earbuds. Testing shows the cheaper model is the better buy for most users despite the higher-end version having superior sound and features.

May 23, 20264 min read

Startups / Funding

OpenRouter Hits $1.3B Valuation After $113M Series B Round

OpenRouter raised $113M in Series B funding led by CapitalG, more than doubling its valuation to $1.3B. Usage surged 5x in six months, signaling the rise of multi-AI-model platforms.

May 26, 20262 min read

Gadgets / Consumer Tech

Sony's Most Expensive Wireless Headphones Deliver Stunning Audio but Carry Big Trade-Offs

Sony's flagship wireless headphones offer top-tier audio but suffer from comfort issues and a high price. Are they worth the premium?

May 20, 20262 min read

Gadgets / Consumer Tech

Samsung Reportedly Planning Smaller Galaxy S27 Pro with Flagship Specs

Samsung may launch a Galaxy S27 Pro next year, offering Ultra-level features in a compact body. Report hints at a smaller screen but identical core specs.

May 21, 20262 min read