Results for "AI evaluation"
420 results found

OpenAI Launches Initiative to Automate Bug Fixing for Open Source Projects
OpenAI's Daybreak program introduces Patch the Planet, an initiative using AI to help open source projects identify and patch vulnerabilities faster, addressing a critical gap in software supply chain security.

Honor 600 Review: Blazing Display and Epic Battery in a Mid-Range Phone
The Honor 600 delivers a flagship-like display and exceptional battery life at a mid-range price. Its AI features impress but occasionally feel uncanny.

Leaked iOS 27 Renders Show Siri's Biggest Redesign Yet
Bloomberg renders reveal a ChatGPT-like Siri interface in iOS 27, with a pill-shaped chat bubble and new AI features. Apple is expected to unveil the overhaul at WWDC in June.

Lowfat CLI Tool Cuts LLM Token Usage by 91.8%
A new open-source CLI filter called Lowfat claims to reduce LLM token consumption by over 91%, offering developers significant cost savings on AI API calls.

Microsoft Pushes Developer Tools Deeper Into Quantum and Containers at Build 2026
At Build 2026, Microsoft unveiled quantum development tools, container runtime upgrades, and new AI features for developers. The announcements signal a shift toward hybrid cloud and high-performance computing.

BT Partners With Anthropic to Defend Networks Against Cyberattacks
BT becomes the first UK company to join Anthropic's Project Glasswing, using the Claude Mythos Preview AI model to protect its networks from evolving cyber threats.

Rising PC Prices Signal End of Budget Laptops as Memory Costs Surge
Memory shortages have pushed PC prices up by double digits in Europe. Analysts warn that sub-$500 laptops could disappear by 2026 as AI demand diverts chip supply.

UK Scientists Testify: No Causal Proof Smartphones Harm Children's Brains
Neuroscientists told UK MPs that evidence of smartphones rewiring children's brains is mostly correlational. Researchers called for more studies on social media and AI chatbots.

NHS England Deploys Microsoft Copilot to Over Half a Million Staff
NHS England is rolling out Microsoft 365 Copilot to 505,000 staff after a successful pilot. The AI tools aim to improve service delivery and reduce costs.

Seedcamp Hits $1B AUM With $320M in New Funds for Early-Stage Startups
European seed investor Seedcamp closed on $320 million across two funds, reaching $1 billion in assets under management. The firm plans to back seed-stage startups with a focus on AI and deep tech.

Tesla's Self-Driving Tech Reaches European Roads, One Country at a Time
Tesla expands Full Self-Driving to Lithuania after Netherlands. Gradual rollout faces strict European regulations.

Wayve's Self-Driving Tech to Debut in Stellantis Vehicles by 2028
Wayve's autonomous driving technology will appear in US Stellantis vehicles starting in 2028, marking a major step for the British startup's expansion into the American market.

ChatGPT Mac App Vulnerability Patched After Security Flaw Found
A security flaw in the ChatGPT Mac app could have exposed conversations. OpenAI says no data was accessed and the issue is now fixed.

A New Lens on Business: Companies as Networks of Algorithms
Viewing companies as graphs of algorithms reveals new opportunities for automation and efficiency. This perspective challenges traditional management models.

ChatGPT Adds Safety Feature to Alert Trusted Contacts During Crisis
OpenAI lets users nominate a trusted contact ChatGPT can alert if it detects self-harm risk. The opt-in feature adds a safety net for vulnerable users.

Claude Code's Hidden Configuration Options Reveal Deeper Developer Control
A developer has documented undocumented configuration settings for Anthropic's Claude Code tool, revealing advanced customization options beyond official docs.

OpenAI Rolls Out More Factual ChatGPT Model With Better Personalization
OpenAI has updated ChatGPT's default model to GPT-5.5 Instant, claiming improved accuracy and tailored responses. The change takes effect immediately for all users.

Bots Surpass Human Internet Traffic for First Time, Cloudflare CEO Says
Cloudflare CEO Matthew Prince reports that automated bot traffic has overtaken human traffic online for the first time, arriving a year earlier than predictions.

OpenAI Upgrades ChatGPT Memory for Free Users, Closing Gap With Paid Tiers
OpenAI improved ChatGPT memory, especially for free users. The chatbot now better retains conversation context across sessions. This closes a key gap between free and paid tiers.

Why Viral Humanoid Robot Videos Mislead the Public
Viral videos of humanoid robots performing impressive feats often mask a gap between demonstrations and real-world reliability. Experts warn that anthropomorphism can lead to misleading assumptions about robot capabilities.