Results for "AI evaluation"

420 results found

Bias in Text-to-Image Models Raises Urgent Questions for AI Ethics

A new analysis reveals persistent racial and gender biases in popular text-to-image AI models. The findings underscore the need for more rigorous fairness testing before deployment.

Jun 21, 2026 · 2 min read

Startups / Funding

Listen Labs Raises $69M to Disrupt Market Research With AI Interviews

AI startup Listen Labs raised $69M to scale its AI customer interview platform, replacing surveys with video conversations and detecting fraud in market research.

Jun 21, 2026 · 3 min read

AI / Machine Learning

Realism of AI Ultrasound Images Prompts Medical Misuse Fears

A prominent critic concedes Midjourney generates ultrasound scans indistinguishable from clinical images. The admission highlights risks of medical deepfakes and gaps in regulation.

Jun 22, 2026 · 3 min read

AI / Machine Learning

The Atlantic Publishes Searchable Database of Music Used to Train AI Models

The Atlantic has created a searchable database of music datasets used to train AI models, revealing massive collections of songs. The move increases transparency in AI training data and raises questions about copyright and consent.

Jun 22, 2026 · 2 min read

AI / Machine Learning

Microsoft CSO Warns Human Understanding of AI Is Falling Behind

Microsoft's CSO warns humans face a narrowing window to understand AI as capabilities outpace comprehension. Concerns about oversight and transparency are mounting.

Jun 22, 2026 · 2 min read

AI / Machine Learning

Human Writers Now Fear Sounding Like AI: Study Reveals Avoidance Tactics

A new study shows writers are altering their style to avoid sounding like AI. Many would stop supporting creators who use AI without disclosing it.

Jun 22, 2026 · 3 min read

AI / Machine Learning

London Faces AI Skills Gap as Half of Businesses Report Unprepared Workforce

A new survey reveals half of London companies say their workforce lacks skills to meet AI demands. Businesses plan to boost training investments to close the gap.

Jun 22, 2026 · 2 min read

AI / Machine Learning

Claude Code's 'Extended Thinking' Feature Masks Deeper AI Reasoning Limits

Anthropic's new 'extended thinking' mode for Claude Code is drawing criticism for producing summaries rather than genuine step-by-step reasoning, raising questions about the transparency and reliability of AI-assisted coding tools.

Jun 22, 2026 · 2 min read

AI / Machine Learning

Claude Code Ban Highlights Risks of AI Tool Dependency

A developer's sudden ban from Anthropic's Claude Code raises concerns about opaque AI platform policies and developer reliance on single tools.

Jun 23, 2026 · 2 min read

AI / Machine Learning

AI Law Firm Wins English Court Case in Legal First

Garfield AI, an artificial intelligence law firm, won an English court case for an HR consultant over an unpaid debt. The victory marks an apparent first for AI in legal proceedings.

Jun 23, 2026 · 3 min read

AI / Machine Learning

New AI Method Lift4D Advances Real-World 4D Reconstruction From Single Views

Lift4D harmonizes single-view 3D estimates to produce temporally consistent 4D reconstructions. The method improves accuracy for dynamic scenes, benefiting AR, robotics and content creation.

Jun 23, 2026 · 3 min read

Startups / Funding

AI Agents and Short-Form Video Reshape Hiring as Fika Jobs Raises $4M

Stockholm-based Fika Jobs raises $4M for a video-first hiring platform where AI agents interview candidates and short-form profiles replace resumes.

Jun 23, 2026 · 2 min read

Tech Policy & Regulation

AI-Generated Apartment Photos Fuel Rental Market Deception

AI-powered virtual staging is enabling landlords to show fake apartment photos, wasting renters' time and eroding trust. Calls for regulation grow as deceptive listings become common in competitive housing markets.

Jun 23, 2026 · 3 min read

CyberSecurity

AI Agents Empower Amateur Hackers to Breach 14 Companies in Landmark Attack

An inexperienced hacker used Claude and OpenAI agents to compromise 14 companies, highlighting how AI is lowering the barrier for sophisticated cyberattacks.

Jun 24, 2026 · 3 min read

AI / Machine Learning

OpenAI and Broadcom Unveil Jalapeño Chip for AI Workloads

OpenAI and Broadcom have launched their first jointly developed AI chip, named Jalapeño, designed specifically for large language models. The chip marks a strategic shift toward custom silicon amid soaring demand for AI compute.

Jun 24, 2026 · 2 min read

AI / Machine Learning

Companies Tighten AI Budgets as Workers Waste Tokens on Trivial Tasks

Employers are capping generative AI usage after employees burned through budgets on trivial tasks like email summaries.

Jun 24, 2026 · 2 min read

AI / Machine Learning

AI Unlocks Stoic Philosophy From Vesuvius-Buried Papyrus Scroll

AI has enabled researchers to read a charred scroll from the Vesuvius eruption without unrolling it, revealing Stoic philosophy on ethics and art from the second century BC.

Jun 24, 2026 · 3 min read

Tech Policy & Regulation

Lawyers Face Sanctions for Using AI-Generated Fake Citations in Facebook Defamation Case

A dismissed defamation lawsuit against Facebook users backfires as lawyers may face sanctions for submitting fake AI-generated citations to support their arguments.

May 20, 2026 · 2 min read

Big Tech

Google AI Search Rewrites SEO Rules for Brands

Google now places AI-generated answers above traditional search links, upending decades of SEO strategy. Brands face new challenges with zero visibility into how AI describes them.

May 27, 2026 · 3 min read

Big Tech

Google AI Search Now Pulls Expert Advice From Reddit

Google's AI-powered search results will now include Reddit posts as expert sources. The change aims to improve answer quality but raises questions about content reliability.

May 29, 2026 · 2 min read