A coalition of book publishers has filed a class action lawsuit against Meta and its CEO Mark Zuckerberg. The suit accuses the company of copyright infringement. Publishers claim Meta used their copyrighted works without authorization to train its Llama AI model.

According to the complaint, Meta scraped text from millions of books. This data was then used to train the large language model that powers Llama. The publishers argue this violates copyright law and undermines their business.

Allegations of Unauthorized Scraping

The lawsuit centers on how Meta obtained training data for Llama. The plaintiffs allege that Meta systematically copied copyrighted books from various sources. This included both fiction and nonfiction titles from major publishers.

Meta did not seek permission or pay licensing fees, the publishers claim. The company then incorporated this material into Llama's training dataset. The AI could now reproduce or paraphrase copyrighted content without authorization.

This case echoes similar lawsuits against other AI companies. OpenAI faces multiple copyright challenges over ChatGPT's training data. The tech industry is under growing scrutiny for how it builds generative AI systems.

Why This Matters

This lawsuit could set a legal precedent for AI training practices. If the court rules against Meta, it may force companies to license data more transparently. Authors and publishers could gain stronger control over how their works are used by AI developers.

The case also affects consumers who use Llama-powered products. If Meta loses, it might need to retrain its model or pay damages. This could delay new AI features or increase costs for users.

The publishing industry faces existential questions. AI systems that generate text threaten to replace writers and reduce book sales. This lawsuit is a direct challenge to the unlicensed use of creative works in AI development.

Meta has not yet filed a formal response. The company previously argued that training AI on public data falls under fair use. Courts have not yet resolved this debate. This case could provide one of the first major rulings on the issue.