Meta defends using pirated material, claims it's legal if you don't seed content

Meta logo
(Image credit: Shutterstock)

Meta claimed in a court filing this week that despite torrenting an 82 TB dataset of pirated, copyrighted material from shadow libraries to train its LLaMA AI models, that employees "took precautions not to "seed" any downloaded files".

The act of Seeding in torrenting terminology refers to sharing a file with other users during, (or commonly after) downloading it. Since torrenting is a peer-to-peer system, every user downloading a file can also upload parts of it to other users.

Authors allege Meta was "knowing participant" in "illegal peer-to-peer piracy network"

Authors of the copyrighted material alleged to have been obtained by Meta without prior licensing agreements have alleged [PDF] that "Meta's decision to bypass lawful acquisition methods and become a knowing participant in an illegal peer-to-peer piracy network".

With the court battle expected to continue, no final decision around the case has been made. Even following a final decision, it's expected that Meta will attempt to appeal the decision if they were to lose, meaning that final judgements could be a long while away.

But, similar cases do exist. OpenAI was sued by novelists in 2023, with the New York Times also suing OpenAI and Microsoft over "millions" copied news articles. As the long list of LLM-related litigation continues, this is likely not going to be the last we hear from Meta's specific case.

Sayem Ahmed
Subscription Editor

Sayem Ahmed is the Subscription Editor at Tom's Hardware. He covers a broad range of deep dives into hardware both new and old, including the CPUs, GPUs, and everything else that uses a semiconductor.