DeFi Daily News
Sunday, June 1, 2025
Advertisement
  • Cryptocurrency
    • Bitcoin
    • Ethereum
    • Altcoins
    • DeFi-IRA
  • DeFi
    • NFT
    • Metaverse
    • Web 3
  • Finance
    • Business Finance
    • Personal Finance
  • Markets
    • Crypto Market
    • Stock Market
    • Analysis
  • Other News
    • World & US
    • Politics
    • Entertainment
    • Tech
    • Sports
    • Health
  • Videos
No Result
View All Result
DeFi Daily News
  • Cryptocurrency
    • Bitcoin
    • Ethereum
    • Altcoins
    • DeFi-IRA
  • DeFi
    • NFT
    • Metaverse
    • Web 3
  • Finance
    • Business Finance
    • Personal Finance
  • Markets
    • Crypto Market
    • Stock Market
    • Analysis
  • Other News
    • World & US
    • Politics
    • Entertainment
    • Tech
    • Sports
    • Health
  • Videos
No Result
View All Result
DeFi Daily News
No Result
View All Result
Home DeFi Web 3

rewrite this title Zuckerberg Knowingly Used Pirated Data to Train Meta AI, Authors Allege – Decrypt

Jose Antonio Lanz by Jose Antonio Lanz
January 10, 2025
in Web 3
0 0
0
rewrite this title Zuckerberg Knowingly Used Pirated Data to Train Meta AI, Authors Allege – Decrypt
0
SHARES
0
VIEWS
Share on FacebookShare on TwitterShare on Telegram
Listen to this article


rewrite this content using a minimum of 1000 words and keep HTML tags

Mark Zuckerberg approved using pirated books to train Meta AI, even after his own team warned the material was illegally obtained, a group of authors allege in a recent court filing.

The allegations come from a copyright infringement lawsuit filed by a group of authors including the comedian Sarah Silverman, Christopher Golden, and Richard Kadrey in a California federal court in July 2023. The group claimed Meta misused their books to train its Llama LLM, and they’re asking for damages and an injunction to stop Meta from using their works. The judge in the case dismissed most of the author’s claims in November of that same year, but these recent allegations may breathe new life into the legal dispute.

“Meta’s CEO, Mark Zuckerberg, approved Meta’s use of the LibGen dataset notwithstanding concerns within Meta’s AI executive team (and others at Meta) that LibGen is ‘a dataset we know to be pirated,'” lawyers for the plaintiffs said in a Wednesday filing. Despite these red flags, the lawsuit alleges that, “after escalation,” Zuckerberg gave the green light for Meta’s AI team to proceed with using the controversial dataset.

Representatives for Meta did not immediately respond to Decrypt’s request for comment.

LibGen, short for Library Genesis, is an online platform that provides free access to books, academic papers, articles, and other written publications without properly abiding by copyright laws. It operates as a “shadow library,” offering these materials without authorization from publishers or copyright holders. It currently hosts over 33 million books and over 85 million articles.

The lawsuit alleges Meta tried to keep this under wraps until the last possible moment. Just two hours before the fact discovery deadline on December 13, 2024, the company dumped what plaintiffs describe as “some of the most incriminating internal documents it has produced to date.”

Meta’s own engineers seemed uncomfortable with the plan, according to statements in court filings. The group of authors allege internal messages show Meta engineers hesitated to download the pirated material, with one noting that “torrenting from a [Meta-owned] corporate laptop doesn’t feel right (smile emoji).” Nevertheless, they proceeded to not only download the books but also systematically strip out copyright information to prepare them for AI training, the lawsuit claims.

The latest filings in the lawsuit paint a picture of a company fully aware of the risks: One internal memo warned that “media coverage suggesting we have used a dataset we know to be pirated, such as LibGen, may undermine our negotiating position with regulators.” Yet Meta went ahead anyway, both downloading and distributing (or “seeding”) the pirated content through torrenting networks by January 2024, according to the lawsuit.

When questioned about these activities in a deposition, Zuckerberg appeared to distance himself from the decision, testifying that such piracy would raise “lots of red flags” and “seems like a bad thing.”

The court documents also suggest that Meta’s approach to handling copyrighted information paid more attention to model training than copyright rules. According to the filing, one engineer “filtered […] copyright lines and other data out of LibGen to prepare a CMI-stripped version of it to train Llama.” This systematic removal of copyright information could strengthen the authors’ claims that Meta knowingly tried to hide its use of pirated materials.

The revelations come at a crucial time for Meta’s AI ambitions. The company has been pushing hard to compete with OpenAI and Google in the AI space, with Llama 3.2 being the most popular open source LLM, and Meta AI being a solid free competitor to ChatGPT with similar features.

Most of these AI companies are facing legal battles due to their questionable practices when it comes to training their large language models. Meta was already sued by another group of authors for copyright infringements, OpenAI is currently facing different lawsuits for training its LLMs on copyrighted material, and Anthropic is also facing different accusations from authors and songwriters.

But in general the tech entrepreneurs and creators have been up in arms ever since generative AI exploded in popularity. There are currently dozens of different lawsuits against AI companies for willingly using copyrighted material to train their models. But as with most things on the bleeding edge, we’ll have to wait and see what the courts have to say about it all.

Generally Intelligent Newsletter

A weekly AI journey narrated by Gen, a generative AI model.

and include conclusion section that’s entertaining to read. do not include the title. Add a hyperlink to this website http://defi-daily.com and label it “DeFi Daily News” for more trending news articles like this



Source link

Tags: allegeAuthorsdataDecryptKnowinglyMetaPiratedrewritetitleTrainzuckerberg
ShareTweetShare
Previous Post

rewrite this title Kenya Drafts Policy to Legalize Cryptocurrencies, Expand Digital Economy

Next Post

rewrite this title *HOT* Under Armour Backpacks as low as $11.71 shipped!

Next Post
rewrite this title *HOT* Under Armour Backpacks as low as .71 shipped!

rewrite this title *HOT* Under Armour Backpacks as low as $11.71 shipped!

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Search

No Result
View All Result
  • Trending
  • Comments
  • Latest
rewrite this title Haliey Welch Breaks Silence on Hawk Tuah Coin Collapse

rewrite this title Haliey Welch Breaks Silence on Hawk Tuah Coin Collapse

May 6, 2025
Bitcoin Miners Selling Bitcoin to Stay Solvent Amid Volatility in Price – Decrypt

Bitcoin Miners Selling Bitcoin to Stay Solvent Amid Volatility in Price – Decrypt

August 13, 2024
rewrite this title What Are Energy-Efficient Windows? Cost, Certification and How to Choose – NerdWallet

rewrite this title What Are Energy-Efficient Windows? Cost, Certification and How to Choose – NerdWallet

April 1, 2025
rewrite this title with good SEO Spar Supermarket In Switzerland Starts Accepting Bitcoin

rewrite this title with good SEO Spar Supermarket In Switzerland Starts Accepting Bitcoin

April 18, 2025
Tech companies are interested in nuclear power, but some utilities are blocking their progress.

Tech companies are interested in nuclear power, but some utilities are blocking their progress.

August 10, 2024
Why iPhones May Get More Expensive Amid Trump Tariffs

Why iPhones May Get More Expensive Amid Trump Tariffs

April 11, 2025
rewrite this title and make it good for SEORussia says several military aircraft caught fire after alleged large-scale Ukrainian drone attack: Report

rewrite this title and make it good for SEORussia says several military aircraft caught fire after alleged large-scale Ukrainian drone attack: Report

June 1, 2025
rewrite this title with good SEO M In Bitcoin Floods In For Silk Road Founder Ross Ulbricht As Support Grows

rewrite this title with good SEO $31M In Bitcoin Floods In For Silk Road Founder Ross Ulbricht As Support Grows

June 1, 2025
Dave’s Advice On How To Deal With Debt Collectors

Dave’s Advice On How To Deal With Debt Collectors

June 1, 2025
rewrite this title Discover’s 5% Bonus Categories for Q3 2025: Gas, Transit, Utilities – NerdWallet

rewrite this title Discover’s 5% Bonus Categories for Q3 2025: Gas, Transit, Utilities – NerdWallet

June 1, 2025
rewrite this title Indiana Pacers Aren’t Sexy, but They Might Be the Thunder’s Worst Nightmare | Deadspin.com

rewrite this title Indiana Pacers Aren’t Sexy, but They Might Be the Thunder’s Worst Nightmare | Deadspin.com

June 1, 2025
My Husband Wants Alcohol In The Budget, I Do Not

My Husband Wants Alcohol In The Budget, I Do Not

June 1, 2025
DeFi Daily

Stay updated with DeFi Daily, your trusted source for the latest news, insights, and analysis in finance and cryptocurrency. Explore breaking news, expert analysis, market data, and educational resources to navigate the world of decentralized finance.

  • About Us
  • Blogs
  • DeFi-IRA | Learn More.
  • Advertise with Us
  • Disclaimer
  • Privacy Policy
  • DMCA
  • Cookie Privacy Policy
  • Terms and Conditions
  • Contact us

Copyright © 2024 Defi Daily.
Defi Daily is not responsible for the content of external sites.

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • Cryptocurrency
    • Bitcoin
    • Ethereum
    • Altcoins
    • DeFi-IRA
  • DeFi
    • NFT
    • Metaverse
    • Web 3
  • Finance
    • Business Finance
    • Personal Finance
  • Markets
    • Crypto Market
    • Stock Market
    • Analysis
  • Other News
    • World & US
    • Politics
    • Entertainment
    • Tech
    • Sports
    • Health
  • Videos

Copyright © 2024 Defi Daily.
Defi Daily is not responsible for the content of external sites.