this post was submitted on 22 Jan 2026
522 points (98.9% liked)

Technology

79061 readers
3110 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] rafoix@lemmy.zip 72 points 1 day ago (3 children)

Will they be sued per book?

[–] UnspecificGravity@piefed.social 30 points 1 day ago (2 children)

It's not stealing when corpos do it.

Meta torrented their training data from the pirate bay. Hell, Spotify initially built their catalog from pirated music. They all do this shit. Corporations are built to steal our shit and sell it back to us. This isn't any different from pumping oil out of pubic lands and selling it back to us.

[–] demonsword@lemmy.world 3 points 1 day ago

pumping oil out of pubic lands

this sounds really painful lol

[–] ICastFist@programming.dev 2 points 1 day ago

wish meta had torrented all the viruses, too, would be fun to read the news of "facebook and instagram are offline as meta suffers from cyberattack"

[–] Goodlucksil@lemmy.dbzer0.com 11 points 1 day ago

No becaese the lawyer cohort will destroy them.

[–] jim3692@discuss.online -3 points 1 day ago* (last edited 1 day ago) (2 children)
[–] Filetternavn@lemmy.blahaj.zone 20 points 1 day ago (2 children)

Pirating books is not fair use. Using copyrighted works to train an AI model is not fair use. People seem to grossly misunderstand what fair use is, and how limited its scope is. Don't believe me? Here's legal the precedent

[–] Grimy@lemmy.world 10 points 1 day ago* (last edited 1 day ago) (1 children)

In his June ruling, Judge Alsup agreed with Anthropic's argument, stating the company's use of books by the plaintiffs to train their AI model was acceptable.

"The training use was a fair use," he wrote. "The use of the books at issue to train Claude and its precursors was exceedingly transformative."

However, the judge ruled that Anthropic's use of millions of pirated books to build its models – books that websites such as Library Genesis (LibGen) and Pirate Library Mirror (PiLiMi) copied without getting the authors' consent or giving them compensation – was not.

Pirating isn't but training on copyrighted works is fair use, you just have to buy them.

[–] Railcar8095@lemmy.world 8 points 1 day ago

So they can train with books they have legal acces to? Either bought, rent or licensed?

[–] jim3692@discuss.online 5 points 1 day ago

I was referring to Altman lobbying towards considering AI training as fair use of copyrighted material.

I know that pirating is not fair use. However, AI companies seem to rely on pirated copies to train their slop machines, and they are trying to justify this behavior.

[–] rafoix@lemmy.zip 2 points 1 day ago

A business is not fair use. They’re taking someone’s intellectual property and using it to make their product useful.