this post was submitted on 08 Apr 2026
40 points (100.0% liked)

Free and Open Source Software

22211 readers
36 users here now

If it's free and open source and it's also software, it can be discussed here. Subcommunity of Technology.


This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

founded 3 years ago
MODERATORS
 

TL;DR: The advent of AI based, LLM coding applications like Anthropic’s Claude and ChatGPT have prompted maintainers to experiment with integrating LLM contributions into open source codebases.

This is a fast path to open source irrelevancy, since the US copyright office has deemed LLM outputs to be uncopyrightable. This means that as more uncopyrightable LLM outputs are integrated into nominally open source codebases, value leaks out of the project, since the open source licences are not operative on public domain code.

That means that the public domain, AI generated code can be reused without attribution, and in the case of copyleft licences - can even be used in closed source projects.

you are viewing a single comment's thread
view the rest of the comments
[–] chicken@lemmy.dbzer0.com 1 points 4 hours ago (1 children)

If that is the case, is chardet 7.0.0 a derivative work of chardet, or is it a public domain LLM work? The whole LLM project is fraught with questions like these

I think the reimplementation stuff is a separate question because the argument for it working looks a lot stronger, and because it doesn't have anything to do with the source material having LLM output in it. Also if this method holds as legally valid, it's going to be easier to just do that than justify copying code directly (which would probably have to only be copies of the explicitly generated parts of the code, requiring figuring out how to replace the rest), which means it won't matter whether some portion of it was generated. I don't see much reason to think that a purist approach to accepting LLM code will offer any meaningful protection.

I’m mostly just playing along with your thought experiment. As I said, we know that projects are already accepting LLM code into projects that are nominally copyleft.

So what though? If they aren't entirely generated, you can't make a full fork, and why would a partial fork be useful? If it isn't disclosed what parts are AI, you can't even do that without risking breaking the law.

[–] yoasif@fedia.io 1 points 3 hours ago

I think the reimplementation stuff is a separate question because the argument for it working looks a lot stronger, and because it doesn't have anything to do with the source material having LLM output in it. Also if this method holds as legally valid, it's going to be easier to just do that than justify copying code directly (which would probably have to only be copies of the explicitly generated parts of the code, requiring figuring out how to replace the rest), which means it won't matter whether some portion of it was generated.

Is it a separate question, though?

Both works are copyrighted, one is just copyrighted as "all rights reserved" (our leaked commercial code) and the rest is licensed as LGPL. We're putting both pieces of code inside the LLM and then asking the LLM to make a new version.

What makes the action of leaking different from the act of putting it on the web? Rights are reserved in either case.

If they aren't entirely generated, you can't make a full fork, and why would a partial fork be useful?

Well, people are contributing to copyleft codebases expecting that when people build on their work, that work (the derivative works) are also licensed in the same way. You don't need to fork for the value to be lost. People expected virality to be part of their contribution, and clearly the new derivative works are partially non-copyleft.

Beyond that, as more of the codebase is LLM produced, the less of it is protected by the copyleft license, until we have a ship of Theseus situation where the codebase is available, but no longer copyleft. That is clearly not what was intended by e.g. the GPL. Just look at the Stallman quote in post.