Don't use LLM generated code in your projects yet!

cwebber@social.coop

There are only two strategies which are acceptable: either AI model output is completely illegal because of copyright stuff (this is unlikely to happen because there is now too much money behind it), or AI model output is fully in the public domain, which has its own problems but at least is an even playing field.

There won't be a middle ground that is safe. Because they want something that looks like a "middle ground", but really, all it does is lock in the big players' control over information, forever.

mhoye@cosocial.ca

@cwebber "The law always bends to capital, and when it doesn't capital buys new laws" is how I've heard that fundamentally expressed. Nobody should be looking at the copyright term extension acts and seeing a tool that benefits the people or the common good.

Copyright Term Extension Act - Wikipedia

(en.wikipedia.org)

rusty__shackleford@mastodon.social

@cwebber
Biggest enshittification to come: LLM companies trying to claim rights to the linux kernel and every opensource project their software has touched.

From a copyright perspective, everyone is absolutely insane for doing this.

promovicz@chaos.social

@cwebber I think we should resist socially and politically, for as long as there is a point, and until we figure out "benign LLMs". I'm pretty sure that's possible.

wyatt_h_knott@vermont.masto.host

@cwebber so, we will get a middle ground answer. Because what they actually want is to lock in the big player's control over information, forever. Just listen to Altman and his "we see intelligence as a utility that you will pay us for"

This is why Meta and Google are building fiber under the oceans. This is why Amazon wants to be all things to everyone. They want you locked in, they DO NOT LIKE the distributed power that the internet currently gives to indivuals.

woozle@toot.cat

@cwebber This agrees with my intuition on the matter -- the problem is not that content is being "stolen", it's that free AI "labor" "steals" the revenue that creators need in order to survive. For me, that points towards UBI, not reinforcing the highly unjust systems that trickle media revenue back to (a select few) creators.

(...speaking as a lifelong creator who almost made $5 playing live one time.)

thomasjwebb@mastodon.social

@cwebber Now I feel dumb. This is basically what my concern has been - that a situation would arise where the regulatory or legal situation turns it into an oligopoly and destroy smaller software companies. Yet I didn’t consider use of the output as a harm to oss projects that use it (unless the code quality is bad) so I’ve been using it in a few oss repos of mine on the grounds my day job leaves me with insufficient time to do it all myself. And thinking it’ll get more expensive.

janl@narrativ.es

@cwebber I’d settle for: if the models include licensed sources and use those without a license (proprietary or open source) then the model needs to be published openly and usage needs to be free.

rootwyrm@weird.autos

@cwebber the US is not a country of laws, period. What USPTO says doesn't matter.

The EU however, just 3 days ago adopted text. LLM scammers MUST comply with licenses including payment to train on copyrighted work, regardless of location. And purely LLM generated slop *cannot be copyrighted*. There MUST be significant human contribution.

So purely LLM generated slop to try and license wash something is pretty much definitively unlawful now.

Protecting copyrighted work and the EU’s creative sector in the age of AI | News | European Parliament

To protect the creative sector in the EU, the use of copyrighted work by artificial intelligence requires transparency and fair remuneration, Parliament says.

(www.europarl.europa.eu)

rootwyrm@weird.autos

@cwebber and remember, these are the dipshits pissing off the old companies that have infinite dollars by stealing *their* stuff. The people who spent millions turning copyright into a way to maintain monopolies and permanent rent-seeking.
The people who have used copyright as a weapon for many decades are decidedly not fans of 'companies' stealing the things they own to generate and sell things based on it.
And the LLM grifters absolutely do not have the money to pay them off.

jrconlin@mindof.jrconlin.com

@cwebber

I fully expect well funded companies to repeatedly challenge "AI cannot be copywritten because it wasn't human generated", and I expect it will be continually chipped away. That's going to make things stupidly complicated for a lot of non-technical reasons for a long, long time.

The advice I've given is to absolutely, and definitively denote exactly what code was AI generated keep detailed records of the history around it (including the source and date), because I guarantee that will become the crux of any future decision.

Until there's case law established, AI code is a liability.

ohir@social.vivaldi.net

@cwebber
It used to not be copyrightable. But considering nazi track the US is sliping on, the new copyright act prepared by Bezos and Thiel over a some blody drink will say:

1) anything produced by humanity belong to whomever the tyrant wants, as we have it all in the LLM.
2) any royalties are going to us, see above.

kennethbousquet@mastodon.social

@cwebber In my opinion, the moment that personal information gets out in the public domain without proper consent , this becomes an actionable matter.
AI generated code must be open-source and doing this way, helps everybody to freely create.
The moment the $$$ gets in the picture, you are killing the true creativity potential of the people.

raggi@don.rag.pub

@cwebber did you read the copyright office opinion doc? What’s your take on what it says?

johannab@cosocial.ca

@promovicz @cwebber

There is validity, with all kinds of different framing, to resisting the careless use of a complex and poorly understood technology as the answer to Life, the Universe, and Everything.

I think the thesis at hand though, is that trying to use outdated and inadequate, poorly fit-for-context copyright law as the tool (a technology, heh) to do that is not likely to be productive. It will consume our resources without meeting our purposes.

johannab@cosocial.ca

@promovicz @cwebber

Part of the problem still being … what, exactly, IS our purpose in this melee?

martyfouts@mastodon.online

@cwebber The UK has a third option: the person operating the AI is the author and the output is copyrighted. Would not surprise me if the industry lobbies more jurisdictions into similar legislation.

cwebber@social.coop

@MartyFouts Link to more info on UK case law?

feld@friedcheese.us

@cwebber I'd be more concerned if someone can make a tool that can prove code came from a specific model but I don't think that's gonna happen either

paul@notnull.space

@cwebber I can see the future: legal concerns over LLM written code results in people rewriting code by hand to circumvent potential LLM code licence violations.

CIRCLE WITH A DOT

Don't use LLM generated code in your projects yet!

Copyright Term Extension Act - Wikipedia

Protecting copyrighted work and the EU’s creative sector in the age of AI | News | European Parliament

Protecting copyrighted work and the EU’s creative sector in the age of AI | News | European Parliament