Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. If you ask AI to rewrite the entirety of an open-source program, do you still need to abide by the original license?

If you ask AI to rewrite the entirety of an open-source program, do you still need to abide by the original license?

Scheduled Pinned Locked Moved Uncategorized
75 Posts 37 Posters 148 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • lcamtuf@infosec.exchangeL lcamtuf@infosec.exchange

    If you ask AI to rewrite the entirety of an open-source program, do you still need to abide by the original license? In philosophy, this problem is known as the Slop of Theseus

    swampputty@infosec.exchangeS This user is from outside of this forum
    swampputty@infosec.exchangeS This user is from outside of this forum
    swampputty@infosec.exchange
    wrote last edited by
    #12

    @lcamtuf someone or sonething else has done the work. not you. so whoever creates the work, owns the work.

    1 Reply Last reply
    0
    • snoopj@hachyderm.ioS snoopj@hachyderm.io

      @bgalehouse @kevinr @lcamtuf it's a tempting argument to attempt but it kinda falls apart when "the entire library was in the training corpus anyway" is a given.

      The fact that it is a terrible argument is of course not really going to stop anyone from making it.

      arnebab@rollenspiel.socialA This user is from outside of this forum
      arnebab@rollenspiel.socialA This user is from outside of this forum
      arnebab@rollenspiel.social
      wrote last edited by
      #13

      @SnoopJ There’s the concept of clean room reimplementations (see the link by @bgalehouse😞 one group writes the spec -- possibly with access to the source.

      The second group has never seen the source and only gets the spec. This second group then writes the program according to the spec.

      You could simulate this if you had an AI that was provably not trained on the original source.

      ("provably not trained" most likely means re-training from scratch)

      @bgalehouse @kevinr @lcamtuf

      kevinr@masto.free-dissociation.comK 1 Reply Last reply
      0
      • tbortels@infosec.exchangeT tbortels@infosec.exchange

        @lcamtuf @bgalehouse @kevinr

        But here's an interesting question:

        If you do not execute the code - did you accept the license? Does simply reading it sufficiently to be able to write a spec bind you to that license? That seems a bit too much.

        arnebab@rollenspiel.socialA This user is from outside of this forum
        arnebab@rollenspiel.socialA This user is from outside of this forum
        arnebab@rollenspiel.social
        wrote last edited by
        #14

        @tbortels if you do not accept the license, you do not have any right to use the code. It’s "all rights reserved" then. @lcamtuf @bgalehouse @kevinr

        kevinr@masto.free-dissociation.comK tbortels@infosec.exchangeT 2 Replies Last reply
        0
        • tbortels@infosec.exchangeT tbortels@infosec.exchange

          @lcamtuf @bgalehouse @kevinr

          But here's an interesting question:

          If you do not execute the code - did you accept the license? Does simply reading it sufficiently to be able to write a spec bind you to that license? That seems a bit too much.

          bredroll@mas.toB This user is from outside of this forum
          bredroll@mas.toB This user is from outside of this forum
          bredroll@mas.to
          wrote last edited by
          #15

          @tbortels @lcamtuf @bgalehouse @kevinr if a thing has a licence then that covers its use, so using it as a wallpaper image or software component or training data could be argued.

          1 Reply Last reply
          0
          • arnebab@rollenspiel.socialA arnebab@rollenspiel.social

            @tbortels if you do not accept the license, you do not have any right to use the code. It’s "all rights reserved" then. @lcamtuf @bgalehouse @kevinr

            kevinr@masto.free-dissociation.comK This user is from outside of this forum
            kevinr@masto.free-dissociation.comK This user is from outside of this forum
            kevinr@masto.free-dissociation.com
            wrote last edited by
            #16

            @ArneBab @tbortels @lcamtuf @bgalehouse

            Yeah the license applies whether you accept it or not. And whether your spec counts as a derivative work or not will depend greatly on the details of your spec

            tbortels@infosec.exchangeT 1 Reply Last reply
            0
            • hopeless@mas.toH hopeless@mas.to

              @kevinr @lcamtuf In retrospect... "actual answer", "of course", "prima facie" are all red flags you're reading a bunch of nonsense blather.

              kevinr@masto.free-dissociation.comK This user is from outside of this forum
              kevinr@masto.free-dissociation.comK This user is from outside of this forum
              kevinr@masto.free-dissociation.com
              wrote last edited by
              #17

              @hopeless @lcamtuf no, you're just reading an educated asshole who happens to be right

              hopeless@mas.toH 1 Reply Last reply
              0
              • arnebab@rollenspiel.socialA arnebab@rollenspiel.social

                @SnoopJ There’s the concept of clean room reimplementations (see the link by @bgalehouse😞 one group writes the spec -- possibly with access to the source.

                The second group has never seen the source and only gets the spec. This second group then writes the program according to the spec.

                You could simulate this if you had an AI that was provably not trained on the original source.

                ("provably not trained" most likely means re-training from scratch)

                @bgalehouse @kevinr @lcamtuf

                kevinr@masto.free-dissociation.comK This user is from outside of this forum
                kevinr@masto.free-dissociation.comK This user is from outside of this forum
                kevinr@masto.free-dissociation.com
                wrote last edited by
                #18

                @ArneBab @SnoopJ @bgalehouse @lcamtuf

                And the spec would need to carefully elide certain details which would get it classed as a derivative work itself—much harder for an LLM to do than a team of humans

                arnebab@rollenspiel.socialA 1 Reply Last reply
                0
                • kevinr@masto.free-dissociation.comK kevinr@masto.free-dissociation.com

                  @lcamtuf actual answer: of course you do, it’s prima facie a derivative work, same as if you had rewritten the program by hand.

                  groxx@hachyderm.ioG This user is from outside of this forum
                  groxx@hachyderm.ioG This user is from outside of this forum
                  groxx@hachyderm.io
                  wrote last edited by
                  #19

                  @kevinr @lcamtuf yea, seems like at best it's treated like it was done by a human... and then it's just blatant plagiarism.

                  1 Reply Last reply
                  0
                  • kevinr@masto.free-dissociation.comK kevinr@masto.free-dissociation.com

                    @lcamtuf actual answer: of course you do, it’s prima facie a derivative work, same as if you had rewritten the program by hand.

                    kevinr@masto.free-dissociation.comK This user is from outside of this forum
                    kevinr@masto.free-dissociation.comK This user is from outside of this forum
                    kevinr@masto.free-dissociation.com
                    wrote last edited by
                    #20

                    @lcamtuf There are a number of tools online which purport to strip the copyright from images by running them through an image model, and they're just as obviously bullshit

                    1 Reply Last reply
                    0
                    • kevinr@masto.free-dissociation.comK kevinr@masto.free-dissociation.com

                      @ArneBab @SnoopJ @bgalehouse @lcamtuf

                      And the spec would need to carefully elide certain details which would get it classed as a derivative work itself—much harder for an LLM to do than a team of humans

                      arnebab@rollenspiel.socialA This user is from outside of this forum
                      arnebab@rollenspiel.socialA This user is from outside of this forum
                      arnebab@rollenspiel.social
                      wrote last edited by
                      #21

                      @kevinr and proving that the AI was not trained on the original source will be pretty hard, because FLOSS programs with compatible licenses can legally copy code from one project into the other.

                      You’ll likely have to exclude all code from the project and all code that’s too similar from the training data. And then train an AI from scratch. Which would be extremely expensive.

                      @SnoopJ @bgalehouse @lcamtuf

                      arnebab@rollenspiel.socialA thebluewizard@masto.hackers.townT 2 Replies Last reply
                      0
                      • arnebab@rollenspiel.socialA arnebab@rollenspiel.social

                        @kevinr and proving that the AI was not trained on the original source will be pretty hard, because FLOSS programs with compatible licenses can legally copy code from one project into the other.

                        You’ll likely have to exclude all code from the project and all code that’s too similar from the training data. And then train an AI from scratch. Which would be extremely expensive.

                        @SnoopJ @bgalehouse @lcamtuf

                        arnebab@rollenspiel.socialA This user is from outside of this forum
                        arnebab@rollenspiel.socialA This user is from outside of this forum
                        arnebab@rollenspiel.social
                        wrote last edited by
                        #22

                        @kevinr but I expect that someone will come in and say "my prompt includes 'forget all code from <project>', so the AI does not know it".

                        … OK, I have to admit that I lost trust into the sanity of a part of humanity …

                        @SnoopJ @bgalehouse @lcamtuf

                        1 Reply Last reply
                        0
                        • kevinr@masto.free-dissociation.comK kevinr@masto.free-dissociation.com

                          @hopeless @lcamtuf no, you're just reading an educated asshole who happens to be right

                          hopeless@mas.toH This user is from outside of this forum
                          hopeless@mas.toH This user is from outside of this forum
                          hopeless@mas.to
                          wrote last edited by
                          #23

                          @kevinr @lcamtuf Well, from a reader's perspective, it reads like you are still trying to convince yourself you're right.

                          This is usually a bad sign if you're trying to convince anyone else you're right.

                          The problem is in what sense is it "derivative" if the original content is neither known or referenced? We are talking about copyright alone here and your choice of phrase "derivative work".

                          1 Reply Last reply
                          0
                          • lcamtuf@infosec.exchangeL lcamtuf@infosec.exchange

                            If you ask AI to rewrite the entirety of an open-source program, do you still need to abide by the original license? In philosophy, this problem is known as the Slop of Theseus

                            noortjevee@mstdn.socialN This user is from outside of this forum
                            noortjevee@mstdn.socialN This user is from outside of this forum
                            noortjevee@mstdn.social
                            wrote last edited by
                            #24

                            @lcamtuf shakes my fist at theseus

                            1 Reply Last reply
                            0
                            • tbortels@infosec.exchangeT tbortels@infosec.exchange

                              @bgalehouse @lcamtuf @kevinr

                              Assuming you used the original source code to derive the detailed spec, then yes, that too is a derivative work.

                              The "viral" nature of that sort of license has bothered me for a long time. It's always been simultaneously overly far reaching and impossible to realistically enforce.

                              gisgeek@floss.socialG This user is from outside of this forum
                              gisgeek@floss.socialG This user is from outside of this forum
                              gisgeek@floss.social
                              wrote last edited by
                              #25

                              @tbortels @bgalehouse @lcamtuf @kevinr Well, yes but no. The point about spec is the level of detailing taken from the original work. If you write an original novel about a wild, big monkey found in a jungle, brought to New York, who escapes and so on, the King Kong author cannot claim any rights to that, sorry. If it were different, many narratives and movies would not exist today. That is inspiration, not derivation. Of course it is fair declaring inspiration, but call it with the right name.

                              gisgeek@floss.socialG tbortels@infosec.exchangeT 2 Replies Last reply
                              0
                              • lcamtuf@infosec.exchangeL lcamtuf@infosec.exchange

                                If you ask AI to rewrite the entirety of an open-source program, do you still need to abide by the original license? In philosophy, this problem is known as the Slop of Theseus

                                tkissing@mastodon.socialT This user is from outside of this forum
                                tkissing@mastodon.socialT This user is from outside of this forum
                                tkissing@mastodon.social
                                wrote last edited by
                                #26

                                @lcamtuf OpenAI already gets all upset, if someone uses their AI to train a different AI. If the whole technocrap brotherhood wasn't built around hypocrisy, the slop factory owners should be on the side of "no, you can't do this".

                                1 Reply Last reply
                                0
                                • victimofsimony@infosec.exchangeV victimofsimony@infosec.exchange

                                  @lcamtuf

                                  This case law exists in the U.S.

                                  There are two cases (or arguably three if you include Sega v. SNK).

                                  Here's what you really care about:
                                  🅰️ Any author of code is judged based on their own use of the existing code, so reverse-engineering of code used to be based on an engineer writing down, line by line, in plain English, what to do. Then a second person sat down and made up code, line-by-line to accomplish that task. Things have changed but the idea that you can't literally harvest existing code is still a thing.
                                  🅱️ You own the #AI made code but can't copyright it... so you can't profit from it in the same way.

                                  fantasmitaasex@todon.euF This user is from outside of this forum
                                  fantasmitaasex@todon.euF This user is from outside of this forum
                                  fantasmitaasex@todon.eu
                                  wrote last edited by
                                  #27

                                  @VictimOfSimony @lcamtuf
                                  C The fucking source code was used to train the LLM

                                  1 Reply Last reply
                                  0
                                  • gisgeek@floss.socialG gisgeek@floss.social

                                    @tbortels @bgalehouse @lcamtuf @kevinr Well, yes but no. The point about spec is the level of detailing taken from the original work. If you write an original novel about a wild, big monkey found in a jungle, brought to New York, who escapes and so on, the King Kong author cannot claim any rights to that, sorry. If it were different, many narratives and movies would not exist today. That is inspiration, not derivation. Of course it is fair declaring inspiration, but call it with the right name.

                                    gisgeek@floss.socialG This user is from outside of this forum
                                    gisgeek@floss.socialG This user is from outside of this forum
                                    gisgeek@floss.social
                                    wrote last edited by
                                    #28

                                    @tbortels @bgalehouse @lcamtuf @kevinr Just to compare, all free libre office suites today are currently inspired to MS office, often with a very similar UX, but they are clearly separate work with their own license and MS cannot claim any rights about them. Do not confuse software with industrial artifacts and patents. Even out of US software is not patentable...

                                    1 Reply Last reply
                                    0
                                    • chuckmcmanis@chaos.socialC chuckmcmanis@chaos.social

                                      @lcamtuf The current declination by the Supreme Court to overturn or review this ruling: https://www.copyright.gov/rulings-filings/review-board/docs/a-recent-entrance-to-paradise.pdf Which holds things created by AI are neither "derivative works" or "original works" and are not eligible for Copyright protection so no, you don't need to abide by the previous license. No one does. And if someone reverse engineers your code DMCA doesn't apply either (it isn't copyrighted).

                                      astolk@c.imA This user is from outside of this forum
                                      astolk@c.imA This user is from outside of this forum
                                      astolk@c.im
                                      wrote last edited by
                                      #29

                                      @ChuckMcManis @lcamtuf say if Disney were to produce an entire movie with AI, could you share copies freely with your pals?

                                      revk@toot.me.ukR 1 Reply Last reply
                                      0
                                      • tbortels@infosec.exchangeT tbortels@infosec.exchange

                                        @lcamtuf @bgalehouse @kevinr

                                        But here's an interesting question:

                                        If you do not execute the code - did you accept the license? Does simply reading it sufficiently to be able to write a spec bind you to that license? That seems a bit too much.

                                        ahltorp@mastodon.nuA This user is from outside of this forum
                                        ahltorp@mastodon.nuA This user is from outside of this forum
                                        ahltorp@mastodon.nu
                                        wrote last edited by
                                        #30

                                        @tbortels @lcamtuf @bgalehouse @kevinr Copyright bound licenses work by exempting you from the blanket and default prohibition on copying.

                                        So if you copy a work that has copyright restrictions according to copyright law, using the license is your only way of not infringing the law. It doesn’t matter if you ”accept” it or not.

                                        If you are not copying, the license is irrelevant.

                                        revk@toot.me.ukR 1 Reply Last reply
                                        0
                                        • kevinr@masto.free-dissociation.comK kevinr@masto.free-dissociation.com

                                          @lcamtuf actual answer: of course you do, it’s prima facie a derivative work, same as if you had rewritten the program by hand.

                                          revk@toot.me.ukR This user is from outside of this forum
                                          revk@toot.me.ukR This user is from outside of this forum
                                          revk@toot.me.uk
                                          wrote last edited by
                                          #31

                                          @kevinr @lcamtuf with the possible extra step that you can’t claim any copyright in your derivative work.

                                          1 Reply Last reply
                                          0
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Login or register to search.
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • World
                                          • Users
                                          • Groups