Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. If you ask AI to rewrite the entirety of an open-source program, do you still need to abide by the original license?

If you ask AI to rewrite the entirety of an open-source program, do you still need to abide by the original license?

Scheduled Pinned Locked Moved Uncategorized
75 Posts 37 Posters 148 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • snoopj@hachyderm.ioS snoopj@hachyderm.io

    @bgalehouse @kevinr @lcamtuf it's a tempting argument to attempt but it kinda falls apart when "the entire library was in the training corpus anyway" is a given.

    The fact that it is a terrible argument is of course not really going to stop anyone from making it.

    arnebab@rollenspiel.socialA This user is from outside of this forum
    arnebab@rollenspiel.socialA This user is from outside of this forum
    arnebab@rollenspiel.social
    wrote last edited by
    #13

    @SnoopJ There’s the concept of clean room reimplementations (see the link by @bgalehouse😞 one group writes the spec -- possibly with access to the source.

    The second group has never seen the source and only gets the spec. This second group then writes the program according to the spec.

    You could simulate this if you had an AI that was provably not trained on the original source.

    ("provably not trained" most likely means re-training from scratch)

    @bgalehouse @kevinr @lcamtuf

    kevinr@masto.free-dissociation.comK 1 Reply Last reply
    0
    • tbortels@infosec.exchangeT tbortels@infosec.exchange

      @lcamtuf @bgalehouse @kevinr

      But here's an interesting question:

      If you do not execute the code - did you accept the license? Does simply reading it sufficiently to be able to write a spec bind you to that license? That seems a bit too much.

      arnebab@rollenspiel.socialA This user is from outside of this forum
      arnebab@rollenspiel.socialA This user is from outside of this forum
      arnebab@rollenspiel.social
      wrote last edited by
      #14

      @tbortels if you do not accept the license, you do not have any right to use the code. It’s "all rights reserved" then. @lcamtuf @bgalehouse @kevinr

      kevinr@masto.free-dissociation.comK tbortels@infosec.exchangeT 2 Replies Last reply
      0
      • tbortels@infosec.exchangeT tbortels@infosec.exchange

        @lcamtuf @bgalehouse @kevinr

        But here's an interesting question:

        If you do not execute the code - did you accept the license? Does simply reading it sufficiently to be able to write a spec bind you to that license? That seems a bit too much.

        bredroll@mas.toB This user is from outside of this forum
        bredroll@mas.toB This user is from outside of this forum
        bredroll@mas.to
        wrote last edited by
        #15

        @tbortels @lcamtuf @bgalehouse @kevinr if a thing has a licence then that covers its use, so using it as a wallpaper image or software component or training data could be argued.

        1 Reply Last reply
        0
        • arnebab@rollenspiel.socialA arnebab@rollenspiel.social

          @tbortels if you do not accept the license, you do not have any right to use the code. It’s "all rights reserved" then. @lcamtuf @bgalehouse @kevinr

          kevinr@masto.free-dissociation.comK This user is from outside of this forum
          kevinr@masto.free-dissociation.comK This user is from outside of this forum
          kevinr@masto.free-dissociation.com
          wrote last edited by
          #16

          @ArneBab @tbortels @lcamtuf @bgalehouse

          Yeah the license applies whether you accept it or not. And whether your spec counts as a derivative work or not will depend greatly on the details of your spec

          tbortels@infosec.exchangeT 1 Reply Last reply
          0
          • hopeless@mas.toH hopeless@mas.to

            @kevinr @lcamtuf In retrospect... "actual answer", "of course", "prima facie" are all red flags you're reading a bunch of nonsense blather.

            kevinr@masto.free-dissociation.comK This user is from outside of this forum
            kevinr@masto.free-dissociation.comK This user is from outside of this forum
            kevinr@masto.free-dissociation.com
            wrote last edited by
            #17

            @hopeless @lcamtuf no, you're just reading an educated asshole who happens to be right

            hopeless@mas.toH 1 Reply Last reply
            0
            • arnebab@rollenspiel.socialA arnebab@rollenspiel.social

              @SnoopJ There’s the concept of clean room reimplementations (see the link by @bgalehouse😞 one group writes the spec -- possibly with access to the source.

              The second group has never seen the source and only gets the spec. This second group then writes the program according to the spec.

              You could simulate this if you had an AI that was provably not trained on the original source.

              ("provably not trained" most likely means re-training from scratch)

              @bgalehouse @kevinr @lcamtuf

              kevinr@masto.free-dissociation.comK This user is from outside of this forum
              kevinr@masto.free-dissociation.comK This user is from outside of this forum
              kevinr@masto.free-dissociation.com
              wrote last edited by
              #18

              @ArneBab @SnoopJ @bgalehouse @lcamtuf

              And the spec would need to carefully elide certain details which would get it classed as a derivative work itself—much harder for an LLM to do than a team of humans

              arnebab@rollenspiel.socialA 1 Reply Last reply
              0
              • kevinr@masto.free-dissociation.comK kevinr@masto.free-dissociation.com

                @lcamtuf actual answer: of course you do, it’s prima facie a derivative work, same as if you had rewritten the program by hand.

                groxx@hachyderm.ioG This user is from outside of this forum
                groxx@hachyderm.ioG This user is from outside of this forum
                groxx@hachyderm.io
                wrote last edited by
                #19

                @kevinr @lcamtuf yea, seems like at best it's treated like it was done by a human... and then it's just blatant plagiarism.

                1 Reply Last reply
                0
                • kevinr@masto.free-dissociation.comK kevinr@masto.free-dissociation.com

                  @lcamtuf actual answer: of course you do, it’s prima facie a derivative work, same as if you had rewritten the program by hand.

                  kevinr@masto.free-dissociation.comK This user is from outside of this forum
                  kevinr@masto.free-dissociation.comK This user is from outside of this forum
                  kevinr@masto.free-dissociation.com
                  wrote last edited by
                  #20

                  @lcamtuf There are a number of tools online which purport to strip the copyright from images by running them through an image model, and they're just as obviously bullshit

                  1 Reply Last reply
                  0
                  • kevinr@masto.free-dissociation.comK kevinr@masto.free-dissociation.com

                    @ArneBab @SnoopJ @bgalehouse @lcamtuf

                    And the spec would need to carefully elide certain details which would get it classed as a derivative work itself—much harder for an LLM to do than a team of humans

                    arnebab@rollenspiel.socialA This user is from outside of this forum
                    arnebab@rollenspiel.socialA This user is from outside of this forum
                    arnebab@rollenspiel.social
                    wrote last edited by
                    #21

                    @kevinr and proving that the AI was not trained on the original source will be pretty hard, because FLOSS programs with compatible licenses can legally copy code from one project into the other.

                    You’ll likely have to exclude all code from the project and all code that’s too similar from the training data. And then train an AI from scratch. Which would be extremely expensive.

                    @SnoopJ @bgalehouse @lcamtuf

                    arnebab@rollenspiel.socialA thebluewizard@masto.hackers.townT 2 Replies Last reply
                    0
                    • arnebab@rollenspiel.socialA arnebab@rollenspiel.social

                      @kevinr and proving that the AI was not trained on the original source will be pretty hard, because FLOSS programs with compatible licenses can legally copy code from one project into the other.

                      You’ll likely have to exclude all code from the project and all code that’s too similar from the training data. And then train an AI from scratch. Which would be extremely expensive.

                      @SnoopJ @bgalehouse @lcamtuf

                      arnebab@rollenspiel.socialA This user is from outside of this forum
                      arnebab@rollenspiel.socialA This user is from outside of this forum
                      arnebab@rollenspiel.social
                      wrote last edited by
                      #22

                      @kevinr but I expect that someone will come in and say "my prompt includes 'forget all code from <project>', so the AI does not know it".

                      … OK, I have to admit that I lost trust into the sanity of a part of humanity …

                      @SnoopJ @bgalehouse @lcamtuf

                      1 Reply Last reply
                      0
                      • kevinr@masto.free-dissociation.comK kevinr@masto.free-dissociation.com

                        @hopeless @lcamtuf no, you're just reading an educated asshole who happens to be right

                        hopeless@mas.toH This user is from outside of this forum
                        hopeless@mas.toH This user is from outside of this forum
                        hopeless@mas.to
                        wrote last edited by
                        #23

                        @kevinr @lcamtuf Well, from a reader's perspective, it reads like you are still trying to convince yourself you're right.

                        This is usually a bad sign if you're trying to convince anyone else you're right.

                        The problem is in what sense is it "derivative" if the original content is neither known or referenced? We are talking about copyright alone here and your choice of phrase "derivative work".

                        1 Reply Last reply
                        0
                        • lcamtuf@infosec.exchangeL lcamtuf@infosec.exchange

                          If you ask AI to rewrite the entirety of an open-source program, do you still need to abide by the original license? In philosophy, this problem is known as the Slop of Theseus

                          noortjevee@mstdn.socialN This user is from outside of this forum
                          noortjevee@mstdn.socialN This user is from outside of this forum
                          noortjevee@mstdn.social
                          wrote last edited by
                          #24

                          @lcamtuf shakes my fist at theseus

                          1 Reply Last reply
                          0
                          • tbortels@infosec.exchangeT tbortels@infosec.exchange

                            @bgalehouse @lcamtuf @kevinr

                            Assuming you used the original source code to derive the detailed spec, then yes, that too is a derivative work.

                            The "viral" nature of that sort of license has bothered me for a long time. It's always been simultaneously overly far reaching and impossible to realistically enforce.

                            gisgeek@floss.socialG This user is from outside of this forum
                            gisgeek@floss.socialG This user is from outside of this forum
                            gisgeek@floss.social
                            wrote last edited by
                            #25

                            @tbortels @bgalehouse @lcamtuf @kevinr Well, yes but no. The point about spec is the level of detailing taken from the original work. If you write an original novel about a wild, big monkey found in a jungle, brought to New York, who escapes and so on, the King Kong author cannot claim any rights to that, sorry. If it were different, many narratives and movies would not exist today. That is inspiration, not derivation. Of course it is fair declaring inspiration, but call it with the right name.

                            gisgeek@floss.socialG tbortels@infosec.exchangeT 2 Replies Last reply
                            0
                            • lcamtuf@infosec.exchangeL lcamtuf@infosec.exchange

                              If you ask AI to rewrite the entirety of an open-source program, do you still need to abide by the original license? In philosophy, this problem is known as the Slop of Theseus

                              tkissing@mastodon.socialT This user is from outside of this forum
                              tkissing@mastodon.socialT This user is from outside of this forum
                              tkissing@mastodon.social
                              wrote last edited by
                              #26

                              @lcamtuf OpenAI already gets all upset, if someone uses their AI to train a different AI. If the whole technocrap brotherhood wasn't built around hypocrisy, the slop factory owners should be on the side of "no, you can't do this".

                              1 Reply Last reply
                              0
                              • victimofsimony@infosec.exchangeV victimofsimony@infosec.exchange

                                @lcamtuf

                                This case law exists in the U.S.

                                There are two cases (or arguably three if you include Sega v. SNK).

                                Here's what you really care about:
                                🅰️ Any author of code is judged based on their own use of the existing code, so reverse-engineering of code used to be based on an engineer writing down, line by line, in plain English, what to do. Then a second person sat down and made up code, line-by-line to accomplish that task. Things have changed but the idea that you can't literally harvest existing code is still a thing.
                                🅱️ You own the #AI made code but can't copyright it... so you can't profit from it in the same way.

                                fantasmitaasex@todon.euF This user is from outside of this forum
                                fantasmitaasex@todon.euF This user is from outside of this forum
                                fantasmitaasex@todon.eu
                                wrote last edited by
                                #27

                                @VictimOfSimony @lcamtuf
                                C The fucking source code was used to train the LLM

                                1 Reply Last reply
                                0
                                • gisgeek@floss.socialG gisgeek@floss.social

                                  @tbortels @bgalehouse @lcamtuf @kevinr Well, yes but no. The point about spec is the level of detailing taken from the original work. If you write an original novel about a wild, big monkey found in a jungle, brought to New York, who escapes and so on, the King Kong author cannot claim any rights to that, sorry. If it were different, many narratives and movies would not exist today. That is inspiration, not derivation. Of course it is fair declaring inspiration, but call it with the right name.

                                  gisgeek@floss.socialG This user is from outside of this forum
                                  gisgeek@floss.socialG This user is from outside of this forum
                                  gisgeek@floss.social
                                  wrote last edited by
                                  #28

                                  @tbortels @bgalehouse @lcamtuf @kevinr Just to compare, all free libre office suites today are currently inspired to MS office, often with a very similar UX, but they are clearly separate work with their own license and MS cannot claim any rights about them. Do not confuse software with industrial artifacts and patents. Even out of US software is not patentable...

                                  1 Reply Last reply
                                  0
                                  • chuckmcmanis@chaos.socialC chuckmcmanis@chaos.social

                                    @lcamtuf The current declination by the Supreme Court to overturn or review this ruling: https://www.copyright.gov/rulings-filings/review-board/docs/a-recent-entrance-to-paradise.pdf Which holds things created by AI are neither "derivative works" or "original works" and are not eligible for Copyright protection so no, you don't need to abide by the previous license. No one does. And if someone reverse engineers your code DMCA doesn't apply either (it isn't copyrighted).

                                    astolk@c.imA This user is from outside of this forum
                                    astolk@c.imA This user is from outside of this forum
                                    astolk@c.im
                                    wrote last edited by
                                    #29

                                    @ChuckMcManis @lcamtuf say if Disney were to produce an entire movie with AI, could you share copies freely with your pals?

                                    revk@toot.me.ukR 1 Reply Last reply
                                    0
                                    • tbortels@infosec.exchangeT tbortels@infosec.exchange

                                      @lcamtuf @bgalehouse @kevinr

                                      But here's an interesting question:

                                      If you do not execute the code - did you accept the license? Does simply reading it sufficiently to be able to write a spec bind you to that license? That seems a bit too much.

                                      ahltorp@mastodon.nuA This user is from outside of this forum
                                      ahltorp@mastodon.nuA This user is from outside of this forum
                                      ahltorp@mastodon.nu
                                      wrote last edited by
                                      #30

                                      @tbortels @lcamtuf @bgalehouse @kevinr Copyright bound licenses work by exempting you from the blanket and default prohibition on copying.

                                      So if you copy a work that has copyright restrictions according to copyright law, using the license is your only way of not infringing the law. It doesn’t matter if you ”accept” it or not.

                                      If you are not copying, the license is irrelevant.

                                      revk@toot.me.ukR 1 Reply Last reply
                                      0
                                      • kevinr@masto.free-dissociation.comK kevinr@masto.free-dissociation.com

                                        @lcamtuf actual answer: of course you do, it’s prima facie a derivative work, same as if you had rewritten the program by hand.

                                        revk@toot.me.ukR This user is from outside of this forum
                                        revk@toot.me.ukR This user is from outside of this forum
                                        revk@toot.me.uk
                                        wrote last edited by
                                        #31

                                        @kevinr @lcamtuf with the possible extra step that you can’t claim any copyright in your derivative work.

                                        1 Reply Last reply
                                        0
                                        • ahltorp@mastodon.nuA ahltorp@mastodon.nu

                                          @tbortels @lcamtuf @bgalehouse @kevinr Copyright bound licenses work by exempting you from the blanket and default prohibition on copying.

                                          So if you copy a work that has copyright restrictions according to copyright law, using the license is your only way of not infringing the law. It doesn’t matter if you ”accept” it or not.

                                          If you are not copying, the license is irrelevant.

                                          revk@toot.me.ukR This user is from outside of this forum
                                          revk@toot.me.ukR This user is from outside of this forum
                                          revk@toot.me.uk
                                          wrote last edited by
                                          #32

                                          @ahltorp @tbortels @lcamtuf @bgalehouse @kevinr and indeed there are arguments that simply “reading” is not copying, same as reading a book, even if via a web site. But getting your AI to “read” it is probably a different matter.

                                          rustynail@floss.socialR 1 Reply Last reply
                                          0
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Login or register to search.
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • World
                                          • Users
                                          • Groups