Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. A new twist in the "AI license laundering of chardet" story https://github.com/chardet/chardet/issues/327

A new twist in the "AI license laundering of chardet" story https://github.com/chardet/chardet/issues/327

Scheduled Pinned Locked Moved Uncategorized
52 Posts 38 Posters 136 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • cwebber@social.coopC cwebber@social.coop

    A new twist in the "AI license laundering of chardet" story https://github.com/chardet/chardet/issues/327

    sanityinc@hachyderm.ioS This user is from outside of this forum
    sanityinc@hachyderm.ioS This user is from outside of this forum
    sanityinc@hachyderm.io
    wrote last edited by
    #8

    @cwebber significant popcorn moment

    1 Reply Last reply
    0
    • cwebber@social.coopC cwebber@social.coop

      Winning option 1: yes, you can vibe code proprietary codebases into the public domain, allowing us to bootstrap proprietary codebases quickly

      Winning option 2: stopping laundering of copyleft codebases

      Either of these are interesting outcomes!

      vv@solarpunk.moeV This user is from outside of this forum
      vv@solarpunk.moeV This user is from outside of this forum
      vv@solarpunk.moe
      wrote last edited by
      #9

      @cwebber the losing outcome is people use it, but it is shitty, and then it's so widely adopted as a general concept that you're forced to use shitty software

      cwebber@social.coopC 1 Reply Last reply
      0
      • vv@solarpunk.moeV vv@solarpunk.moe

        @cwebber the losing outcome is people use it, but it is shitty, and then it's so widely adopted as a general concept that you're forced to use shitty software

        cwebber@social.coopC This user is from outside of this forum
        cwebber@social.coopC This user is from outside of this forum
        cwebber@social.coop
        wrote last edited by
        #10

        @vv yeah that's defintely the shitty outcome for usability

        But... given that a lot of shittiness comes from an *uneven playing field* when it comes to copyright stuff, and people thinking they can wear down the commons with no consequences, I think it's worth pushing the needle on this approach

        1 Reply Last reply
        0
        • cwebber@social.coopC cwebber@social.coop

          But really, relicensing a GPL codebase to MIT is uninteresting.

          Let's do the interesting one, which is: vibe code a "clean room" reimplementation of an entire proprietary codebase! After all, Microsoft released a "shared source" proprietary version of Windows. Now try seeing what happens if you run THAT through the "turn it into public domain" machine

          Win-win outcome, no matter how it goes

          lizzy@social.vlhl.devL This user is from outside of this forum
          lizzy@social.vlhl.devL This user is from outside of this forum
          lizzy@social.vlhl.dev
          wrote last edited by
          #11
          @cwebber it's just gonna launder wine code lol
          1 Reply Last reply
          0
          • cwebber@social.coopC cwebber@social.coop

            I left a comment to that effect here https://github.com/chardet/chardet/issues/327#issuecomment-4005721071

            cwebber@social.coopC This user is from outside of this forum
            cwebber@social.coopC This user is from outside of this forum
            cwebber@social.coop
            wrote last edited by
            #12

            omg I am just seeing now that the dude who did the "AI relicensing" fucking replied with an obvious slop response, of all the fucking disrespectful things to do, holy fucking shit https://github.com/chardet/chardet/issues/327#issuecomment-4005195078

            neurobashing@mastodon.socialN kye@tech.lgbtK soapdog@toot.cafeS rcriii@hostux.socialR cstanhope@social.coopC 9 Replies Last reply
            0
            • cwebber@social.coopC cwebber@social.coop

              But really, relicensing a GPL codebase to MIT is uninteresting.

              Let's do the interesting one, which is: vibe code a "clean room" reimplementation of an entire proprietary codebase! After all, Microsoft released a "shared source" proprietary version of Windows. Now try seeing what happens if you run THAT through the "turn it into public domain" machine

              Win-win outcome, no matter how it goes

              eldaking@weirder.earthE This user is from outside of this forum
              eldaking@weirder.earthE This user is from outside of this forum
              eldaking@weirder.earth
              wrote last edited by
              #13

              @cwebber I have been saying this since the first copilot iteration!

              If Microsoft wants to show us how to launder copyright, by all means, let them.

              1 Reply Last reply
              0
              • cwebber@social.coopC cwebber@social.coop

                I left a comment to that effect here https://github.com/chardet/chardet/issues/327#issuecomment-4005721071

                thomasjwebb@mastodon.socialT This user is from outside of this forum
                thomasjwebb@mastodon.socialT This user is from outside of this forum
                thomasjwebb@mastodon.social
                wrote last edited by
                #14

                @cwebber that's exactly where my mind went to. Any time I've rewritten something that was in copyleft because I needed it copycenter or even with such inspiration, I wouldn't let myself even look at the original code. But it would be a net boon to OSS if the same rules apply to proprietary stuff. The bad situation would be if corporate lawyers effectively made it so that only their code is protected from such reimplementation.

                1 Reply Last reply
                0
                • cwebber@social.coopC cwebber@social.coop

                  I left a comment to that effect here https://github.com/chardet/chardet/issues/327#issuecomment-4005721071

                  johnefrancis@cosocial.caJ This user is from outside of this forum
                  johnefrancis@cosocial.caJ This user is from outside of this forum
                  johnefrancis@cosocial.ca
                  wrote last edited by
                  #15

                  @cwebber good times! 😅

                  It's going to be fun to see how the boundaries of "human produced work" are defined over time, but I expect it will work out in whatever way benefits the big money players in software and media.

                  Does this only apply to "AI"? What does that mean? If I have a machine generated background crowd or vapour in some frames of my $300M blockbuster movie, can I still copyright it?

                  1 Reply Last reply
                  0
                  • cwebber@social.coopC cwebber@social.coop

                    omg I am just seeing now that the dude who did the "AI relicensing" fucking replied with an obvious slop response, of all the fucking disrespectful things to do, holy fucking shit https://github.com/chardet/chardet/issues/327#issuecomment-4005195078

                    neurobashing@mastodon.socialN This user is from outside of this forum
                    neurobashing@mastodon.socialN This user is from outside of this forum
                    neurobashing@mastodon.social
                    wrote last edited by
                    #16

                    @cwebber happy to see Mark Pilgrim still exists, he sorta disappeared from public life and I hadn't seen him active in public-facing software at all

                    1 Reply Last reply
                    0
                    • cwebber@social.coopC cwebber@social.coop

                      But really, relicensing a GPL codebase to MIT is uninteresting.

                      Let's do the interesting one, which is: vibe code a "clean room" reimplementation of an entire proprietary codebase! After all, Microsoft released a "shared source" proprietary version of Windows. Now try seeing what happens if you run THAT through the "turn it into public domain" machine

                      Win-win outcome, no matter how it goes

                      njoseph@social.masto.hostN This user is from outside of this forum
                      njoseph@social.masto.hostN This user is from outside of this forum
                      njoseph@social.masto.host
                      wrote last edited by
                      #17

                      @cwebber Microsoft can still sue for patent violations. But Windows 7 is over 15 years old.

                      Also, trademark violations should be carefully avoided.

                      1 Reply Last reply
                      0
                      • cwebber@social.coopC cwebber@social.coop

                        omg I am just seeing now that the dude who did the "AI relicensing" fucking replied with an obvious slop response, of all the fucking disrespectful things to do, holy fucking shit https://github.com/chardet/chardet/issues/327#issuecomment-4005195078

                        kye@tech.lgbtK This user is from outside of this forum
                        kye@tech.lgbtK This user is from outside of this forum
                        kye@tech.lgbt
                        wrote last edited by
                        #18

                        @cwebber The ReactOS people take such careful steps to avoid any even potentially compromising contact with Windows source. I'm not a fan of copyright in general, but as long as it exists, people need to be mindful of what they're doing and the history it touches on.

                        It's a whole project of its own to get contributors to relicense their code and rewrite what can't be relicensed, and most projects that do it take a bunch of flak even if they have good reasons. (like GPL->AGPL)

                        And this dude just vibed his way through it

                        1 Reply Last reply
                        0
                        • cwebber@social.coopC cwebber@social.coop

                          A new twist in the "AI license laundering of chardet" story https://github.com/chardet/chardet/issues/327

                          etoani@freeradical.zoneE This user is from outside of this forum
                          etoani@freeradical.zoneE This user is from outside of this forum
                          etoani@freeradical.zone
                          wrote last edited by
                          #19

                          @cwebber I would very much like someone with a legal mind explain how software licenses interact with yesterday's ruling that AI gen work is not copyrightable. What exactly is the basis of the copyright here? I hope we get to see someone dive into this.

                          kye@tech.lgbtK 1 Reply Last reply
                          0
                          • etoani@freeradical.zoneE etoani@freeradical.zone

                            @cwebber I would very much like someone with a legal mind explain how software licenses interact with yesterday's ruling that AI gen work is not copyrightable. What exactly is the basis of the copyright here? I hope we get to see someone dive into this.

                            kye@tech.lgbtK This user is from outside of this forum
                            kye@tech.lgbtK This user is from outside of this forum
                            kye@tech.lgbt
                            wrote last edited by
                            #20

                            @etoani @cwebber It was a decline to rule. The case they declined to rule on stood, and it focused narrowly on someone trying to get his pet AI recognized as sentient to qualify for authorship under copyright law.

                            Where the line is on how much authorship flips "authored parts are copyrightable" to "the whole thing is copyrighted" is still contested and evolving in courts.

                            edit: The SCOTUS likes to let lawyers duke it out in district courts and wait for enough rulings, especially with serious cross-district conflicts, at that level to pick from to hear.

                            1 Reply Last reply
                            0
                            • cwebber@social.coopC cwebber@social.coop

                              omg I am just seeing now that the dude who did the "AI relicensing" fucking replied with an obvious slop response, of all the fucking disrespectful things to do, holy fucking shit https://github.com/chardet/chardet/issues/327#issuecomment-4005195078

                              soapdog@toot.cafeS This user is from outside of this forum
                              soapdog@toot.cafeS This user is from outside of this forum
                              soapdog@toot.cafe
                              wrote last edited by
                              #21

                              @cwebber that whole relicensing and this slop reply are vomit inducing.

                              dajb@social.coopD ectopod@hachyderm.ioE 2 Replies Last reply
                              0
                              • soapdog@toot.cafeS soapdog@toot.cafe

                                @cwebber that whole relicensing and this slop reply are vomit inducing.

                                dajb@social.coopD This user is from outside of this forum
                                dajb@social.coopD This user is from outside of this forum
                                dajb@social.coop
                                wrote last edited by
                                #22

                                @soapdog @cwebber It's just the lack of understanding of what an LLM is that's makes one's hand want to smack one's forehead. Or, preferably, his.

                                1 Reply Last reply
                                0
                                • cwebber@social.coopC cwebber@social.coop

                                  omg I am just seeing now that the dude who did the "AI relicensing" fucking replied with an obvious slop response, of all the fucking disrespectful things to do, holy fucking shit https://github.com/chardet/chardet/issues/327#issuecomment-4005195078

                                  rcriii@hostux.socialR This user is from outside of this forum
                                  rcriii@hostux.socialR This user is from outside of this forum
                                  rcriii@hostux.social
                                  wrote last edited by
                                  #23

                                  @cwebber I love the sentence "If you are indeed the Mark Pilgrim..." So steeped in bad faith that you assume others are too.

                                  1 Reply Last reply
                                  0
                                  • R relay@relay.an.exchange shared this topic
                                  • cwebber@social.coopC cwebber@social.coop

                                    But really, relicensing a GPL codebase to MIT is uninteresting.

                                    Let's do the interesting one, which is: vibe code a "clean room" reimplementation of an entire proprietary codebase! After all, Microsoft released a "shared source" proprietary version of Windows. Now try seeing what happens if you run THAT through the "turn it into public domain" machine

                                    Win-win outcome, no matter how it goes

                                    cyberia@tilde.zoneC This user is from outside of this forum
                                    cyberia@tilde.zoneC This user is from outside of this forum
                                    cyberia@tilde.zone
                                    wrote last edited by
                                    #24

                                    @cwebber Well, the maintainer's point was that this is "clean room", by which they mean Claude was not given the existing codebase as input. The counter argument is that the existing codebase almost certainly forms part of Claude's training data, so the claim of it being genuinely clean room is bogus. So to make your idea work, you'd have to use the proprietary codebase as training data, rather than prompt input.

                                    cyberia@tilde.zoneC H 2 Replies Last reply
                                    0
                                    • cyberia@tilde.zoneC cyberia@tilde.zone

                                      @cwebber Well, the maintainer's point was that this is "clean room", by which they mean Claude was not given the existing codebase as input. The counter argument is that the existing codebase almost certainly forms part of Claude's training data, so the claim of it being genuinely clean room is bogus. So to make your idea work, you'd have to use the proprietary codebase as training data, rather than prompt input.

                                      cyberia@tilde.zoneC This user is from outside of this forum
                                      cyberia@tilde.zoneC This user is from outside of this forum
                                      cyberia@tilde.zone
                                      wrote last edited by
                                      #25

                                      @cwebber and I suspect that if you made an LLM based on the specific code as training data, a court would probably rule differently to how they have ruled about LLM generated code in other cases. maybe.

                                      1 Reply Last reply
                                      0
                                      • cyberia@tilde.zoneC cyberia@tilde.zone

                                        @cwebber Well, the maintainer's point was that this is "clean room", by which they mean Claude was not given the existing codebase as input. The counter argument is that the existing codebase almost certainly forms part of Claude's training data, so the claim of it being genuinely clean room is bogus. So to make your idea work, you'd have to use the proprietary codebase as training data, rather than prompt input.

                                        H This user is from outside of this forum
                                        H This user is from outside of this forum
                                        hashbangperl@hachyderm.io
                                        wrote last edited by
                                        #26

                                        @cyberia @cwebber it would need a controlled clean-room training data and training and context, so yeah it was trained on the original GPL code and is not a clean-room implementation

                                        1 Reply Last reply
                                        0
                                        • cwebber@social.coopC cwebber@social.coop

                                          But really, relicensing a GPL codebase to MIT is uninteresting.

                                          Let's do the interesting one, which is: vibe code a "clean room" reimplementation of an entire proprietary codebase! After all, Microsoft released a "shared source" proprietary version of Windows. Now try seeing what happens if you run THAT through the "turn it into public domain" machine

                                          Win-win outcome, no matter how it goes

                                          npdoty@techpolicy.socialN This user is from outside of this forum
                                          npdoty@techpolicy.socialN This user is from outside of this forum
                                          npdoty@techpolicy.social
                                          wrote last edited by
                                          #27

                                          @cwebber I cynically fear that the likely outcome is that proprietary copyright holders with lots of lawyers and money could succeed in preventing re-licensing as open source, while copyleft advocates with few resources couldn't actually prevent re-licensing to closed.

                                          1 Reply Last reply
                                          0
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Login or register to search.
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • World
                                          • Users
                                          • Groups