Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. This is bad.

This is bad.

Scheduled Pinned Locked Moved Uncategorized
84 Posts 20 Posters 0 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • dave@alvarado.socialD dave@alvarado.social

    @xgranade @SnoopJ @theorangetheme yeah I've been thinking about that, and I'm not sure I agree. The outputted code is the outputted code. "y = x + 1" doesn't gain additional attack surface because Claude autocompleted it.

    I think there are all sorts of *human* exploits that can happen and are happening, but those are all based on our laziness checking Claude's work, not Claude's output itself. Things like maintainers going "Jesus take the wheel" when Claude writes commits because it's easier

    dave@alvarado.socialD This user is from outside of this forum
    dave@alvarado.socialD This user is from outside of this forum
    dave@alvarado.social
    wrote last edited by
    #72

    @xgranade @SnoopJ @theorangetheme please don't read any of this as my endorsement of slop, I can't stand it. I'm just trying to pick apart how code autocompleted by Claude is different from the moral hazard of trusting Claude in the first place.

    1 Reply Last reply
    0
    • dave@alvarado.socialD dave@alvarado.social

      @xgranade @SnoopJ @theorangetheme yeah I've been thinking about that, and I'm not sure I agree. The outputted code is the outputted code. "y = x + 1" doesn't gain additional attack surface because Claude autocompleted it.

      I think there are all sorts of *human* exploits that can happen and are happening, but those are all based on our laziness checking Claude's work, not Claude's output itself. Things like maintainers going "Jesus take the wheel" when Claude writes commits because it's easier

      xgranade@wandering.shopX This user is from outside of this forum
      xgranade@wandering.shopX This user is from outside of this forum
      xgranade@wandering.shop
      wrote last edited by
      #73

      @dave @SnoopJ @theorangetheme My views here are complicated, but let me try and give a somewhat accurate condensed version?

      First, to your `y = x + 1` example, if the code is simple enough, that vulnerability can be mitigated by human review — the problem is still there, I contend, but was contained by review. The problem is that humans *suck* at scanning for that kind of problem. Take the TSA looking for guns in x-ray scans... they keep failing at that, and incredibly badly.

      xgranade@wandering.shopX 1 Reply Last reply
      0
      • xgranade@wandering.shopX xgranade@wandering.shop

        @srtcd424 No need to apologize, I just want to be clear about my own views on this rather than inadvertently implying criticism of Python *in particular* that I neither mean nor want to make.

        srtcd424@mas.toS This user is from outside of this forum
        srtcd424@mas.toS This user is from outside of this forum
        srtcd424@mas.to
        wrote last edited by
        #74

        @xgranade
        Yeah, fair. It feels like we're fish trapped in a pool of trustworthy software that's rapidly drying up & shrinking 😞

        1 Reply Last reply
        0
        • xgranade@wandering.shopX xgranade@wandering.shop

          @nausicaa @astraluma As @joelle pointed out, Claude is also a name that real people have. @SnoopJ's cantrip is going to be less susceptible to false positives by filtering on "anthropic.com" as well.

          SnoopJ (@SnoopJ@hachyderm.io)

          @theorangetheme@en.osm.town @xgranade@wandering.shop here are the commits on `main` where it's explicitly a co-author: (Edit: I missed a few commits because I hadn't pulled :picardfacepalm:) ``` $ git log --oneline -i --grep "Co-authored-by: Claude.*anthropic\.com" 300de1e98ac gh-86519: Add prefixmatch APIs to the re module (GH-31137) ac8b5b68900 gh-143650: Fix importlib race condition on import failure (GH-143651) 9b8d59c136c gh-72798: Add mapping example to str.translate documentation (#144454) 34e5a63f145 gh-141444: Replace dead URL in urllib.robotparser example (GH-144443) 59f247e43bc gh-115952: Fix a potential virtual memory allocation denial of service in pickle (GH-119204) 5b1862bdd80 gh-87512: Fix `subprocess` using `timeout=` on Windows blocking with a large `input=` (GH-142058) cc6bc4c97f7 GH-134453: Fix subprocess memoryview input handling on POSIX (GH-134949) 532c37695d0 gh-137134: Update SQLite to 3.50.4 for binary releases (GH-137135) ```

          favicon

          Hachyderm.io (hachyderm.io)

          joelle@social.joelle.usJ This user is from outside of this forum
          joelle@social.joelle.usJ This user is from outside of this forum
          joelle@social.joelle.us
          wrote last edited by
          #75

          @xgranade @nausicaa @astraluma @SnoopJ

          Also sometimes it's in the *commit message* that Claude helped, rather than in the user or first line of the commit, so --oneline probably isn't what you want either.

          xgranade@wandering.shopX 1 Reply Last reply
          0
          • xgranade@wandering.shopX xgranade@wandering.shop

            @dave @SnoopJ @theorangetheme My views here are complicated, but let me try and give a somewhat accurate condensed version?

            First, to your `y = x + 1` example, if the code is simple enough, that vulnerability can be mitigated by human review — the problem is still there, I contend, but was contained by review. The problem is that humans *suck* at scanning for that kind of problem. Take the TSA looking for guns in x-ray scans... they keep failing at that, and incredibly badly.

            xgranade@wandering.shopX This user is from outside of this forum
            xgranade@wandering.shopX This user is from outside of this forum
            xgranade@wandering.shop
            wrote last edited by
            #76

            @dave @SnoopJ @theorangetheme As code changes grow, it's even harder to do that mitigation, especially when those code changes interact with a highly complex code base. There's times where `y = x + 1` would be a catastrophic error due to someone else doing pointer math and whatnot, say.

            Beyond that, though, it's not clear to what degree *if any* extruded code can be copyrighted. If it can't be, what impact does that have on the project.

            xgranade@wandering.shopX 1 Reply Last reply
            0
            • joelle@social.joelle.usJ joelle@social.joelle.us

              @xgranade @nausicaa @astraluma @SnoopJ

              Also sometimes it's in the *commit message* that Claude helped, rather than in the user or first line of the commit, so --oneline probably isn't what you want either.

              xgranade@wandering.shopX This user is from outside of this forum
              xgranade@wandering.shopX This user is from outside of this forum
              xgranade@wandering.shop
              wrote last edited by
              #77

              @joelle @nausicaa @astraluma @SnoopJ True, but that at least biases towards false negatives instead of false positives, which seems like a fair tradeoff?

              1 Reply Last reply
              0
              • xgranade@wandering.shopX xgranade@wandering.shop

                @dave @SnoopJ @theorangetheme As code changes grow, it's even harder to do that mitigation, especially when those code changes interact with a highly complex code base. There's times where `y = x + 1` would be a catastrophic error due to someone else doing pointer math and whatnot, say.

                Beyond that, though, it's not clear to what degree *if any* extruded code can be copyrighted. If it can't be, what impact does that have on the project.

                xgranade@wandering.shopX This user is from outside of this forum
                xgranade@wandering.shopX This user is from outside of this forum
                xgranade@wandering.shop
                wrote last edited by
                #78

                @dave @SnoopJ @theorangetheme What happens if, as sometimes happens, the code extruded by a generator is a verbatim quotation of code in its training set, and that comes from a different license? I'm not a lawyer, so I don't understand these risks well enough to always know what is and isn't safe for me to accept, especially if slop extruders are involved.

                snoopj@hachyderm.ioS 1 Reply Last reply
                0
                • xgranade@wandering.shopX xgranade@wandering.shop

                  @dave @SnoopJ @theorangetheme What happens if, as sometimes happens, the code extruded by a generator is a verbatim quotation of code in its training set, and that comes from a different license? I'm not a lawyer, so I don't understand these risks well enough to always know what is and isn't safe for me to accept, especially if slop extruders are involved.

                  snoopj@hachyderm.ioS This user is from outside of this forum
                  snoopj@hachyderm.ioS This user is from outside of this forum
                  snoopj@hachyderm.io
                  wrote last edited by
                  #79

                  @xgranade @dave @theorangetheme IANAL either but it is worth pointing out that generation and *distribution* are separate activities, and humans are still holding all the liability for the latter (which is also the only legally-enforceable part to begin with)

                  xgranade@wandering.shopX 1 Reply Last reply
                  0
                  • snoopj@hachyderm.ioS snoopj@hachyderm.io

                    @xgranade @dave @theorangetheme IANAL either but it is worth pointing out that generation and *distribution* are separate activities, and humans are still holding all the liability for the latter (which is also the only legally-enforceable part to begin with)

                    xgranade@wandering.shopX This user is from outside of this forum
                    xgranade@wandering.shopX This user is from outside of this forum
                    xgranade@wandering.shop
                    wrote last edited by
                    #80

                    @SnoopJ @dave @theorangetheme That's fair, yeah. My point is more I don't understand the exact shape of the risk... if I redistribute code that was generated by an AI agent, what additional risk if any do I incur?

                    snoopj@hachyderm.ioS 1 Reply Last reply
                    0
                    • xgranade@wandering.shopX xgranade@wandering.shop

                      @nausicaa @astraluma As @joelle pointed out, Claude is also a name that real people have. @SnoopJ's cantrip is going to be less susceptible to false positives by filtering on "anthropic.com" as well.

                      SnoopJ (@SnoopJ@hachyderm.io)

                      @theorangetheme@en.osm.town @xgranade@wandering.shop here are the commits on `main` where it's explicitly a co-author: (Edit: I missed a few commits because I hadn't pulled :picardfacepalm:) ``` $ git log --oneline -i --grep "Co-authored-by: Claude.*anthropic\.com" 300de1e98ac gh-86519: Add prefixmatch APIs to the re module (GH-31137) ac8b5b68900 gh-143650: Fix importlib race condition on import failure (GH-143651) 9b8d59c136c gh-72798: Add mapping example to str.translate documentation (#144454) 34e5a63f145 gh-141444: Replace dead URL in urllib.robotparser example (GH-144443) 59f247e43bc gh-115952: Fix a potential virtual memory allocation denial of service in pickle (GH-119204) 5b1862bdd80 gh-87512: Fix `subprocess` using `timeout=` on Windows blocking with a large `input=` (GH-142058) cc6bc4c97f7 GH-134453: Fix subprocess memoryview input handling on POSIX (GH-134949) 532c37695d0 gh-137134: Update SQLite to 3.50.4 for binary releases (GH-137135) ```

                      favicon

                      Hachyderm.io (hachyderm.io)

                      nausicaa@xoxo.zoneN This user is from outside of this forum
                      nausicaa@xoxo.zoneN This user is from outside of this forum
                      nausicaa@xoxo.zone
                      wrote last edited by
                      #81

                      @xgranade @astraluma @joelle @SnoopJ Fair. Given the current scale, I just clicked through to check the different commits, but that doesn't scale as well as SnoopJ's approach.

                      xgranade@wandering.shopX 1 Reply Last reply
                      0
                      • nausicaa@xoxo.zoneN nausicaa@xoxo.zone

                        @xgranade @astraluma @joelle @SnoopJ Fair. Given the current scale, I just clicked through to check the different commits, but that doesn't scale as well as SnoopJ's approach.

                        xgranade@wandering.shopX This user is from outside of this forum
                        xgranade@wandering.shopX This user is from outside of this forum
                        xgranade@wandering.shop
                        wrote last edited by
                        #82

                        @nausicaa @astraluma @joelle @SnoopJ That's fair, too, this is so far a small handful and it's not too hard to manually validate that positives are actually true positives.

                        1 Reply Last reply
                        0
                        • xgranade@wandering.shopX xgranade@wandering.shop

                          @SnoopJ @dave @theorangetheme That's fair, yeah. My point is more I don't understand the exact shape of the risk... if I redistribute code that was generated by an AI agent, what additional risk if any do I incur?

                          snoopj@hachyderm.ioS This user is from outside of this forum
                          snoopj@hachyderm.ioS This user is from outside of this forum
                          snoopj@hachyderm.io
                          wrote last edited by
                          #83

                          @xgranade @dave @theorangetheme IMO the risk profile from a legal liability standpoint is exactly the same as if you'd written it by hand

                          that is, if you distribute a machine-generated copy of a protected work, that doesn't really factor into the ability of that work's owner to sue you for said distribution. the owner has as much standing (in the legalistic sense) as they would if you'd copied and pasted by hand

                          now the actual *trial* that might arise could have some differences, especially where a judge's discretion is involved (e.g. in awarding damages), but considering how things have gone in the courts so far, I feel reasonably confident in saying that a litigant with a big enough warchest to be a pain in the ass in court over it is going to get treated about the same?

                          (which might be a complicated way to say "the legalistic arguments are moot, whoever has the deeper pockets wins" but I do enjoy pondering the legal theory even if I know how little it matters to the legal system that actually exists)

                          1 Reply Last reply
                          0
                          • xgranade@wandering.shopX xgranade@wandering.shop

                            I'm gonna be real with folks here. I fucked up, and bad, with my participation in the open-slopware list. As a result, I'm not the right person to do it, but there has to be some kind of accounting for what damage AI is doing to open source.

                            For all the whinging about "supply chains" over the past few years, it *is* a problem when your code suddenly depends on AI, even if only indirectly.

                            jo@infosec.exchangeJ This user is from outside of this forum
                            jo@infosec.exchangeJ This user is from outside of this forum
                            jo@infosec.exchange
                            wrote last edited by
                            #84

                            @xgranade As someone who doesn't know anything about open-slopware, what was bad about it?

                            1 Reply Last reply
                            1
                            0
                            • R relay@relay.infosec.exchange shared this topic
                            Reply
                            • Reply as topic
                            Log in to reply
                            • Oldest to Newest
                            • Newest to Oldest
                            • Most Votes


                            • Login

                            • Login or register to search.
                            • First post
                              Last post
                            0
                            • Categories
                            • Recent
                            • Tags
                            • Popular
                            • World
                            • Users
                            • Groups