Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. cool.

cool.

Scheduled Pinned Locked Moved Uncategorized
19 Posts 7 Posters 0 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • viss@mastodon.socialV viss@mastodon.social

    cool. the zip i fetched on my phone when the leak hit a while back was legit.

    i have the claude code source

    Link Preview Image
    apth@infosec.exchangeA This user is from outside of this forum
    apth@infosec.exchangeA This user is from outside of this forum
    apth@infosec.exchange
    wrote last edited by
    #7

    @Viss here's a thing I don't understand very well. Anthropic's own safeguards are "ask the LLM not to do something", but we know asking LLMs not to do something isn't a guarantee they will not do that thing (deleted emails, deleted production databases, etc).

    Isn't that fundamentally kind of... fucked? Like the burden is then on users to make the system safe with controls external to the LLM because the vendor can't make it safe themselves?

    1 Reply Last reply
    1
    0
    • R relay@relay.infosec.exchange shared this topic
    • varx@defcon.socialV varx@defcon.social

      @Viss I tried sneaking a system reminder into a code comment to see if I could make claude talk like a pirate, but either it was too obvious or they have added a regex to catch it. It actually called it out as a "prompt injection attempt" for me to look into.

      viss@mastodon.socialV This user is from outside of this forum
      viss@mastodon.socialV This user is from outside of this forum
      viss@mastodon.social
      wrote last edited by
      #8

      @varx heh, maybe they updated stuff after the leak

      1 Reply Last reply
      0
      • viss@mastodon.socialV viss@mastodon.social

        hey cool wanna prompt inject the claude code tui?

        Link Preview Image
        viss@mastodon.socialV This user is from outside of this forum
        viss@mastodon.socialV This user is from outside of this forum
        viss@mastodon.social
        wrote last edited by
        #9

        Link Preview Image
        security-review.tx - Pastebin.com

        Pastebin.com is the number one paste tool since 2002. Pastebin is a website where you can store text online for a set period of time.

        favicon

        Pastebin (pastebin.com)

        so have a look at that - its the claude code tui wrapper system instructions that apply to any 'security review' anybody asks claude to do.

        review that file and tell me if you think claude is still a good tool to aim at code that needs a security review.

        hrbrmstr@mastodon.socialH S viss@mastodon.socialV 3 Replies Last reply
        0
        • viss@mastodon.socialV viss@mastodon.social

          Link Preview Image
          security-review.tx - Pastebin.com

          Pastebin.com is the number one paste tool since 2002. Pastebin is a website where you can store text online for a set period of time.

          favicon

          Pastebin (pastebin.com)

          so have a look at that - its the claude code tui wrapper system instructions that apply to any 'security review' anybody asks claude to do.

          review that file and tell me if you think claude is still a good tool to aim at code that needs a security review.

          hrbrmstr@mastodon.socialH This user is from outside of this forum
          hrbrmstr@mastodon.socialH This user is from outside of this forum
          hrbrmstr@mastodon.social
          wrote last edited by
          #10

          @Viss all the foundation model runners and lazy AI researchers declared bankruptcy when it comes to prompt injection ("it's an unfixable problem") so they dgaf anymore.

          I'm eagerly awaiting adding malicious content into RSS feeds that are `/feed` imported into Slack so that Slack's AI get's pwnd six ways from Sunday.

          viss@mastodon.socialV 1 Reply Last reply
          0
          • hrbrmstr@mastodon.socialH hrbrmstr@mastodon.social

            @Viss all the foundation model runners and lazy AI researchers declared bankruptcy when it comes to prompt injection ("it's an unfixable problem") so they dgaf anymore.

            I'm eagerly awaiting adding malicious content into RSS feeds that are `/feed` imported into Slack so that Slack's AI get's pwnd six ways from Sunday.

            viss@mastodon.socialV This user is from outside of this forum
            viss@mastodon.socialV This user is from outside of this forum
            viss@mastodon.social
            wrote last edited by
            #11

            @hrbrmstr yep. when i signed up for claude code, i took a run at their new bug bounty, and found a way to inject arbitrary text into their slack channel using prompt injection. they closed it as 'informational'.

            wtf.
            i can send whatever i want directly at your staff in a secure way and thats 'informational'?

            lfzz@mastodon.socialL 1 Reply Last reply
            0
            • viss@mastodon.socialV viss@mastodon.social

              Link Preview Image
              security-review.tx - Pastebin.com

              Pastebin.com is the number one paste tool since 2002. Pastebin is a website where you can store text online for a set period of time.

              favicon

              Pastebin (pastebin.com)

              so have a look at that - its the claude code tui wrapper system instructions that apply to any 'security review' anybody asks claude to do.

              review that file and tell me if you think claude is still a good tool to aim at code that needs a security review.

              S This user is from outside of this forum
              S This user is from outside of this forum
              sharkfie@infosec.exchange
              wrote last edited by
              #12

              @Viss what a cool and well thought out technology

              viss@mastodon.socialV 1 Reply Last reply
              0
              • S sharkfie@infosec.exchange

                @Viss what a cool and well thought out technology

                viss@mastodon.socialV This user is from outside of this forum
                viss@mastodon.socialV This user is from outside of this forum
                viss@mastodon.social
                wrote last edited by
                #13

                @sharkfie

                1 Reply Last reply
                0
                • viss@mastodon.socialV viss@mastodon.social

                  @hrbrmstr yep. when i signed up for claude code, i took a run at their new bug bounty, and found a way to inject arbitrary text into their slack channel using prompt injection. they closed it as 'informational'.

                  wtf.
                  i can send whatever i want directly at your staff in a secure way and thats 'informational'?

                  lfzz@mastodon.socialL This user is from outside of this forum
                  lfzz@mastodon.socialL This user is from outside of this forum
                  lfzz@mastodon.social
                  wrote last edited by
                  #14

                  @Viss @hrbrmstr friends don't let friends do bug bounty. If it is a corpo : immediate disclosure is responsible disclosure. Or less professionally 'fuckem' it takes me longer to get in touch with someone from ur team then it took to find the vulnerabilities.

                  hrbrmstr@mastodon.socialH 1 Reply Last reply
                  0
                  • lfzz@mastodon.socialL lfzz@mastodon.social

                    @Viss @hrbrmstr friends don't let friends do bug bounty. If it is a corpo : immediate disclosure is responsible disclosure. Or less professionally 'fuckem' it takes me longer to get in touch with someone from ur team then it took to find the vulnerabilities.

                    hrbrmstr@mastodon.socialH This user is from outside of this forum
                    hrbrmstr@mastodon.socialH This user is from outside of this forum
                    hrbrmstr@mastodon.social
                    wrote last edited by
                    #15

                    @lfzz @Viss I've been a bug bounty detractor forever and even worse now.

                    viss@mastodon.socialV 1 Reply Last reply
                    0
                    • hrbrmstr@mastodon.socialH hrbrmstr@mastodon.social

                      @lfzz @Viss I've been a bug bounty detractor forever and even worse now.

                      viss@mastodon.socialV This user is from outside of this forum
                      viss@mastodon.socialV This user is from outside of this forum
                      viss@mastodon.social
                      wrote last edited by
                      #16

                      @hrbrmstr @lfzz did you see my thread from last week?

                      lfzz@mastodon.socialL 1 Reply Last reply
                      0
                      • viss@mastodon.socialV viss@mastodon.social

                        @hrbrmstr @lfzz did you see my thread from last week?

                        lfzz@mastodon.socialL This user is from outside of this forum
                        lfzz@mastodon.socialL This user is from outside of this forum
                        lfzz@mastodon.social
                        wrote last edited by
                        #17

                        @Viss @hrbrmstr no, at least I cant remember, last week was kinda of a blur

                        viss@mastodon.socialV 1 Reply Last reply
                        0
                        • lfzz@mastodon.socialL lfzz@mastodon.social

                          @Viss @hrbrmstr no, at least I cant remember, last week was kinda of a blur

                          viss@mastodon.socialV This user is from outside of this forum
                          viss@mastodon.socialV This user is from outside of this forum
                          viss@mastodon.social
                          wrote last edited by
                          #18

                          @lfzz @hrbrmstr

                          Viss (@Viss@mastodon.social)

                          i am subscribing to misery, i think. anthropic posted a new bug bounty today, on hackerone, and i had to buy claude code for work, and i applied to their 'cyber program' (and got access in ten minutes?! wow - i submitted to openais cyber cyber thing a week and some change ago and havent heard anything back. radio silence) so i figured, aim mythos or whatever right back at anthropic, and i think i found a bug. an interesting one too. i submit it and am FULLY expecting to be pissed later.

                          favicon

                          Mastodon (mastodon.social)

                          1 Reply Last reply
                          0
                          • viss@mastodon.socialV viss@mastodon.social

                            Link Preview Image
                            security-review.tx - Pastebin.com

                            Pastebin.com is the number one paste tool since 2002. Pastebin is a website where you can store text online for a set period of time.

                            favicon

                            Pastebin (pastebin.com)

                            so have a look at that - its the claude code tui wrapper system instructions that apply to any 'security review' anybody asks claude to do.

                            review that file and tell me if you think claude is still a good tool to aim at code that needs a security review.

                            viss@mastodon.socialV This user is from outside of this forum
                            viss@mastodon.socialV This user is from outside of this forum
                            viss@mastodon.social
                            wrote last edited by
                            #19

                            Link Preview Image
                            Anthropic’s bug-hunting Mythos was greatest marketing stunt ever, says cURL creator

                            After all that hype, AI scanner found one low-severity cURL flaw

                            favicon

                            theregister (www.theregister.com)

                            1 Reply Last reply
                            1
                            0
                            Reply
                            • Reply as topic
                            Log in to reply
                            • Oldest to Newest
                            • Newest to Oldest
                            • Most Votes


                            • Login

                            • Login or register to search.
                            • First post
                              Last post
                            0
                            • Categories
                            • Recent
                            • Tags
                            • Popular
                            • World
                            • Users
                            • Groups