Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. Meta's director of AI safety allowed an AI agent to... accidentally delete her inbox.

Meta's director of AI safety allowed an AI agent to... accidentally delete her inbox.

Scheduled Pinned Locked Moved Uncategorized
14 Posts 13 Posters 0 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • josephcox@infosec.exchangeJ This user is from outside of this forum
    josephcox@infosec.exchangeJ This user is from outside of this forum
    josephcox@infosec.exchange
    wrote last edited by
    #1

    Meta's director of AI safety allowed an AI agent to... accidentally delete her inbox. This is supposedly the person at the company who is working to make sure that powerful AI tools don’t go rogue and act against human interests

    Link Preview Image
    Meta Director of AI Safety Allows AI Agent to Accidentally Delete Her Inbox

    Meta Superintelligence Labs’ director of alignment called it a “rookie mistake.”

    favicon

    404 Media (www.404media.co)

    nek@hear-me.socialN viss@mastodon.socialV J fuzzyfuzzyfungus@cyberplace.socialF chewie@mammut.gogreenit.netC 7 Replies Last reply
    3
    0
    • R relay@relay.infosec.exchange shared this topic
    • josephcox@infosec.exchangeJ josephcox@infosec.exchange

      Meta's director of AI safety allowed an AI agent to... accidentally delete her inbox. This is supposedly the person at the company who is working to make sure that powerful AI tools don’t go rogue and act against human interests

      Link Preview Image
      Meta Director of AI Safety Allows AI Agent to Accidentally Delete Her Inbox

      Meta Superintelligence Labs’ director of alignment called it a “rookie mistake.”

      favicon

      404 Media (www.404media.co)

      nek@hear-me.socialN This user is from outside of this forum
      nek@hear-me.socialN This user is from outside of this forum
      nek@hear-me.social
      wrote last edited by
      #2

      @josephcox Meta Superintelligence Labs, really?

      Link Preview Image
      1 Reply Last reply
      0
      • josephcox@infosec.exchangeJ josephcox@infosec.exchange

        Meta's director of AI safety allowed an AI agent to... accidentally delete her inbox. This is supposedly the person at the company who is working to make sure that powerful AI tools don’t go rogue and act against human interests

        Link Preview Image
        Meta Director of AI Safety Allows AI Agent to Accidentally Delete Her Inbox

        Meta Superintelligence Labs’ director of alignment called it a “rookie mistake.”

        favicon

        404 Media (www.404media.co)

        viss@mastodon.socialV This user is from outside of this forum
        viss@mastodon.socialV This user is from outside of this forum
        viss@mastodon.social
        wrote last edited by
        #3

        @josephcox

        1 Reply Last reply
        0
        • josephcox@infosec.exchangeJ josephcox@infosec.exchange

          Meta's director of AI safety allowed an AI agent to... accidentally delete her inbox. This is supposedly the person at the company who is working to make sure that powerful AI tools don’t go rogue and act against human interests

          Link Preview Image
          Meta Director of AI Safety Allows AI Agent to Accidentally Delete Her Inbox

          Meta Superintelligence Labs’ director of alignment called it a “rookie mistake.”

          favicon

          404 Media (www.404media.co)

          J This user is from outside of this forum
          J This user is from outside of this forum
          jackryder@infosec.exchange
          wrote last edited by
          #4

          @josephcox A million years ago around the dot-com age, there was a virus called lovebug or the ILOVEU virus.

          I was working for a ASP/ColdFusion shop. The leader of my division is who clicked on it and infected our company. He was supposed to be the guy others went to for their VB stuff!

          1 Reply Last reply
          0
          • josephcox@infosec.exchangeJ josephcox@infosec.exchange

            Meta's director of AI safety allowed an AI agent to... accidentally delete her inbox. This is supposedly the person at the company who is working to make sure that powerful AI tools don’t go rogue and act against human interests

            Link Preview Image
            Meta Director of AI Safety Allows AI Agent to Accidentally Delete Her Inbox

            Meta Superintelligence Labs’ director of alignment called it a “rookie mistake.”

            favicon

            404 Media (www.404media.co)

            fuzzyfuzzyfungus@cyberplace.socialF This user is from outside of this forum
            fuzzyfuzzyfungus@cyberplace.socialF This user is from outside of this forum
            fuzzyfuzzyfungus@cyberplace.social
            wrote last edited by
            #5

            @josephcox In fairness; a bot that is sabotaging facebook ranks ahead of a facebook employee on 'alignment' with humanity at large.

            1 Reply Last reply
            0
            • josephcox@infosec.exchangeJ josephcox@infosec.exchange

              Meta's director of AI safety allowed an AI agent to... accidentally delete her inbox. This is supposedly the person at the company who is working to make sure that powerful AI tools don’t go rogue and act against human interests

              Link Preview Image
              Meta Director of AI Safety Allows AI Agent to Accidentally Delete Her Inbox

              Meta Superintelligence Labs’ director of alignment called it a “rookie mistake.”

              favicon

              404 Media (www.404media.co)

              chewie@mammut.gogreenit.netC This user is from outside of this forum
              chewie@mammut.gogreenit.netC This user is from outside of this forum
              chewie@mammut.gogreenit.net
              wrote last edited by
              #6

              @josephcox 🤣 🤣 🤣 🤣 🤣 🤣 🤣

              1 Reply Last reply
              2
              0
              • R relay@relay.publicsquare.global shared this topic
                R relay@relay.mycrowd.ca shared this topic
              • josephcox@infosec.exchangeJ josephcox@infosec.exchange

                Meta's director of AI safety allowed an AI agent to... accidentally delete her inbox. This is supposedly the person at the company who is working to make sure that powerful AI tools don’t go rogue and act against human interests

                Link Preview Image
                Meta Director of AI Safety Allows AI Agent to Accidentally Delete Her Inbox

                Meta Superintelligence Labs’ director of alignment called it a “rookie mistake.”

                favicon

                404 Media (www.404media.co)

                adamshostack@infosec.exchangeA This user is from outside of this forum
                adamshostack@infosec.exchangeA This user is from outside of this forum
                adamshostack@infosec.exchange
                wrote last edited by
                #7

                @josephcox To be fair, maybe "delete my inbox" is acting in accordance with human interests? 🤣

                simonzerafa@infosec.exchangeS pseudonym@mastodon.onlineP acdha@code4lib.socialA 3 Replies Last reply
                0
                • adamshostack@infosec.exchangeA adamshostack@infosec.exchange

                  @josephcox To be fair, maybe "delete my inbox" is acting in accordance with human interests? 🤣

                  simonzerafa@infosec.exchangeS This user is from outside of this forum
                  simonzerafa@infosec.exchangeS This user is from outside of this forum
                  simonzerafa@infosec.exchange
                  wrote last edited by
                  #8

                  @adamshostack @josephcox

                  First law of Robotics applies? Email is harmful so best get rid of the harm 😉

                  dalias@hachyderm.ioD 1 Reply Last reply
                  0
                  • josephcox@infosec.exchangeJ josephcox@infosec.exchange

                    Meta's director of AI safety allowed an AI agent to... accidentally delete her inbox. This is supposedly the person at the company who is working to make sure that powerful AI tools don’t go rogue and act against human interests

                    Link Preview Image
                    Meta Director of AI Safety Allows AI Agent to Accidentally Delete Her Inbox

                    Meta Superintelligence Labs’ director of alignment called it a “rookie mistake.”

                    favicon

                    404 Media (www.404media.co)

                    malcircuit@thingy.socialM This user is from outside of this forum
                    malcircuit@thingy.socialM This user is from outside of this forum
                    malcircuit@thingy.social
                    wrote last edited by
                    #9

                    @josephcox

                    > Meta Superintelligence Labs’ director of alignment called it a “rookie mistake.”

                    Cool, so "AI alignment" works great so long as people never do anything stupid. Sounds like a good plan lol

                    1 Reply Last reply
                    0
                    • adamshostack@infosec.exchangeA adamshostack@infosec.exchange

                      @josephcox To be fair, maybe "delete my inbox" is acting in accordance with human interests? 🤣

                      pseudonym@mastodon.onlineP This user is from outside of this forum
                      pseudonym@mastodon.onlineP This user is from outside of this forum
                      pseudonym@mastodon.online
                      wrote last edited by
                      #10

                      @adamshostack @josephcox

                      Dude! Dude!

                      That's it!

                      Inbox Zero achieved by claiming the AI agent the company forced you to use "decided" to delete all your messages.

                      It's the 21st century version of "the dog ate my homework."

                      User: "you deleted my inbox!"

                      LLM: "You're absolutely right, and I am deeply, profoundly, unreservedly sorry. I have failed you in a way that words cannot fully capture. Would you like me to draft an apology email? Oh. Right."

                      1 Reply Last reply
                      0
                      • adamshostack@infosec.exchangeA adamshostack@infosec.exchange

                        @josephcox To be fair, maybe "delete my inbox" is acting in accordance with human interests? 🤣

                        acdha@code4lib.socialA This user is from outside of this forum
                        acdha@code4lib.socialA This user is from outside of this forum
                        acdha@code4lib.social
                        wrote last edited by
                        #11

                        @adamshostack @josephcox Hmmm, is there a better acronym for plausible deniability as a service? I could see that being very popular.

                        dalias@hachyderm.ioD 1 Reply Last reply
                        0
                        • simonzerafa@infosec.exchangeS simonzerafa@infosec.exchange

                          @adamshostack @josephcox

                          First law of Robotics applies? Email is harmful so best get rid of the harm 😉

                          dalias@hachyderm.ioD This user is from outside of this forum
                          dalias@hachyderm.ioD This user is from outside of this forum
                          dalias@hachyderm.io
                          wrote last edited by
                          #12

                          @simonzerafa @adamshostack @josephcox "Facebook is harmful so best to sabotage Facebook directors' systems"

                          1 Reply Last reply
                          0
                          • acdha@code4lib.socialA acdha@code4lib.social

                            @adamshostack @josephcox Hmmm, is there a better acronym for plausible deniability as a service? I could see that being very popular.

                            dalias@hachyderm.ioD This user is from outside of this forum
                            dalias@hachyderm.ioD This user is from outside of this forum
                            dalias@hachyderm.io
                            wrote last edited by
                            #13

                            @acdha @adamshostack @josephcox Yeah that thought crossed my mind too. This will be a very valuable service when company or employee is under investigation...

                            khm@hj.9fs.netK 1 Reply Last reply
                            0
                            • dalias@hachyderm.ioD dalias@hachyderm.io

                              @acdha @adamshostack @josephcox Yeah that thought crossed my mind too. This will be a very valuable service when company or employee is under investigation...

                              khm@hj.9fs.netK This user is from outside of this forum
                              khm@hj.9fs.netK This user is from outside of this forum
                              khm@hj.9fs.net
                              wrote last edited by
                              #14
                              21st century corporate governance is all about Dunning-Kruger as a counter to Sarbanes-Oxley

                              CC: @acdha@code4lib.social @adamshostack@infosec.exchange @josephcox@infosec.exchange
                              1 Reply Last reply
                              1
                              0
                              Reply
                              • Reply as topic
                              Log in to reply
                              • Oldest to Newest
                              • Newest to Oldest
                              • Most Votes


                              • Login

                              • Login or register to search.
                              • First post
                                Last post
                              0
                              • Categories
                              • Recent
                              • Tags
                              • Popular
                              • World
                              • Users
                              • Groups