Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. When you use an AI to check on an AI, assuming they operate indendent, their combined success rate is multiplicative.

When you use an AI to check on an AI, assuming they operate indendent, their combined success rate is multiplicative.

Scheduled Pinned Locked Moved Uncategorized
5 Posts 2 Posters 1 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • icing@chaos.socialI This user is from outside of this forum
    icing@chaos.socialI This user is from outside of this forum
    icing@chaos.social
    wrote on last edited by
    #1

    When you use an AI to check on an AI, assuming they operate indendent, their combined success rate is multiplicative.

    E.g. 80% success rate, applied twice gives 64% success rate.

    (They might not be independent, training for Sycophancy might make it worse.)

    marshray@infosec.exchangeM 1 Reply Last reply
    0
    • icing@chaos.socialI icing@chaos.social

      When you use an AI to check on an AI, assuming they operate indendent, their combined success rate is multiplicative.

      E.g. 80% success rate, applied twice gives 64% success rate.

      (They might not be independent, training for Sycophancy might make it worse.)

      marshray@infosec.exchangeM This user is from outside of this forum
      marshray@infosec.exchangeM This user is from outside of this forum
      marshray@infosec.exchange
      wrote on last edited by
      #2

      @icing If they are truly configured to operate as redundant “checks”, wouldn’t it be the *failure* rate (1 - P_success) that multiplies?

      icing@chaos.socialI 1 Reply Last reply
      0
      • marshray@infosec.exchangeM marshray@infosec.exchange

        @icing If they are truly configured to operate as redundant “checks”, wouldn’t it be the *failure* rate (1 - P_success) that multiplies?

        icing@chaos.socialI This user is from outside of this forum
        icing@chaos.socialI This user is from outside of this forum
        icing@chaos.social
        wrote on last edited by
        #3

        @marshray When each check can ruin the outcome, the success rates multiply. When any positive check is overall success, the failure rates multiply, or?

        icing@chaos.socialI 1 Reply Last reply
        0
        • icing@chaos.socialI icing@chaos.social

          @marshray When each check can ruin the outcome, the success rates multiply. When any positive check is overall success, the failure rates multiply, or?

          icing@chaos.socialI This user is from outside of this forum
          icing@chaos.socialI This user is from outside of this forum
          icing@chaos.social
          wrote on last edited by
          #4

          @marshray back to AI.

          When AI 1 write a vulnerability report and you use AI 2 to check those reports, the overall assessment is only good when both do the right thing.

          marshray@infosec.exchangeM 1 Reply Last reply
          0
          • icing@chaos.socialI icing@chaos.social

            @marshray back to AI.

            When AI 1 write a vulnerability report and you use AI 2 to check those reports, the overall assessment is only good when both do the right thing.

            marshray@infosec.exchangeM This user is from outside of this forum
            marshray@infosec.exchangeM This user is from outside of this forum
            marshray@infosec.exchange
            wrote on last edited by
            #5

            @icing I see the critical assumptions as:

            1. AI 1 is operated to not create a bad report in the first place
            2. AI 2 is operated to reject a bad report
            3. The AIs probability of failure at (1) and (2) are uncorrelated

            If these assumptions were somehow validated, then they would constitute a “belt and suspenders” type redundant system.

            But such assumptions are rarely justified in practice.

            1 Reply Last reply
            1
            0
            • R relay@relay.infosec.exchange shared this topic on
            Reply
            • Reply as topic
            Log in to reply
            • Oldest to Newest
            • Newest to Oldest
            • Most Votes


            • Login

            • Login or register to search.
            • First post
              Last post
            0
            • Categories
            • Recent
            • Tags
            • Popular
            • World
            • Users
            • Groups