Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. $30 billion and the water supply of a small city later

$30 billion and the water supply of a small city later

Scheduled Pinned Locked Moved Uncategorized
27 Posts 16 Posters 0 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • mistakenotmy@mastodon.socialM mistakenotmy@mastodon.social

    $30 billion and the water supply of a small city later

    pontus_k@mastodon.socialP This user is from outside of this forum
    pontus_k@mastodon.socialP This user is from outside of this forum
    pontus_k@mastodon.social
    wrote last edited by
    #14

    @mistakenotmy I got curious and tried adding a system prompt to Claude basically saying "always use tools, don't trust yourself, always verify if possible", and then it got it right. They tell the models to act like this because it drives more engagement when they are confident and answer quickly on things that are deemed "trivial". That makes it somehow worse.

    T 1 Reply Last reply
    0
    • mistakenotmy@mastodon.socialM mistakenotmy@mastodon.social

      $30 billion and the water supply of a small city later

      chrisvest@mastodon.socialC This user is from outside of this forum
      chrisvest@mastodon.socialC This user is from outside of this forum
      chrisvest@mastodon.social
      wrote last edited by
      #15

      @mistakenotmy locally run gemma4:26b gets it right, but it also gets suspicious about the intent behind my questioning…

      Link Preview Image
      1 Reply Last reply
      0
      • mistakenotmy@mastodon.socialM mistakenotmy@mastodon.social

        $30 billion and the water supply of a small city later

        rich@mastodon.gamedev.placeR This user is from outside of this forum
        rich@mastodon.gamedev.placeR This user is from outside of this forum
        rich@mastodon.gamedev.place
        wrote last edited by
        #16

        @mistakenotmy pick your own answer 😕

        1 Reply Last reply
        0
        • mistakenotmy@mastodon.socialM mistakenotmy@mastodon.social

          $30 billion and the water supply of a small city later

          darcmoughty@infosec.exchangeD This user is from outside of this forum
          darcmoughty@infosec.exchangeD This user is from outside of this forum
          darcmoughty@infosec.exchange
          wrote last edited by
          #17

          @mistakenotmy We have our LLM tools hooked up to our calendars. My favorite is to ask it to do some analysis like "count how many free two hour blocks during business hours I had last week" and getting the right answer, then asking it to do it for a month and having it tell me that I had no events for entire weeks.

          Literally worse than "just glancing at a calendar".

          1 Reply Last reply
          0
          • cjpaloma@mstdn.socialC cjpaloma@mstdn.social

            @iwein @mistakenotmy I"m sorry, that wasn't clear -to me- from your reply. I guess I just sort of get suspicious anytime a person says "I'm not an apologist, but..."

            We are agreed then: the tech bros are idiots! Onward!

            toolbear@tech.lgbtT This user is from outside of this forum
            toolbear@tech.lgbtT This user is from outside of this forum
            toolbear@tech.lgbt
            wrote last edited by
            #18

            @CJPaloma @iwein @mistakenotmy
            It's a poorly formulated reply that makes you seem like an AI apologist to me. Especially this part:

            > The problem only arises when idiots claim a tool can do anything one can dream of. Blame the idiots, not the tools.

            Tools & technology have politics. They aren't neutral. AI in particular, regardless of its capability or who wields it, has the politics of surveillance & oppression, environmental pollution & exploitation, labor disruption, and more.

            I doubt this is the first time you've seen it characterized as such on the fediverse. So it also reads to me like you're turning a blind eye to those criticisms, many times expressed more eloquently than me.

            If you are opposed to AI (genAI and LLMs), consider that you aren't coming off that way currently.

            iwein@mas.toI 1 Reply Last reply
            0
            • mistakenotmy@mastodon.socialM mistakenotmy@mastodon.social

              $30 billion and the water supply of a small city later

              d_olex@mastodon.socialD This user is from outside of this forum
              d_olex@mastodon.socialD This user is from outside of this forum
              d_olex@mastodon.social
              wrote last edited by
              #19

              @mistakenotmy lol, fake news

              Link Preview Image
              1 Reply Last reply
              0
              • d_olex@mastodon.socialD This user is from outside of this forum
                d_olex@mastodon.socialD This user is from outside of this forum
                d_olex@mastodon.social
                wrote last edited by
                #20

                @pwinn @mistakenotmy older model with different random word

                Link Preview Image
                d_olex@mastodon.socialD 1 Reply Last reply
                0
                • d_olex@mastodon.socialD d_olex@mastodon.social

                  @pwinn @mistakenotmy older model with different random word

                  Link Preview Image
                  d_olex@mastodon.socialD This user is from outside of this forum
                  d_olex@mastodon.socialD This user is from outside of this forum
                  d_olex@mastodon.social
                  wrote last edited by
                  #21

                  @pwinn @mistakenotmy All models available under free subscription are able to produce correct answer for different random words 🤷🏻
                  (haven’t tested with Opus 4.7 since I don’t have pro plan account under my hand)

                  d_olex@mastodon.socialD 1 Reply Last reply
                  0
                  • d_olex@mastodon.socialD d_olex@mastodon.social

                    @pwinn @mistakenotmy All models available under free subscription are able to produce correct answer for different random words 🤷🏻
                    (haven’t tested with Opus 4.7 since I don’t have pro plan account under my hand)

                    d_olex@mastodon.socialD This user is from outside of this forum
                    d_olex@mastodon.socialD This user is from outside of this forum
                    d_olex@mastodon.social
                    wrote last edited by
                    #22

                    @pwinn @mistakenotmy … and completely random (presumably not hardcoded) question, just in case

                    Link Preview Image
                    d_olex@mastodon.socialD 1 Reply Last reply
                    0
                    • d_olex@mastodon.socialD d_olex@mastodon.social

                      @pwinn @mistakenotmy … and completely random (presumably not hardcoded) question, just in case

                      Link Preview Image
                      d_olex@mastodon.socialD This user is from outside of this forum
                      d_olex@mastodon.socialD This user is from outside of this forum
                      d_olex@mastodon.social
                      wrote last edited by
                      #23

                      @pwinn @mistakenotmy ... and to put some serious nails into the coffin of "LLMs are dumb and can't solve puzzles" take -- here's Hack The Box CTF profile of my Sonnet 4.5/4.6 based AI bot: it can solve insane difficulty tasks and performs on the same level with top 0.5% of human players. Most of these tasks are recent ones so it doesn't have any writeups or solutions in its training data. So yeah: trust no one and conduct your own experiments 🙂

                      Link Preview ImageLink Preview Image
                      1 Reply Last reply
                      0
                      • mistakenotmy@mastodon.socialM mistakenotmy@mastodon.social

                        $30 billion and the water supply of a small city later

                        T This user is from outside of this forum
                        T This user is from outside of this forum
                        twin_ice@ohai.social
                        wrote last edited by
                        #24

                        @mistakenotmy Sorry, I call cap.

                        Link Preview Image
                        1 Reply Last reply
                        0
                        • pontus_k@mastodon.socialP pontus_k@mastodon.social

                          @mistakenotmy I got curious and tried adding a system prompt to Claude basically saying "always use tools, don't trust yourself, always verify if possible", and then it got it right. They tell the models to act like this because it drives more engagement when they are confident and answer quickly on things that are deemed "trivial". That makes it somehow worse.

                          T This user is from outside of this forum
                          T This user is from outside of this forum
                          twin_ice@ohai.social
                          wrote last edited by
                          #25

                          @pontus_k my custom prompt is even simpler. Be terse, don't be afraid of challenging my opinion. I've attached my own results and for both questions it's a one character: "1".

                          1 Reply Last reply
                          0
                          • mistakenotmy@mastodon.socialM mistakenotmy@mastodon.social

                            $30 billion and the water supply of a small city later

                            n_dimension@infosec.exchangeN This user is from outside of this forum
                            n_dimension@infosec.exchangeN This user is from outside of this forum
                            n_dimension@infosec.exchange
                            wrote last edited by
                            #26

                            @mistakenotmy

                            3 years of kindergarden, 8 years of primary school, 3 three years of highschool and three years of university and uncounted media consumed in between and all the resources wasted and the stupid human misspelled "strawberry"

                            What a colosal failure.
                            Euthanise this aborted biological.

                            1 Reply Last reply
                            0
                            • toolbear@tech.lgbtT toolbear@tech.lgbt

                              @CJPaloma @iwein @mistakenotmy
                              It's a poorly formulated reply that makes you seem like an AI apologist to me. Especially this part:

                              > The problem only arises when idiots claim a tool can do anything one can dream of. Blame the idiots, not the tools.

                              Tools & technology have politics. They aren't neutral. AI in particular, regardless of its capability or who wields it, has the politics of surveillance & oppression, environmental pollution & exploitation, labor disruption, and more.

                              I doubt this is the first time you've seen it characterized as such on the fediverse. So it also reads to me like you're turning a blind eye to those criticisms, many times expressed more eloquently than me.

                              If you are opposed to AI (genAI and LLMs), consider that you aren't coming off that way currently.

                              iwein@mas.toI This user is from outside of this forum
                              iwein@mas.toI This user is from outside of this forum
                              iwein@mas.to
                              wrote last edited by
                              #27

                              @toolbear all that is ok 🙂

                              1 Reply Last reply
                              0
                              • R relay@relay.publicsquare.global shared this topic
                              Reply
                              • Reply as topic
                              Log in to reply
                              • Oldest to Newest
                              • Newest to Oldest
                              • Most Votes


                              • Login

                              • Login or register to search.
                              • First post
                                Last post
                              0
                              • Categories
                              • Recent
                              • Tags
                              • Popular
                              • World
                              • Users
                              • Groups