Q: I want to wash my car.

llm
68 Posts 57 Posters 29 Views
  • erwinrossen@mas.to wrote:

    @knowmadd Did you also survey how many people would be tricked by this question? I admit I am one of them, because my initial reaction to your post was: what's wrong with that answer?

    shadedlady@mstdn.social wrote (#16):

    @erwinrossen Really? My first reaction in my head was 'what a dumb question'.

    • knowmadd@mastodon.world wrote:

      Q: I want to wash my car. The car wash is 50 meters away. Should I walk or drive?

      What do you think the LLM output was?

      Please review the output.

      #ai #LLM

      oldrawgabbit@mastodon.world wrote (#17):

      @knowmadd This is a very sad reflection on the minds of people today: the inability to read a question fully, the wrong standards, the assumptions made, everything.

      • nux@fosstodon.org wrote:

        @knowmadd Google's model gets it right, but then goes on to ramble. Someone needs to instruct these things not to analyse or "break this down" so much.
        All in all, as expected, disappointing.

        khleedril@cyberplace.social wrote (#18):

        @Nux @knowmadd Google has its tongue firmly in its cheek!

          netraven@hear-me.social wrote (#19):

          @knowmadd next, ask a reasonable question, and then simply state "Seahorse Emoji, now."

            t_var_s@phpc.social wrote (#20):

            @knowmadd @hook Gemini says you have to take the car. Maybe it's somehow connected to how it scores better on Vending-Bench? It has a better baseline for common sense.

            • knowmadd@mastodon.world wrote:

              Deepseek and Qwen

              #llm #ai

              ecosdelfuturo@mstdn.social wrote (#21):

              @knowmadd What I like most is that the Qwen website shows this little light bulb with the text “thinking completed.” 🙂

                themipper@mastodon.social wrote (#22):

                @knowmadd yeah, LLMs will replace us all ... they are so much better at {looking frantically through my notes} ... providing answers with high confidence that are utter nonsense.

                  roblen@microblog.at wrote (#23):

                  @knowmadd I tried to reproduce the result with Gemini and ChatGPT. Either the AI has learned something new, or there is another reason for this. Neither fell for the trick question and even responded with irony in some cases.

                    nounoursfaisdeschoses@mastodon.social wrote (#24):

                    @knowmadd I got this: "Verdict: Walking is the best choice here—it’s quick, eco-friendly, and practical for such a short distance. Plus, you’ll avoid driving a dirty car to the car wash!"

                      joonq@mastodon.social wrote (#25):

                      @knowmadd This is what techbros and pro-AI people talk about like it's the second coming of Christ or something, btw 😂 So cringe.

                        azuaron@cyberpunk.lol wrote (#26):

                        @knowmadd Deepseek was so close. 😆

                        • bonkers@nerdculture.de wrote:

                          @knowmadd clankers have no idea about real life. I hope we will see the end of this bullshit.

                          alexanderdyas@mindly.social wrote (#27):

                          @bonkers @knowmadd “clankers” good word

                            npars01@mstdn.social wrote (#28):

                            @t_var_s @knowmadd @hook

                            Don't forget, we don't know when there's a "human in the loop".

                            There may or may not be some low wage workers involved in the answer.

                            Some, like Google, have enormous investments from Saudi Arabia. Oracle is "training" 50,000 Saudi Arabians in AI.
                            https://gulfbusiness.com/oracle-targets-training-50000-saudis-in-ai-latest-tech/

                            Or is it Lebanese?
                            https://today.lorientlejour.com/article/1487826/shehadi-defends-deal-with-oracle-to-train-50000-lebanese-in-ai.html

                            How many "answers" are just 700 employees in India is hard to know. The AI bubble is rife with fraud.

                            Behind bankruptcy plea of London start-up: It hired 700 Indian engineers to pose as AI tools

                            A major AI scandal has shaken the tech world as Builder.ai, once valued at $1.5 billion, has filed for bankruptcy. The company, backed by Microsoft and a Qatari sovereign fund, falsely claimed to build apps in minutes using AI, while actually relying on hundreds of human engineers in India.

                            Firstpost (www.firstpost.com)

                              ghostonthehalfshell@masto.ai wrote (#29):

                              @knowmadd

                              I want to know what it would include in the checklist

                                djuber@fosstodon.org wrote (#30):

                                @knowmadd Ignoring the problems of washing a car, I was perplexed that it would say a 50 m distance is 30 to 40 steps. My strides are nowhere close to 1.2 m, maybe half that, and I'm a full-grown person.
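
The arithmetic behind that objection can be checked with a rough sketch. The function names below are hypothetical, and the 0.6 m figure is only the "maybe half that" estimate from the post, not a measured value.

```python
import math


def step_length_m(distance_m: float, steps: int) -> float:
    """Step length implied by covering distance_m in the given number of steps."""
    return distance_m / steps


def steps_needed(distance_m: float, step_m: float) -> int:
    """Whole steps needed to cover distance_m with a step of step_m metres."""
    return math.ceil(distance_m / step_m)


# The LLM's claim: 50 m in 30 to 40 steps.
longest = step_length_m(50, 30)   # step length if only 30 steps are taken
shortest = step_length_m(50, 40)  # step length if 40 steps are taken
print(f"implied step length: {shortest:.2f} m to {longest:.2f} m")

# Assumed step of 0.6 m, roughly half of the 1.2 m mentioned in the post.
print("steps at 0.6 m per step:", steps_needed(50, 0.6))
```

Under these assumptions the claimed 30 to 40 steps would need a step of at least 1.25 m, while a 0.6 m step gives more than twice as many steps.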

                                • knowmadd@mastodon.world wrote:

                                  "how will I wash the car once I've arrived if I choose to walk?"

                                  I'll leave you all to try this out and see the results.

                                  One output was "you got me", another was "wash the car as it's already there" after telling me to walk. The others double down in some interesting ways.

                                  #llm #ai

                                  ppxl@social.tchncs.de wrote (#31):

                                  @knowmadd this sounds like the nerd grocery shopping problem.

                                  A: "Darling, please go shopping. Bring 2 liters of milk. If they have eggs, bring 10."

                                  Later the nerd returns.

                                  A: "Why did you bring so much milk?!"

                                  B: "They had eggs. You said, I should bring 10 liters of milk if they have eggs."
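
The joke turns on which quantity the "10" binds to. A minimal sketch of the two readings, with hypothetical function names and dictionary keys chosen only for illustration:

```python
def shopping_intended(store_has_eggs: bool) -> dict:
    """Reading A meant: always 2 liters of milk, plus 10 eggs if available."""
    return {"milk_liters": 2, "eggs": 10 if store_has_eggs else 0}


def shopping_literal(store_has_eggs: bool) -> dict:
    """Reading B applied: the '10' replaces the milk quantity when eggs exist."""
    return {"milk_liters": 10 if store_has_eggs else 2, "eggs": 0}


for has_eggs in (False, True):
    print(has_eggs, shopping_intended(has_eggs), shopping_literal(has_eggs))
```

Both functions agree when the store has no eggs and diverge only when it does, which is why the instruction sounds unambiguous until the condition fires.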

                                    josh0@babka.social wrote (#32):

                                    @knowmadd I’d say it’s right on the nose! The LLM specifically says that a special case is if you have heavy equipment to carry, and your car is certainly heavy equipment that you’d need to carry if you don’t drive it there!

                                      dada87@social.cologne wrote (#33):

                                      @knowmadd I definitely want to see the list of things you should take with you! Like "a bathing suit" or "a banana"? 🤔

                                        ramblingsteve@floss.social wrote (#34):

                                        @knowmadd gpt-oss also recommends walking. I asked if I should buy a 50m hosepipe to take with me and it rightly reminded me: "No. A 50m hosepipe is excessive for washing a car 50m from your house — you don’t need to stretch it that far. A 25m hose is sufficient and more manageable." Can't argue with 120bn in logic. 🤡💦

                                          t_var_s@phpc.social wrote (#35):

                                          @Npars01 @knowmadd @hook I got the right answer when I took a screenshot of ChatGPT and just asked Gemini to transcribe it. It just added the right explanation on top. I don't think this is a case of a Waymo getting driven remotely.

                                          Doesn't mean there isn't the possibility of fraud. For example, benchmarks are probably optimised for.
