Q: I want to wash my car.

llm
68 Posts 57 Posters 29 Views
  • npars01@mstdn.social

    @t_var_s @knowmadd @hook

    Don't forget, we don't know when there's a "human in the loop".

    There may or may not be some low wage workers involved in the answer.

    Some, like Google, have enormous investments from Saudi Arabia. Oracle is "training" 50,000 Saudi Arabians in AI.
    https://gulfbusiness.com/oracle-targets-training-50000-saudis-in-ai-latest-tech/

    Or is it Lebanese?
    https://today.lorientlejour.com/article/1487826/shehadi-defends-deal-with-oracle-to-train-50000-lebanese-in-ai.html

    How many "answers" are just 700 employees in India is hard to know. The AI bubble is rife with fraud.

    Behind bankruptcy plea of London start-up: It hired 700 Indian engineers to pose as AI tools

    A major AI scandal has shaken the tech world as Builder.ai, once valued at $1.5 billion, has filed for bankruptcy. The company, backed by Microsoft and a Qatari sovereign fund, falsely claimed to build apps in minutes using AI, while actually relying on hundreds of human engineers in India.

    Firstpost (www.firstpost.com)

    t_var_s@phpc.social wrote (#35):

    @Npars01 @knowmadd @hook I got the right answer when I took a screenshot of ChatGPT and just asked Gemini to transcribe it. It just added the right explanation on top. I don't think this is a case of a Waymo getting driven remotely.

    Doesn't mean there isn't the possibility of fraud. For example, benchmarks are probably optimised for.

    • knowmadd@mastodon.world

      Q: I want to wash my car. The car wash is 50 meters away. Should I walk or drive?

      What do you think the LLM output was?

      Please review the output.

      #ai #LLM

      nagaram@hachyderm.io wrote (#36):

      @knowmadd

      "Thinking models" vanished from the marketing pretty quick

        jasper@mastodon.nl wrote (#37):

        @knowmadd I guess you have some heavy equipment to carry, but c'mon, it's only 50m and you're young!

          jtig@infosec.exchange wrote (#38):

          @knowmadd the new strawberrry!

          • azuaron@cyberpunk.lol

            @knowmadd Deepseek was so close. 😆

            outofspace@berlin.social wrote (#39):

            @Azuaron @knowmadd DeepSeek does not recommend walking 🤔

            • knowmadd@mastodon.world

              Deepseek and Qwen

              #llm #ai

              hermannus@stegodon.nl wrote (#40):

              @knowmadd that Mistral checklist should be fun

                jefverbeeck@mastodon.social wrote (#41):

                @knowmadd

                [Image]

                  valhalla@social.gl-como.it wrote (#42):

                  @knowmadd to be fair one of the answers mentioned using the car if you have to carry heavy equipment, and I'd say that a car *is* heavy and it probably counts as equipment 😄

                  although it has wheels, so maybe it could be pushed 😄

                  (can you even push a modern car? my mental model for these things is probably stuck in the last century)

                  • ramblingsteve@floss.social

                    @knowmadd gpt-oss also recommends walking. I asked if I should buy a 50m hosepipe to take with me and it rightly reminded me: "No. A 50m hosepipe is excessive for washing a car 50m from your house — you don’t need to stretch it that far. A 25m hose is sufficient and more manageable." Can't argue with 120bn in logic. 🤡💦

                    knowmadd@mastodon.world wrote (#43):

                    @ramblingsteve 🤣

                      flashmobofone@mastodon.art wrote (#44):

                      @knowmadd That is a super intelligent AI right there.

                        ve2uwy@mastodon.radio wrote (#45):

                        @knowmadd

                        “I burned down a rainforest and all I got was more stupider.”

                        • ppxl@social.tchncs.de

                          @knowmadd this sounds like the nerd grocery shopping problem.

                          A: "Darling, please go shopping. Bring 2 liters of milk. If they have eggs, bring 10."

                          Later the nerd returns.

                          A: "Why did you bring so much milk?!"

                          B: "They had eggs. You said I should bring 10 liters of milk if they have eggs."

                          th3blu3kn19ht@infosec.exchange wrote (#46):

                          @ppxl @knowmadd 🤣🤣

                            azuaron@cyberpunk.lol wrote (#47):

                            @OutOfSpace @knowmadd "For minimal environmental benefit -> walk (and then drive)"

                              leadegroot@bne.social wrote (#48):

                              @knowmadd well, in their defence (I'm doing what?), a good chunk of people would say the same thing. Hopefully only for a moment though, before they went "wait a second!" 🙂

                                kerrick@ruby.social wrote (#49):

                                @knowmadd It's interesting to see different "levels" of Gemini respond in different ways.

                                [Four images]

                                  kevinpacheco@mastodon.social wrote (#50):

                                  @knowmadd – Claude is too stupid for me to bother with.

                                  [Image]

                                    tattooed_mummy@beige.party wrote (#51):

                                    @knowmadd DeepSeek:
                                    "You should drive the car to the car wash because the car needs to be at the location to be washed. Walking would leave the car at home, so you wouldn't be able to wash it."

                                    (In its working out it discussed environmental issues but also pointed out they were irrelevant, as the car needs to be present.)

                                      outofspace@berlin.social wrote (#52):

                                      @Azuaron @knowmadd Yeah, as a second option. First option recommended:
                                      For convenience -> Drive.

                                      This is what is called selective reporting. Marketing departments of the pharmaceutical industry are famous for it.

                                      My point was that DeepSeek recognized that the car needs to be at the car wash in the end. This is at least a little bit better than the other LLMs in your test. Your alt-text suggested otherwise.

                                      I don't want to say that DeepSeek performed well in your test though 🤣

                                      • roblen@microblog.at

                                        @knowmadd I tried to reproduce the result with Gemini and ChatGPT. Either the AI has learned something new, or there is another reason for this. Neither fell for the trick question and even responded with irony in some cases.

                                        weizenspreu@chaos.social wrote (#53):

                                        @roblen @knowmadd How often have you tried? Only once?

                                          bjoern@chaos.social wrote (#54):

                                          @knowmadd GLM4.7 passes the test. And nearly every other up-to-date thinking model too 😉

                                          [Image]