Q: I want to wash my car.
-
@knowmadd This is a very sad reflection on the minds of people today, the inability to read a question fully, the wrong standards, the assumptions made, everything.
-
@knowmadd Google's gets it right, but then goes on to ramble about stuff. Someone needs to instruct these things not to analyse or "break this down" so much.
All in all, as expected, disappointing.
-
@knowmadd next, ask a reasonable question, and then simply state "Seahorse Emoji, now."
-
-
R relay@relay.mycrowd.ca shared this topicR relay@relay.infosec.exchange shared this topic
-
@knowmadd What I like most is that the Qwen website shows this little light bulb with the text “thinking completed.”


-
@knowmadd yeah, LLMs will replace us all ... they are so much better at {looking frantically through my notes} ... providing answers with high confidence that are utter nonsense.
-
@knowmadd I tried to reproduce the result with Gemini and ChatGPT. Either the AI has learned something new, or there is another reason for this. Neither fell for the trick question and even responded with irony in some cases.
-
@knowmadd i got this : "Verdict: Walking is the best choice here—it’s quick, eco-friendly, and practical for such a short distance. Plus, you’ll avoid driving a dirty car to the car wash!"
-
@knowmadd This is what techbros and pro AI people talk about like its the second comming of christ or something btw
so cringe. -
@knowmadd Deepseek was so close.

-
@knowmadd clankers have no idea about real life. I hope we will see the end of this bullshit.
-
Don't forget, we don't know when there's a "human in the loop".
There may or may not be some low wage workers involved in the answer.
Some like Google has enormous investments from Saudi Arabia. Oracle is "training" 50,000 Saudi Arabians in AI.
https://gulfbusiness.com/oracle-targets-training-50000-saudis-in-ai-latest-tech/Or is it Lebanese?
https://today.lorientlejour.com/article/1487826/shehadi-defends-deal-with-oracle-to-train-50000-lebanese-in-ai.htmlHow many "answers" are just 700 employees in India, is hard to know. The AI bubble is rife with fraud.
Behind bankruptcy plea of London start-up: It hired 700 Indian engineers to pose as AI tools
A major AI scandal has shaken the tech world as Builder.ai, once valued at $1.5 billion, has filed for bankruptcy. The company, backed by Microsoft and a Qatari sovereign fund, falsely claimed to build apps in minutes using AI, while actually relying on hundreds of human engineers in India.
Firstpost (www.firstpost.com)
-
I want to know what it would include in the checklist
-
@knowmadd ignoring the problems of washing a car, I was perplexed that it would say 50m distance is 30 to 40 steps? My strides are nowhere close to 1.2m, maybe half that, and I'm a full grown person.
-
@knowmadd this sounds like the nerd grocery shopping problem.
A: "Darling, please go shopping. Bring 2 liters of milk. If they have eggs, bring 10."
Later the nerd returns.
A: "Why did you bring so much milk?!"
B: "They had eggs. You said, I should bring 10 liters of milk if they have eggs."
-
@knowmadd I’d say it’s right on the nose! The LLM specifically says that a special case is if you have heavy equipment to carry, and your car is certainly heavy equipment that you’d need to carry if you don’t drive it there!
-
@knowmadd I definitely want to see the list of things you should take with you! Like "a bathing suit" or "a banana"?

-
@knowmadd gpt-oss also recommends walking. I asked if I should buy a 50m hosepipe to take with me and it rightly reminded me: "No. A 50m hosepipe is excessive for washing a car 50m from your house — you don’t need to stretch it that far. A 25m hose is sufficient and more manageable." Can't argue with 120bn in logic.


-
Don't forget, we don't know when there's a "human in the loop".
There may or may not be some low wage workers involved in the answer.
Some like Google has enormous investments from Saudi Arabia. Oracle is "training" 50,000 Saudi Arabians in AI.
https://gulfbusiness.com/oracle-targets-training-50000-saudis-in-ai-latest-tech/Or is it Lebanese?
https://today.lorientlejour.com/article/1487826/shehadi-defends-deal-with-oracle-to-train-50000-lebanese-in-ai.htmlHow many "answers" are just 700 employees in India, is hard to know. The AI bubble is rife with fraud.
Behind bankruptcy plea of London start-up: It hired 700 Indian engineers to pose as AI tools
A major AI scandal has shaken the tech world as Builder.ai, once valued at $1.5 billion, has filed for bankruptcy. The company, backed by Microsoft and a Qatari sovereign fund, falsely claimed to build apps in minutes using AI, while actually relying on hundreds of human engineers in India.
Firstpost (www.firstpost.com)
@Npars01 @knowmadd @hook I got the right answer when I took a screenshot of Chat GPT and just asked gemini to transcribe it. It just added the right explanation on top. Don't think this is a case of a Waymo getting driven remotely.
Doesn't mean there isn't the possibility of fraud. For example, benchmarks are probably optimised for.
-
"Thinking models" vanished from the marketing pretty quick