Q: I want to wash my car.
-
@knowmadd It's interesting to see different "levels" of Gemini respond in different ways.




-
@knowmadd – Claude is too stupid for me to bother with.

-
@knowmadd DeepSeek :
"You should drive the car to the car wash because the car needs to be at the location to be washed. Walking would leave the car at home, so you wouldn't be able to wash it." (In its working out it discussed environmental issues, but also pointed out that they were irrelevant, as the car needs to be present.)
-
@OutOfSpace @knowmadd "For minimal environmental benefit -> walk (and then drive)"
@Azuaron @knowmadd Yeah, as a second option. The first option recommended was:
For convenience -> Drive.
This is what is called selective reporting. Marketing departments in the pharmaceutical industry are famous for it.
My point was that DeepSeek recognized that the car needs to be at the car wash in the end. This is at least a little bit better than the other LLMs in your test. Your alt text suggested otherwise.
I don't want to say that DeepSeek performed well in your test, though.

-
@knowmadd I tried to reproduce the result with Gemini and ChatGPT. Either the AI has learned something new, or there is another reason for this. Neither fell for the trick question and even responded with irony in some cases.
-
@knowmadd GLM4.7 passes the test, and so does nearly every other up-to-date thinking model.


-
@knowmadd GLM 5 gives a pretty clear answer

-
@knowmadd the e2e-encrypted models at confer.to got it sort of right?:

-
@weizenspreu @knowmadd Yes. Only once.
-
@knowmadd I like Deepseek’s “hey just go for a walk anyway but remember to come back for the car” response

-
Please put this on LinkedIn as well. That's the audience that needs such things as a constant reminder.
-
Z.ai GLM-5:
You should **drive**.
Here is why:
1. **Logic:** You cannot wash your car if you leave it behind. Walking there would mean arriving at the car wash without the object you intend to clean.
2. **Efficiency:** While 50 meters is a very short distance (about half a football field), driving takes the car directly to the cleaning equipment.
**Tip:** Since 50 meters is a very short trip, try to avoid turning the engine off immediately upon arrival if the engine was cold; short trips without warm-up time can be slightly harder on the engine over the long term. However, for the sake of the task, driving is the only option that makes sense.
-
@knowmadd That alt text does not convey the same information as the image.
-
@knowmadd
Magistral:24b said:
[...]
However, if the intention is to drive the car through an automatic or self-service car wash facility located 50 meters away, then driving would be the standard approach. But given the phrasing "wash my car" rather than "take my car to the car wash," and considering practicality for such a short distance, walking to retrieve washing supplies is likely more sensible.
[...]
I would give it a pass.
-
@weizenspreu @knowmadd OK, I'll try it again.
-
@knowmadd hey did you know that using llm chatbots to dunk on their output is still positive usage stats for these companies and contributes to their destruction of the environment just the same?
stop doing this shit
-
@sammy@cherrykitten.gay @knowmadd@mastodon.world did you know that it's running regardless, and until there are actual laws or something to limit them, your choice to use or not use them has no material effect?
your enemy is the corporations making these and the governments allowing them. moral grandstanding to people with no power or authority is useless
-
@knowmadd I'm disappointed no LLM suggested pushing the car there. I mean, you get to walk, work out AND the car will be there for the wash!
