$30 billion and the water supply of a small city later
-
@atax1a @petrillic to be fair to sam, this one is on Anthropic
-
$30 billion and the water supply of a small city later
@mistakenotmy I guess it's an improvement?

-
$30 billion and the water supply of a small city later
@mistakenotmy
I feel like our fore fathers made a mistake, we should turn around and crawl back into the ocean. -
@mistakenotmy please don't take me for an ai apologist, but there are many multi billion tools that can't count letters.
The problem only arises when idiots claim a tool can do anything one can dream of. Blame the idiots, not the tools.
Try writing a book (or colonize Mars for that matter) with a space shuttle, and you are on the same mental plane as someone using an LLM for such a task.
NO ONE is running -multiple billion dollar hypes- and marketing schemes trying to get people to write books with the space shuttle.
Plenty of tech bros/corps -are- shoving these worse than useless LLMs into basic reading, writing, searching tools meant for everyday use in information environments for millions of people.
To pretend these products aren't being heavily marketed *by grifting tech bros* to millions of people for general info use, is really disingenuous.
-
$30 billion and the water supply of a small city later
-
@atax1a @petrillic fair enough
-
NO ONE is running -multiple billion dollar hypes- and marketing schemes trying to get people to write books with the space shuttle.
Plenty of tech bros/corps -are- shoving these worse than useless LLMs into basic reading, writing, searching tools meant for everyday use in information environments for millions of people.
To pretend these products aren't being heavily marketed *by grifting tech bros* to millions of people for general info use, is really disingenuous.
@CJPaloma true, but who's pretending that though?
My intention was to use an ad ridiculum to argue these tech bros are idiots.
-
@CJPaloma true, but who's pretending that though?
My intention was to use an ad ridiculum to argue these tech bros are idiots.
@iwein @mistakenotmy I"m sorry, that wasn't clear -to me- from your reply. I guess I just sort of get suspicious anytime a person says "I'm not an apologist, but..."
We are agreed then: the tech bros are idiots! Onward!
-
$30 billion and the water supply of a small city later
@mistakenotmy AI: artificial ignorance?
-
$30 billion and the water supply of a small city later
@mistakenotmy I got curious and tried adding a system prompt to Claude basically saying "always use tools, don't trust yourself, always verify if possible", and then it got it right. They tell the models to act like this because it drives more engagement when they are confident and answer quickly on things that are deemed "trivial". That makes it somehow worse.
-
$30 billion and the water supply of a small city later
@mistakenotmy locally run gemma4:26b gets it right, but it also gets suspicious about the intent behind my questioning…

-
$30 billion and the water supply of a small city later
@mistakenotmy pick your own answer

-
$30 billion and the water supply of a small city later
@mistakenotmy We have our LLM tools hooked up to our calendars. My favorite is to ask it to do some analysis like "count how many free two hour blocks during business hours I had last week" and getting the right answer, then asking it to do it for a month and having it tell me that I had no events for entire weeks.
Literally worse than "just glancing at a calendar".
-
@iwein @mistakenotmy I"m sorry, that wasn't clear -to me- from your reply. I guess I just sort of get suspicious anytime a person says "I'm not an apologist, but..."
We are agreed then: the tech bros are idiots! Onward!
@CJPaloma @iwein @mistakenotmy
It's a poorly formulated reply that makes you seem like an AI apologist to me. Especially this part:> The problem only arises when idiots claim a tool can do anything one can dream of. Blame the idiots, not the tools.
Tools & technology have politics. They aren't neutral. AI in particular, regardless of its capability or who wields it, has the politics of surveillance & oppression, environmental pollution & exploitation, labor disruption, and more.
I doubt this is the first time you've seen it characterized as such on the fediverse. So it also reads to me like you're turning a blind eye to those criticisms, many times expressed more eloquently than me.
If you are opposed to AI (genAI and LLMs), consider that you aren't coming off that way currently.
-
$30 billion and the water supply of a small city later
@mistakenotmy lol, fake news

-
@pwinn @mistakenotmy older model with different random word

-
@pwinn @mistakenotmy older model with different random word

@pwinn @mistakenotmy All models available under free subscription are able to produce correct answer for different random words


(haven’t tested with Opus 4.7 since I don’t have pro plan account under my hand) -
@pwinn @mistakenotmy All models available under free subscription are able to produce correct answer for different random words


(haven’t tested with Opus 4.7 since I don’t have pro plan account under my hand)@pwinn @mistakenotmy … and completely random (presumably not hardcoded) question, just in case

-
@pwinn @mistakenotmy … and completely random (presumably not hardcoded) question, just in case

@pwinn @mistakenotmy ... and to put some serious nails into the coffin of "LLMs are dumb and can't solve puzzles" take -- here's Hack The Box CTF profile of my Sonnet 4.5/4.6 based AI bot: it can solve insane difficulty tasks and performs on the same level with top 0.5% of human players. Most of these tasks are recent ones so it doesn't have any writeups or solutions in its training data. So yeah: trust no one and conduct your own experiments



-
$30 billion and the water supply of a small city later
@mistakenotmy Sorry, I call cap.

