As a software developer who took an elective in neural networks - when people call LLMs stochastic parrots, that's not criticism of their results.

alterelefant@mastodontech.de

@usuario
I also had to look it up, I am however not a native speaker.
@leeloo @knuxbbs

androcat@toot.cat

@leeloo If I want to disparage, I say "LLMs are just a word list with a randomizer". It's slightly less accurate, because it's a very specific kind of word list and a likewise specific kind of randomizer, but it gets the cultists all riled up. So that's cool.

uriel@x.keinpfusch.net

@leeloo

nope. What you describe as "stocastical parrot" is Markov, Hidden Markov Model (HMM) , not a VLLM.

You can find an HMM in your mobile phone, AKA T9, AKA "keyboard suggestions".

leeloo@chaosfem.tw

@uriel
What part exactly are you saying nope to.

Dispelling the magic and god-like status or some specific detail?

uriel@x.keinpfusch.net

@leeloo

nope to the bunch of bullshit you wrote under the assumption a VLLM is a Hidden Markov Model , aka "stochastic parrot".

tal@mastodon.social

@calcifer
> But the hype is unreal and legitimately dangerous.

I blame Sam Altman for that 100%

leeloo@chaosfem.tw

@uriel
Ah, so you are saying that you decided that I said something I never did, and then saying nope to that, so that you can pretend that you have a real argument.

Like when creationists try arguing against evolution using pseudo-scientific arguments to hide that they are defending the bible.

uriel@x.keinpfusch.net

@leeloo

Oh, the good old “I was misunderstood.” I genuinely hope your communication skills improve someday, so you can finally express your ideas clearly

leeloo@chaosfem.tw

@uriel
What I'm saying is that you are beating a strawman of your own making and putting words in my mouth.

robotistry@mstdn.ca

@wolf480pl @leeloo No. Maybe this will help.

0: one action, no choice (clockwork automaton, wind-up toy)
1: different actions, no choices (RC car)
2: choice, no plan (reactive robot)
3a: plan, no on-line or off-line learning (adaptive robot)
3b: plan, no on-line learning (same number for 3a and 3b because these are effectively the same when operating)
4: on-line learning - but only what and how it has been told
5a: ability to spontaneously generate new categories of output without being explicitly asked or told to do so (WBEAT)
5b: ability to spontaneously identify new categories of the same kinds of input WBEAT
6: ability to spontaneously identify new kinds of things to learn WBEAT
7: ability to spontaneously identify new ways to learn WBEAT
8: ability to choose new things to learn WBEAT

LLMs that you're not training are category 3b. They are static machines, responding to your input like an elevator responding to a button push.

LLMs that learn are category 4.

1/2

robotistry@mstdn.ca

@wolf480pl @leeloo Examples:

Category 5a: a text-based LLM that spontaneously, without being asked, learns to output musical notation.

Category 5b: a text-based LLM that spontaneously, unprompted, without being asked, learns that filenames can be used as input.

Category 6: a text-based LLM that spontaneously, without being asked (directly or indirectly) learns that it can output ascii images or generate sounds instead of sentences.

Category 7: a text-based LLM spontaneously changes its underlying code so that it can learn how to write novels by memorizing and imitating performances instead of via a matrix of probabilities (fundamental change to its internal capabilities)

Category 8: a text-based LLM chooses when to interact with the world.

(The original categories I developed years ago were based on what the system can modify: its weights, how many weights, what kinds of weights, etc. I think this might be clearer?)

I don't think even Moltbook is showing anything above 4.

troed@swecyb.com

@leeloo A much better answer is "So are humans".

(according to everything we've so far been able to document regarding our own processes)

leeloo@chaosfem.tw

@troed
The part that we understand about how our brain works is so simple that we can understand it.

The rest, we have no clue about.

Replicating the simple parts and pretending that will get us anywhere close to intelligence is the kind of magic I'm talking about.

troed@swecyb.com

@leeloo We don't know that. It's equally likely that we have a belief in that there must be some kind of "magic" in our brains that there simply isn't.

From a physics standpoint there can be no magic - the brain is just a large neural network with various inputs (wind blowing on arm hair etc) that results in outputs (mouth moving).

leeloo@chaosfem.tw

@troed
Be specific. "We don't know that" does not tell me anything about which part of my reply you are referring to.

Especially as my comment was a combination of obvious statements and claims that we don't know.

troed@swecyb.com

@leeloo We don't know that there are other things happening in the brain than what we have already documented.

The belief that there's "magic" happening in the brain is part of the argument between dualists and monists - that there's somehow a "mind" that's separate from the body. So far we've found nothing to support such a claim.

(My own studies in neuroscience are a decade old but I do follow the discourse)

wolf480pl@mstdn.io

@dragonfrog
I think an ML model trained to speedrun a platformer game is intelligent like a mosquito, but LLMs probably aren't.
@lmorchard @leeloo

leeloo@chaosfem.tw

@troed
If we are nothing but input -> math -> output, then human rights don't matter. Murdering someone is no different from switching a device off.

If that's the world view you want to argue, that's on you.

It also assumes that there are nothing left to discover. Which has been a mistake every time anyone has made the claim in any other area. Are humans really that much simpler than the rest of the universe?

To be clear, I did not say that there is any kind of magic involved in human intelligence. I said that the part of "AI" that people get on the defence over when we reduce it to math and software is magic, because unlike humans - where I must remind you that I said we don't know - we know exacly what those datacenters are doing: Math and software.

troed@swecyb.com

@leeloo I'm not arguing for a worldview - I'm just talking about what's the current state of science on the topic.

leeloo@chaosfem.tw

@troed
Yet you couldn't simply let my claim stand that we don't know what lied beyond the current state.

CIRCLE WITH A DOT

As a software developer who took an elective in neural networks - when people call LLMs stochastic parrots, that's not criticism of their results.