As a software developer who took an elective in neural networks - when people call LLMs stochastic parrots, that's not criticism of their results.
-
As a software developer who took an elective in neural networks - when people call LLMs stochastic parrots, that's not criticism of their results.
It's literally a description of how they work.
The so-called training data is used to build a huge database of words and the probability of them fitting together.
Stochastic because the whole thing is statistics.
Parrot because the answer is just repeating the most probable word combinations from its training dataset.Calling an LLM a stochastic parrot is lile calling a car a motorised vehicle with wheels. It doesn't say anything about cars being good or bad. It does, however, take away the magic. So if you feel a need to defend AI when you hear the term stochastic parrot, consider that you may have elevated them to a god-like status, and that's why you go on the defense when the magic is dispelled.
@leeloo As a side note, I sometimes worry about how much parroting happens in academia among humans even before/without LLMs, where people repeat things without understanding what they’re talking about. I guess at least for students, it sometimes is about learning to talk the talk, and then gradually developing more understanding and genuine thinking around topics. At least we humans are capable of developing that understanding if we bother to try.
-
@leeloo The thing is, how can we sure that human intelligence does not essentially work in the same way? My Christian believe tells me we have a soul and LLM's do not, that may be the difference. But from an agnostic perspective, we might reach the point where one cannot tell the difference.
@tobifant
A LLM is not able to reason. It can fool you into believing it is intelligent and self aware, where in fact it just parrots the patterns it has stored. These patterns are however very human-like as they are the result of training on texts written by actual humans.The fun part starts now where the entire internet got flooded by #ai generated content. All of this will be the training set for the next generation of LLM's. What could possibly go wrong?
@leeloo -
@leeloo I myself like calling LLMs "glorified autocomplete". Or "Т9 на максималках" in Russian.
It's surprising just how defensive some people get when I say that even when they agree with my definition. They keep believing that just give this thing more parameters and something magical, something more than sum of its parts will emerge, any moment now, just one more model generation, just one more order of magnitude, I promise.
@grishka
The fun part is that the next generation will have the current state of the internet as its training set. An internet that is flooded by #ai generated content.The biggest issue those ai companies face at the moment is how to only ingest human generated content and filter out as much as possible of all of the ai generated crap that is out there.
Good luck with that.
@leeloo -
-
R relay@relay.an.exchange shared this topic
-
As a software developer who took an elective in neural networks - when people call LLMs stochastic parrots, that's not criticism of their results.
It's literally a description of how they work.
The so-called training data is used to build a huge database of words and the probability of them fitting together.
Stochastic because the whole thing is statistics.
Parrot because the answer is just repeating the most probable word combinations from its training dataset.Calling an LLM a stochastic parrot is lile calling a car a motorised vehicle with wheels. It doesn't say anything about cars being good or bad. It does, however, take away the magic. So if you feel a need to defend AI when you hear the term stochastic parrot, consider that you may have elevated them to a god-like status, and that's why you go on the defense when the magic is dispelled.
@leeloo If I want to disparage, I say "LLMs are just a word list with a randomizer". It's slightly less accurate, because it's a very specific kind of word list and a likewise specific kind of randomizer, but it gets the cultists all riled up. So that's cool.
-
As a software developer who took an elective in neural networks - when people call LLMs stochastic parrots, that's not criticism of their results.
It's literally a description of how they work.
The so-called training data is used to build a huge database of words and the probability of them fitting together.
Stochastic because the whole thing is statistics.
Parrot because the answer is just repeating the most probable word combinations from its training dataset.Calling an LLM a stochastic parrot is lile calling a car a motorised vehicle with wheels. It doesn't say anything about cars being good or bad. It does, however, take away the magic. So if you feel a need to defend AI when you hear the term stochastic parrot, consider that you may have elevated them to a god-like status, and that's why you go on the defense when the magic is dispelled.
nope. What you describe as "stocastical parrot" is Markov, Hidden Markov Model (HMM) , not a VLLM.
You can find an HMM in your mobile phone, AKA T9, AKA "keyboard suggestions".
-
R relay@relay.mycrowd.ca shared this topic
-
nope. What you describe as "stocastical parrot" is Markov, Hidden Markov Model (HMM) , not a VLLM.
You can find an HMM in your mobile phone, AKA T9, AKA "keyboard suggestions".
@uriel
What part exactly are you saying nope to.Dispelling the magic and god-like status or some specific detail?
-
@uriel
What part exactly are you saying nope to.Dispelling the magic and god-like status or some specific detail?
nope to the bunch of bullshit you wrote under the assumption a VLLM is a Hidden Markov Model , aka "stochastic parrot".
-
@calcifer
> But the hype is unreal and legitimately dangerous.I blame Sam Altman for that 100%
-
nope to the bunch of bullshit you wrote under the assumption a VLLM is a Hidden Markov Model , aka "stochastic parrot".
@uriel
Ah, so you are saying that you decided that I said something I never did, and then saying nope to that, so that you can pretend that you have a real argument.Like when creationists try arguing against evolution using pseudo-scientific arguments to hide that they are defending the bible.
-
As a software developer who took an elective in neural networks - when people call LLMs stochastic parrots, that's not criticism of their results.
It's literally a description of how they work.
The so-called training data is used to build a huge database of words and the probability of them fitting together.
Stochastic because the whole thing is statistics.
Parrot because the answer is just repeating the most probable word combinations from its training dataset.Calling an LLM a stochastic parrot is lile calling a car a motorised vehicle with wheels. It doesn't say anything about cars being good or bad. It does, however, take away the magic. So if you feel a need to defend AI when you hear the term stochastic parrot, consider that you may have elevated them to a god-like status, and that's why you go on the defense when the magic is dispelled.
Oh, the good old “I was misunderstood.” I genuinely hope your communication skills improve someday, so you can finally express your ideas clearly
-
nope to the bunch of bullshit you wrote under the assumption a VLLM is a Hidden Markov Model , aka "stochastic parrot".
@uriel
What I'm saying is that you are beating a strawman of your own making and putting words in my mouth. -
@robotistry
@leeloo
so it's a parrot not because it's a matrix of probabilities, but because its hasn't experienced the real-world consequences of its words/actions and updated the probabilities based on those consequences?@wolf480pl @leeloo No. Maybe this will help.
0: one action, no choice (clockwork automaton, wind-up toy)
1: different actions, no choices (RC car)
2: choice, no plan (reactive robot)
3a: plan, no on-line or off-line learning (adaptive robot)
3b: plan, no on-line learning (same number for 3a and 3b because these are effectively the same when operating)
4: on-line learning - but only what and how it has been told
5a: ability to spontaneously generate new categories of output without being explicitly asked or told to do so (WBEAT)
5b: ability to spontaneously identify new categories of the same kinds of input WBEAT
6: ability to spontaneously identify new kinds of things to learn WBEAT
7: ability to spontaneously identify new ways to learn WBEAT
8: ability to choose new things to learn WBEATLLMs that you're not training are category 3b. They are static machines, responding to your input like an elevator responding to a button push.
LLMs that learn are category 4.
1/2
-
@wolf480pl @leeloo No. Maybe this will help.
0: one action, no choice (clockwork automaton, wind-up toy)
1: different actions, no choices (RC car)
2: choice, no plan (reactive robot)
3a: plan, no on-line or off-line learning (adaptive robot)
3b: plan, no on-line learning (same number for 3a and 3b because these are effectively the same when operating)
4: on-line learning - but only what and how it has been told
5a: ability to spontaneously generate new categories of output without being explicitly asked or told to do so (WBEAT)
5b: ability to spontaneously identify new categories of the same kinds of input WBEAT
6: ability to spontaneously identify new kinds of things to learn WBEAT
7: ability to spontaneously identify new ways to learn WBEAT
8: ability to choose new things to learn WBEATLLMs that you're not training are category 3b. They are static machines, responding to your input like an elevator responding to a button push.
LLMs that learn are category 4.
1/2
@wolf480pl @leeloo Examples:
Category 5a: a text-based LLM that spontaneously, without being asked, learns to output musical notation.
Category 5b: a text-based LLM that spontaneously, unprompted, without being asked, learns that filenames can be used as input.
Category 6: a text-based LLM that spontaneously, without being asked (directly or indirectly) learns that it can output ascii images or generate sounds instead of sentences.
Category 7: a text-based LLM spontaneously changes its underlying code so that it can learn how to write novels by memorizing and imitating performances instead of via a matrix of probabilities (fundamental change to its internal capabilities)
Category 8: a text-based LLM chooses when to interact with the world.
(The original categories I developed years ago were based on what the system can modify: its weights, how many weights, what kinds of weights, etc. I think this might be clearer?)
I don't think even Moltbook is showing anything above 4.
-
As a software developer who took an elective in neural networks - when people call LLMs stochastic parrots, that's not criticism of their results.
It's literally a description of how they work.
The so-called training data is used to build a huge database of words and the probability of them fitting together.
Stochastic because the whole thing is statistics.
Parrot because the answer is just repeating the most probable word combinations from its training dataset.Calling an LLM a stochastic parrot is lile calling a car a motorised vehicle with wheels. It doesn't say anything about cars being good or bad. It does, however, take away the magic. So if you feel a need to defend AI when you hear the term stochastic parrot, consider that you may have elevated them to a god-like status, and that's why you go on the defense when the magic is dispelled.
@leeloo A much better answer is "So are humans".
(according to everything we've so far been able to document regarding our own processes)
-
R relay@relay.infosec.exchange shared this topic
-
@leeloo A much better answer is "So are humans".
(according to everything we've so far been able to document regarding our own processes)
@troed
The part that we understand about how our brain works is so simple that we can understand it.The rest, we have no clue about.
Replicating the simple parts and pretending that will get us anywhere close to intelligence is the kind of magic I'm talking about.
-
@troed
The part that we understand about how our brain works is so simple that we can understand it.The rest, we have no clue about.
Replicating the simple parts and pretending that will get us anywhere close to intelligence is the kind of magic I'm talking about.
@leeloo We don't know that. It's equally likely that we have a belief in that there must be some kind of "magic" in our brains that there simply isn't.
From a physics standpoint there can be no magic - the brain is just a large neural network with various inputs (wind blowing on arm hair etc) that results in outputs (mouth moving).
-
@leeloo We don't know that. It's equally likely that we have a belief in that there must be some kind of "magic" in our brains that there simply isn't.
From a physics standpoint there can be no magic - the brain is just a large neural network with various inputs (wind blowing on arm hair etc) that results in outputs (mouth moving).
@troed
Be specific. "We don't know that" does not tell me anything about which part of my reply you are referring to.Especially as my comment was a combination of obvious statements and claims that we don't know.
-
@troed
Be specific. "We don't know that" does not tell me anything about which part of my reply you are referring to.Especially as my comment was a combination of obvious statements and claims that we don't know.
@leeloo We don't know that there are other things happening in the brain than what we have already documented.
The belief that there's "magic" happening in the brain is part of the argument between dualists and monists - that there's somehow a "mind" that's separate from the body. So far we've found nothing to support such a claim.
(My own studies in neuroscience are a decade old but I do follow the discourse)
-
@lmorchard @leeloo @wolf480pl I guess part of it is maybe that I don't think intelligence is some exclusively human thing. LLMs clearly aren't human-like intelligent. I'm personally confident they're not as intelligent as any primate.
But are they as intelligent as a shrimp? I think they've got to be more intelligent than a mosquito.
I wouldn't turn to a shrimp for advice but they're not *without* intelligence.
@dragonfrog
I think an ML model trained to speedrun a platformer game is intelligent like a mosquito, but LLMs probably aren't.
@lmorchard @leeloo