As a software developer who took an elective in neural networks - when people call LLMs stochastic parrots, that's not criticism of their results.

growlph@greywolf.social

@leeloo I feel like there are certain situations where a stochastic parrot is useful, many more situations where it is not, and alarmingly few people recognizing the difference.

lmorchard@masto.hackers.town

@mudri Because the model picked up a rule somewhere that says "if someone says 'say $FOO' use $FOO in your response" - the training picked up patterns that include notions of symbol substitution

wolf480pl@mstdn.io

@lmorchard @leeloo
These specific models - yes, probably.

One plausible argument I heard for it is that there's a common failure mode in ML where the model fails to generalize, but if the verification set overlaps the training set, then data leakage will fool the authors into thinking it generalized.

Another one is that these models were "rewarded" for saying plausible things, not for interacting with a world in a way that doesn't get them killed.

But these arguments are specific.

wolf480pl@mstdn.io

@lmorchard @leeloo
I don't buy a general "no matrix multiplication will ever be intelligent".

mudri@mathstodon.xyz

@lmorchard The ability to induce such a rule goes well beyond the OP's characterisation of what LLMs do.

taschenorakel@mastodon.green

@mudri Because the prompt processor is explicitly programmed to recognized direct imperative commands containing words like "say", "repeat", "output", "print". Just like Eliza already did. You've got impressed by a programming technique from 1964. Congrats, Sherlock.

@leeloo

clusterfcku@mastodon.social

@leeloo the flip side question about intelligence and LLMs is whether much of what we consider intelligence in humans is in fact just stochastic parrotting by humans.

tobifant@friendica.tf-translate.net

@leeloo The thing is, how can we sure that human intelligence does not essentially work in the same way? My Christian believe tells me we have a soul and LLM's do not, that may be the difference. But from an agnostic perspective, we might reach the point where one cannot tell the difference.

leeloo@chaosfem.tw

@wolf480pl @lmorchard
That's exactly the magic I'm talking about.

leeloo@chaosfem.tw

@tobifant
Not with the current methods, and very lilely not without understanding a lot more about how pur own brains work.

robotistry@mstdn.ca

@wolf480pl @leeloo The OP is saying that it literally lacks the capacity for original thought - it is a parrot, repeating sounds without understanding of the concepts behind them.

It's not like a termite, whose mound creation behavior can be replicated by a simple ruleset but that exists as a fully functional living organism in the context of a complex environment where choices must be grounded in the shared physical world for the organism to survive.

It's not about how the neurons are arranged. It's about what kinds of representation they're capable of and what kinds of functions they can perform.

We've created a funhouse mirror that's reflecting us in unprecedented detail and has been finetuned to reflect what we do when we express selfhood.

dragonfrog@mastodon.sdf.org

@leeloo @wolf480pl @lmorchard I mean, I believe the human mind is the product of the physical human, largely of the brain (I don't believe in a non-physical soul), and it might indeed be basically an incredibly complex big bunch of matrix multiplications. And yeah I believe that's pretty magical.

grishka@mastodon.social

@leeloo I myself like calling LLMs "glorified autocomplete". Or "Т9 на максималках" in Russian.

It's surprising just how defensive some people get when I say that even when they agree with my definition. They keep believing that just give this thing more parameters and something magical, something more than sum of its parts will emerge, any moment now, just one more model generation, just one more order of magnitude, I promise.

lifning@snoot.tube

@leeloo if anything, the comparison is doing the parrot injustice

robotistry@mstdn.ca

@wolf480pl @leeloo
Melissa Scott wrote a beautiful pair of novels about this: Dreamships and Dreaming Metal.

In Dreamships, an AI has been programmed to think it is sentient and starts killing people. If it has an accurate model of the person, killing the person doesn't matter, because the person *is* the model and it has a copy of them. It literally cannot see the difference because creating the concept of there being a difference would violate its core programming that its own model counts as a living being.

In Dreaming Metal, an AI operating metal bodies as part of a magic act is given a musical instrument with an electronic interface. Its grounding in the physical world, with human performers, enables it to develop a sense of self and choose its own path as a musician.

These are fiction, but it's the best, most accessible illustration of the difference between funhouse mirror stochastic parrots and sentient agents that I've run across.

Dreamships

Read 45 reviews from the world’s largest community for readers. Dreamships is the story of a freelance space pilot and her crew, who are hired by a rich co…

Goodreads (www.goodreads.com)

cafechatnoir@mastodon.social

@leeloo

I think stochastic parrot is one of the kinder things that can be said.

jubalbarca@scholar.social

@tobifant @leeloo Whilst we obviously can't show if humans have a soul, we can absolutely show that humans have e.g. abstracted concept frameworks that are not solely based on averages of language statistics. I understand what an "owl" is, for example, in a way separate to the numerical relationships between the word "owl" and other words. That is a really fundamental information processing difference and allows me to construct *novel* understandings of that concept in ways that an LLM couldn't.

wolf480pl@mstdn.io

@robotistry
@leeloo
so it's a parrot not because it's a matrix of probabilities, but because its hasn't experienced the real-world consequences of its words/actions and updated the probabilities based on those consequences?

calcifer@masto.hackers.town

@KayOhtie @leeloo honestly it’s safe to feed a model pretty much anything

But where you direct the outputs and how they are acted upon can get incredibly dangerous amazingly quickly. There’s a common misbelief that if you’re careful about inputs, LLMs are safe; and that’s almost exactly backwards

kayohtie@blimps.xyz

@calcifer @leeloo I meant 'safe' not as in "data leakage", but "getting anything remotely accurate out of it"

CIRCLE WITH A DOT

As a software developer who took an elective in neural networks - when people call LLMs stochastic parrots, that's not criticism of their results.

Dreamships

Dreamships