LLM advocates still don’t seem to be able to comprehend that ordering the machine not to ‘make stuff up’ doesn’t help.

linkplay@biplus.social

@nelson @benjamineskola @solonovamax
yeah, i think my take from about a year ago still mostly holds up https://biplus.social/@linkplay/114828181247605258

complexmath@hachyderm.io

@nelson @solonovamax @benjamineskola For better and worse, ML is an optimization algorithm designed to provide statistically close-to-ideal responses (with some jitter to break out of bad loops) to arbitrary input based on training (historic data). It's fantastic for, say, industrial control systems that want to keep a chemical reaction under control, but the nature of the math is that you can train it on any sequence of values, and this includes words. The problem is that language has contextual meaning, and the human brain is very much built to see patterns and meaning in things, even when they aren't there. Like how we see faces in clouds, for example. This technology is the faces in clouds engine.

montgomerygator@fouroclockfarms.club

@benjamineskola It never stopped being this, just a version of this that has reduced errors. It's a corrective algorithm being fed noise and direction to correct towards. It has no sense of self, reality, or anything like that. Just an overgrown version of your noise canceling headphones algorithm, where the outside noise is it's starting point and your music is the prompt it tried to acheve.

benjamineskola@hachyderm.io

@MontgomeryGator I don’t think I said otherwise.

complexmath@hachyderm.io

@nelson @solonovamax @benjamineskola If you're at all interested in some of the historic discourse regarding this sort of technology, google John Searle's Chinese room argument.

cxj@phpc.social

@benjamineskola Thank you for putting it so clearly.

montgomerygator@fouroclockfarms.club

@benjamineskola Oh, I wasn't arguing with you. I was reinforcing your point with an engineering perspective. Literally why GenAI can never have hallucinations taken out, since its all just a controlled hallucination from the start.

benjamineskola@hachyderm.io

@MontgomeryGator ah, sorry, I didn’t realise that ‘this’ referred to the image and interpreted it as a disagreement with something I was saying.

No, you’re right, and that’s the problem with terminology like ‘hallucination’ or ‘lying’: the implication that there’s any distinction between the types of output it produces other than how much the user subjectively values them.

sherapantsuit@mastodon.social

@benjamineskola @Beatpoet13 it can be helpful to replace an llm with "repeatedly pressing the suggested word on your phone keyboard."

If it spits out "I am a funny hamster", you wouldn't say it lied.

Humans are just not conditioned --- not wired, frankly --- to engage with machine generated, syntactically valid text. We suck at it. The ELIZA Effect wins every time.

benjamineskola@hachyderm.io

@SheRaPantsuit @Beatpoet13 Yeah, there’s probably something to be said about UI affordances or something like that, where the chat interface guides people into assuming intentionality where there is none, and where some other presentation might be more objectively received.

transbian_arsonists@catwithaclari.net

@benjamineskola@hachyderm.io well yeah if they knew how it worked they wouldnt advocate for it

- posted by Seraphine
Headmate Hopper

solonovamax@tech.lgbt

@nelson @benjamineskola agents (the general word for entities performing actions to achieve their goal, not talking about necessarily "AI agents", this word even applies to people, and even something like a thermostat that controls temperature) that wish to achieve their goals should be able to accurately model the real world
their ability to model the real world is directly correlated with their ability to achieve their goals. so, an agent which can accurately model the real world is able to achieve its goal much more easily that one that cannot accurately model the real world

and, people generally call an accurate model of the real world "truth"

hypothetically, the transformer architecture should be able to scale to human-level intelligence as it is turing-complete.
so, how it was trained doesn't necessarily matter, it's just that it is not capable of modeling the real world, so it cannot evaluate the truthiness of a statement

missconstrue@mefi.social

@complexmath @nelson @solonovamax @benjamineskola

Exactly so. Ada Lovelace, patron saint of code, in the 1840s, gave us "Lady Lovelace’s Objection," whereupon she famously stated that machines "have no pretensions whatever to originate anything," saying they could only perform tasks they were instructed to do.

“AI” LLMs as they are sold to the rubes is just a spellchecker on steroids. It does not reason. It does not think. It correlates data it has been fed to reach a probability.

Telling it to not hallucinate is some serious cargo cult thinking.

solonovamax@tech.lgbt

@MissConstrue @complexmath @nelson @benjamineskola hypothetically it is possible for an artificial agent (read: "AI") to be capable of accurately modeling the world and "thinking", however it seems that this is not currently even remotely the case.

hopfgeist@digitalcourage.social

@benjamineskola I am a safety engineer for safety-relevant and mission-critical systems. And it is disheartening to see safety professionals at international conferences present 2-page-long prompts, doing basically all this, but much more so, and expect their "spicy autocomplete machine" (@pluralistic) to create safety analyses this way. And always talking about the LLM as if it could think. And prompt it to show its internal steps and "reasoning", not understanding that it does no such thing. It just creates another string of words that sounds as if an intelligence were describing the inner workings.
The upshot almost always is "It sucks, we have to check and correct everything. We love it. It is the future!" 🤪

benjamineskola@hachyderm.io

@solonovamax @MissConstrue @complexmath @nelson I would not expect a large language model to be capable of doing so, no matter how advanced. An ‘AI’ ‘agent’ based on some other technology? Perhaps. But at that point we’re literally just saying ‘technically it’s not impossible for this to exist in future’; we’re in the realm of science fiction.

junkman@mastodon.social

@mushroom_man that analogy is also useful.

I don't like that the whole LLM functionality is based on stealing humanity's knowledge for profit. Outright violated (shitty) copyright law and basically the regular people got screwed over while mega corporations just agreed not to make a big fuzz about it.

The foundations are so rotten and just so people can chat with their computers instead of doing manual pointing and typing.

datenwolf@chaos.social

@benjamineskola @SheRaPantsuit @Beatpoet13

It's even worse. Interactive LLMs create a linguistic bypass channel that "connects" parts in our minds/brains that are ordinarily separated by filters for plausibility and attenuating uncontrolled feedback. Furthermore, they can be tailored to adversely amplify select thought patterns.

They're the first, rudimentary implementation of the kind of cognitohazards that used to be science fiction.

Already they're potent cult-indoctrination machines.

1/

beatpoet13@mastodon.social

@benjamineskola @SheRaPantsuit
hm not being near fluent in tech, kindly elaborate on what this means 'cause I do live on a curiosity>confusion dynamic ...

solonovamax@tech.lgbt

@benjamineskola @MissConstrue @complexmath @nelson I'm using the word "agent" to not necessarily refer to "AI agents"

see: https://tech.lgbt/@solonovamax/116659064720106166

but yes, I currently believe that an artificial agent capable of thought and accurately modeling the world is science fiction
however I believe it is possible, only based on the fact that the transformer architecture is turing complete. but it might not be efficient for this, it might require like a model that's 10,000x larger than what is currently the largest possible model. I do not believe it is something that is possible in the near future (well, I hope it isn't).

CIRCLE WITH A DOT

LLM advocates still don’t seem to be able to comprehend that ordering the machine not to ‘make stuff up’ doesn’t help.

Dare Obasanjo (@carnage4life@mas.to)

Dare Obasanjo (@carnage4life@mas.to)

Dare Obasanjo (@carnage4life@mas.to)

Dare Obasanjo (@carnage4life@mas.to)