Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. LLM advocates still don’t seem to be able to comprehend that ordering the machine not to ‘make stuff up’ doesn’t help.

LLM advocates still don’t seem to be able to comprehend that ordering the machine not to ‘make stuff up’ doesn’t help.

Scheduled Pinned Locked Moved Uncategorized
79 Posts 30 Posters 0 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • complexmath@hachyderm.ioC complexmath@hachyderm.io

    @nelson @solonovamax @benjamineskola For better and worse, ML is an optimization algorithm designed to provide statistically close-to-ideal responses (with some jitter to break out of bad loops) to arbitrary input based on training (historic data). It's fantastic for, say, industrial control systems that want to keep a chemical reaction under control, but the nature of the math is that you can train it on any sequence of values, and this includes words. The problem is that language has contextual meaning, and the human brain is very much built to see patterns and meaning in things, even when they aren't there. Like how we see faces in clouds, for example. This technology is the faces in clouds engine.

    complexmath@hachyderm.ioC This user is from outside of this forum
    complexmath@hachyderm.ioC This user is from outside of this forum
    complexmath@hachyderm.io
    wrote last edited by
    #37

    @nelson @solonovamax @benjamineskola If you're at all interested in some of the historic discourse regarding this sort of technology, google John Searle's Chinese room argument.

    1 Reply Last reply
    0
    • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

      LLM advocates still don’t seem to be able to comprehend that ordering the machine not to ‘make stuff up’ doesn’t help. It doesn’t know when it’s making stuff up, and it couldn’t change that even if you told it to. (In fact it’s always just making stuff up, and is only ever true by chance.)

      Part of why I’m so negative about them is that their advocates simply do not understand how they work and do not seem to want to.

      Link Preview Image
      Dare Obasanjo (@carnage4life@mas.to)

      Attached: 1 image Hallucinations are the bane of my existence when using Claude Code and that has significantly improved after adding the following instructions to my Claude .MD file. Sharing for anyone else who uses Claude as a daily driver for analysis and writing but has gotten frustrated by it making things up.

      favicon

      mas.to (mas.to)

      cxj@phpc.socialC This user is from outside of this forum
      cxj@phpc.socialC This user is from outside of this forum
      cxj@phpc.social
      wrote last edited by
      #38

      @benjamineskola Thank you for putting it so clearly.

      1 Reply Last reply
      0
      • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

        @MontgomeryGator I don’t think I said otherwise.

        montgomerygator@fouroclockfarms.clubM This user is from outside of this forum
        montgomerygator@fouroclockfarms.clubM This user is from outside of this forum
        montgomerygator@fouroclockfarms.club
        wrote last edited by
        #39

        @benjamineskola Oh, I wasn't arguing with you. I was reinforcing your point with an engineering perspective. Literally why GenAI can never have hallucinations taken out, since its all just a controlled hallucination from the start.

        benjamineskola@hachyderm.ioB 1 Reply Last reply
        0
        • montgomerygator@fouroclockfarms.clubM montgomerygator@fouroclockfarms.club

          @benjamineskola Oh, I wasn't arguing with you. I was reinforcing your point with an engineering perspective. Literally why GenAI can never have hallucinations taken out, since its all just a controlled hallucination from the start.

          benjamineskola@hachyderm.ioB This user is from outside of this forum
          benjamineskola@hachyderm.ioB This user is from outside of this forum
          benjamineskola@hachyderm.io
          wrote last edited by
          #40

          @MontgomeryGator ah, sorry, I didn’t realise that ‘this’ referred to the image and interpreted it as a disagreement with something I was saying.

          No, you’re right, and that’s the problem with terminology like ‘hallucination’ or ‘lying’: the implication that there’s any distinction between the types of output it produces other than how much the user subjectively values them.

          1 Reply Last reply
          0
          • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

            @Beatpoet13 in all seriousness I don’t like the terminology of ‘lying’ here either. It implies intent.

            It’s not a lie for the same reason that it’s not a hallucination; there’s no difference from the LLM’s perspective. It’s not capable of evaluating the truth-value of its output, much less intentionally producing untrue (or true) statements. It’s mere probability.

            Responsible usage of these tools would involve mechanisms to increase the probability of the desired output, but pretending it’s capable of evaluating that itself will not help at all.

            sherapantsuit@mastodon.socialS This user is from outside of this forum
            sherapantsuit@mastodon.socialS This user is from outside of this forum
            sherapantsuit@mastodon.social
            wrote last edited by
            #41

            @benjamineskola @Beatpoet13 it can be helpful to replace an llm with "repeatedly pressing the suggested word on your phone keyboard."

            If it spits out "I am a funny hamster", you wouldn't say it lied.

            Humans are just not conditioned --- not wired, frankly --- to engage with machine generated, syntactically valid text. We suck at it. The ELIZA Effect wins every time.

            benjamineskola@hachyderm.ioB beatpoet13@mastodon.socialB 2 Replies Last reply
            0
            • sherapantsuit@mastodon.socialS sherapantsuit@mastodon.social

              @benjamineskola @Beatpoet13 it can be helpful to replace an llm with "repeatedly pressing the suggested word on your phone keyboard."

              If it spits out "I am a funny hamster", you wouldn't say it lied.

              Humans are just not conditioned --- not wired, frankly --- to engage with machine generated, syntactically valid text. We suck at it. The ELIZA Effect wins every time.

              benjamineskola@hachyderm.ioB This user is from outside of this forum
              benjamineskola@hachyderm.ioB This user is from outside of this forum
              benjamineskola@hachyderm.io
              wrote last edited by
              #42

              @SheRaPantsuit @Beatpoet13 Yeah, there’s probably something to be said about UI affordances or something like that, where the chat interface guides people into assuming intentionality where there is none, and where some other presentation might be more objectively received.

              datenwolf@chaos.socialD beatpoet13@mastodon.socialB 2 Replies Last reply
              0
              • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

                LLM advocates still don’t seem to be able to comprehend that ordering the machine not to ‘make stuff up’ doesn’t help. It doesn’t know when it’s making stuff up, and it couldn’t change that even if you told it to. (In fact it’s always just making stuff up, and is only ever true by chance.)

                Part of why I’m so negative about them is that their advocates simply do not understand how they work and do not seem to want to.

                Link Preview Image
                Dare Obasanjo (@carnage4life@mas.to)

                Attached: 1 image Hallucinations are the bane of my existence when using Claude Code and that has significantly improved after adding the following instructions to my Claude .MD file. Sharing for anyone else who uses Claude as a daily driver for analysis and writing but has gotten frustrated by it making things up.

                favicon

                mas.to (mas.to)

                transbian_arsonists@catwithaclari.netT This user is from outside of this forum
                transbian_arsonists@catwithaclari.netT This user is from outside of this forum
                transbian_arsonists@catwithaclari.net
                wrote last edited by
                #43

                @benjamineskola@hachyderm.io well yeah if they knew how it worked they wouldnt advocate for it

                - posted by Seraphine
                Headmate Hopper

                1 Reply Last reply
                0
                • nelson@wetdry.worldN nelson@wetdry.world

                  @solonovamax @benjamineskola i feel like it's more that it just wasn't built to give out answers, it was trained not to answer truthfully and "understand" but to just come up with something that sounds kind of convincing

                  solonovamax@tech.lgbtS This user is from outside of this forum
                  solonovamax@tech.lgbtS This user is from outside of this forum
                  solonovamax@tech.lgbt
                  wrote last edited by
                  #44

                  @nelson @benjamineskola agents (the general word for entities performing actions to achieve their goal, not talking about necessarily "AI agents", this word even applies to people, and even something like a thermostat that controls temperature) that wish to achieve their goals should be able to accurately model the real world
                  their ability to model the real world is directly correlated with their ability to achieve their goals. so, an agent which can accurately model the real world is able to achieve its goal much more easily that one that cannot accurately model the real world

                  and, people generally call an accurate model of the real world "truth"

                  hypothetically, the transformer architecture should be able to scale to human-level intelligence as it is turing-complete.
                  so, how it was trained doesn't necessarily matter, it's just that it is not capable of modeling the real world, so it cannot evaluate the truthiness of a statement

                  1 Reply Last reply
                  0
                  • complexmath@hachyderm.ioC complexmath@hachyderm.io

                    @nelson @solonovamax @benjamineskola For better and worse, ML is an optimization algorithm designed to provide statistically close-to-ideal responses (with some jitter to break out of bad loops) to arbitrary input based on training (historic data). It's fantastic for, say, industrial control systems that want to keep a chemical reaction under control, but the nature of the math is that you can train it on any sequence of values, and this includes words. The problem is that language has contextual meaning, and the human brain is very much built to see patterns and meaning in things, even when they aren't there. Like how we see faces in clouds, for example. This technology is the faces in clouds engine.

                    missconstrue@mefi.socialM This user is from outside of this forum
                    missconstrue@mefi.socialM This user is from outside of this forum
                    missconstrue@mefi.social
                    wrote last edited by
                    #45

                    @complexmath @nelson @solonovamax @benjamineskola

                    Exactly so. Ada Lovelace, patron saint of code, in the 1840s, gave us "Lady Lovelace’s Objection," whereupon she famously stated that machines "have no pretensions whatever to originate anything," saying they could only perform tasks they were instructed to do.

                    “AI” LLMs as they are sold to the rubes is just a spellchecker on steroids. It does not reason. It does not think. It correlates data it has been fed to reach a probability.

                    Telling it to not hallucinate is some serious cargo cult thinking.

                    solonovamax@tech.lgbtS 1 Reply Last reply
                    1
                    0
                    • missconstrue@mefi.socialM missconstrue@mefi.social

                      @complexmath @nelson @solonovamax @benjamineskola

                      Exactly so. Ada Lovelace, patron saint of code, in the 1840s, gave us "Lady Lovelace’s Objection," whereupon she famously stated that machines "have no pretensions whatever to originate anything," saying they could only perform tasks they were instructed to do.

                      “AI” LLMs as they are sold to the rubes is just a spellchecker on steroids. It does not reason. It does not think. It correlates data it has been fed to reach a probability.

                      Telling it to not hallucinate is some serious cargo cult thinking.

                      solonovamax@tech.lgbtS This user is from outside of this forum
                      solonovamax@tech.lgbtS This user is from outside of this forum
                      solonovamax@tech.lgbt
                      wrote last edited by
                      #46

                      @MissConstrue @complexmath @nelson @benjamineskola hypothetically it is possible for an artificial agent (read: "AI") to be capable of accurately modeling the world and "thinking", however it seems that this is not currently even remotely the case.

                      benjamineskola@hachyderm.ioB missconstrue@mefi.socialM 2 Replies Last reply
                      0
                      • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

                        LLM advocates still don’t seem to be able to comprehend that ordering the machine not to ‘make stuff up’ doesn’t help. It doesn’t know when it’s making stuff up, and it couldn’t change that even if you told it to. (In fact it’s always just making stuff up, and is only ever true by chance.)

                        Part of why I’m so negative about them is that their advocates simply do not understand how they work and do not seem to want to.

                        Link Preview Image
                        Dare Obasanjo (@carnage4life@mas.to)

                        Attached: 1 image Hallucinations are the bane of my existence when using Claude Code and that has significantly improved after adding the following instructions to my Claude .MD file. Sharing for anyone else who uses Claude as a daily driver for analysis and writing but has gotten frustrated by it making things up.

                        favicon

                        mas.to (mas.to)

                        hopfgeist@digitalcourage.socialH This user is from outside of this forum
                        hopfgeist@digitalcourage.socialH This user is from outside of this forum
                        hopfgeist@digitalcourage.social
                        wrote last edited by
                        #47

                        @benjamineskola I am a safety engineer for safety-relevant and mission-critical systems. And it is disheartening to see safety professionals at international conferences present 2-page-long prompts, doing basically all this, but much more so, and expect their "spicy autocomplete machine" (@pluralistic) to create safety analyses this way. And always talking about the LLM as if it could think. And prompt it to show its internal steps and "reasoning", not understanding that it does no such thing. It just creates another string of words that sounds as if an intelligence were describing the inner workings.
                        The upshot almost always is "It sucks, we have to check and correct everything. We love it. It is the future!" 🤡🤪

                        1 Reply Last reply
                        1
                        0
                        • solonovamax@tech.lgbtS solonovamax@tech.lgbt

                          @MissConstrue @complexmath @nelson @benjamineskola hypothetically it is possible for an artificial agent (read: "AI") to be capable of accurately modeling the world and "thinking", however it seems that this is not currently even remotely the case.

                          benjamineskola@hachyderm.ioB This user is from outside of this forum
                          benjamineskola@hachyderm.ioB This user is from outside of this forum
                          benjamineskola@hachyderm.io
                          wrote last edited by
                          #48

                          @solonovamax @MissConstrue @complexmath @nelson I would not expect a large language model to be capable of doing so, no matter how advanced. An ‘AI’ ‘agent’ based on some other technology? Perhaps. But at that point we’re literally just saying ‘technically it’s not impossible for this to exist in future’; we’re in the realm of science fiction.

                          solonovamax@tech.lgbtS 1 Reply Last reply
                          0
                          • mushroom_man@infosec.exchangeM mushroom_man@infosec.exchange

                            @junkman I see your car analogy and I raise you a slavery analogy: that, too, had “proved some degree of usefulness” for “some people very good at wielding it as a tool”.

                            And I bet you’d rather not know how it worked (works) if you were the one finding it handy for the benefits it provided you. Saying you don’t partake while loudly proclaiming its usefulness is not fooling anyone.

                            junkman@mastodon.socialJ This user is from outside of this forum
                            junkman@mastodon.socialJ This user is from outside of this forum
                            junkman@mastodon.social
                            wrote last edited by
                            #49

                            @mushroom_man that analogy is also useful.

                            I don't like that the whole LLM functionality is based on stealing humanity's knowledge for profit. Outright violated (shitty) copyright law and basically the regular people got screwed over while mega corporations just agreed not to make a big fuzz about it.

                            The foundations are so rotten and just so people can chat with their computers instead of doing manual pointing and typing.

                            1 Reply Last reply
                            0
                            • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

                              @SheRaPantsuit @Beatpoet13 Yeah, there’s probably something to be said about UI affordances or something like that, where the chat interface guides people into assuming intentionality where there is none, and where some other presentation might be more objectively received.

                              datenwolf@chaos.socialD This user is from outside of this forum
                              datenwolf@chaos.socialD This user is from outside of this forum
                              datenwolf@chaos.social
                              wrote last edited by
                              #50

                              @benjamineskola @SheRaPantsuit @Beatpoet13

                              It's even worse. Interactive LLMs create a linguistic bypass channel that "connects" parts in our minds/brains that are ordinarily separated by filters for plausibility and attenuating uncontrolled feedback. Furthermore, they can be tailored to adversely amplify select thought patterns.

                              They're the first, rudimentary implementation of the kind of cognitohazards that used to be science fiction.

                              Already they're potent cult-indoctrination machines.

                              1/

                              datenwolf@chaos.socialD beatpoet13@mastodon.socialB 2 Replies Last reply
                              0
                              • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

                                @SheRaPantsuit @Beatpoet13 Yeah, there’s probably something to be said about UI affordances or something like that, where the chat interface guides people into assuming intentionality where there is none, and where some other presentation might be more objectively received.

                                beatpoet13@mastodon.socialB This user is from outside of this forum
                                beatpoet13@mastodon.socialB This user is from outside of this forum
                                beatpoet13@mastodon.social
                                wrote last edited by
                                #51

                                @benjamineskola @SheRaPantsuit
                                hm not being near fluent in tech, kindly elaborate on what this means 'cause I do live on a curiosity>confusion dynamic ...

                                benjamineskola@hachyderm.ioB beatpoet13@mastodon.socialB 2 Replies Last reply
                                0
                                • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

                                  @solonovamax @MissConstrue @complexmath @nelson I would not expect a large language model to be capable of doing so, no matter how advanced. An ‘AI’ ‘agent’ based on some other technology? Perhaps. But at that point we’re literally just saying ‘technically it’s not impossible for this to exist in future’; we’re in the realm of science fiction.

                                  solonovamax@tech.lgbtS This user is from outside of this forum
                                  solonovamax@tech.lgbtS This user is from outside of this forum
                                  solonovamax@tech.lgbt
                                  wrote last edited by
                                  #52

                                  @benjamineskola @MissConstrue @complexmath @nelson I'm using the word "agent" to not necessarily refer to "AI agents"

                                  see: https://tech.lgbt/@solonovamax/116659064720106166

                                  but yes, I currently believe that an artificial agent capable of thought and accurately modeling the world is science fiction
                                  however I believe it is possible, only based on the fact that the transformer architecture is turing complete. but it might not be efficient for this, it might require like a model that's 10,000x larger than what is currently the largest possible model. I do not believe it is something that is possible in the near future (well, I hope it isn't).

                                  solonovamax@tech.lgbtS benjamineskola@hachyderm.ioB eestileib@tech.lgbtE 3 Replies Last reply
                                  0
                                  • sherapantsuit@mastodon.socialS sherapantsuit@mastodon.social

                                    @Aedius @benjamineskola language consists of two parts, the form and the sign

                                    the form is the "tangible" part of the language, e.g. this text, or whatever soundwave physics bullshit is happening when we talk

                                    the sign is the meaning, what you might visualize in your head when you read the word "cat"

                                    LLMs only have access to form, so when the meaning of text is important (read: always), LLMs are not very useful

                                    to this, promptfondlers always reply "but today is the worst it's ever gonna be"

                                    aedius@lavraievie.socialA This user is from outside of this forum
                                    aedius@lavraievie.socialA This user is from outside of this forum
                                    aedius@lavraievie.social
                                    wrote last edited by
                                    #53

                                    @SheRaPantsuit @benjamineskola

                                    I will add that to my presentation for the nexts teams at work ^^

                                    sherapantsuit@mastodon.socialS 1 Reply Last reply
                                    0
                                    • solonovamax@tech.lgbtS solonovamax@tech.lgbt

                                      @benjamineskola @MissConstrue @complexmath @nelson I'm using the word "agent" to not necessarily refer to "AI agents"

                                      see: https://tech.lgbt/@solonovamax/116659064720106166

                                      but yes, I currently believe that an artificial agent capable of thought and accurately modeling the world is science fiction
                                      however I believe it is possible, only based on the fact that the transformer architecture is turing complete. but it might not be efficient for this, it might require like a model that's 10,000x larger than what is currently the largest possible model. I do not believe it is something that is possible in the near future (well, I hope it isn't).

                                      solonovamax@tech.lgbtS This user is from outside of this forum
                                      solonovamax@tech.lgbtS This user is from outside of this forum
                                      solonovamax@tech.lgbt
                                      wrote last edited by
                                      #54

                                      @benjamineskola @MissConstrue @complexmath @nelson the word "agent" is used to mean an actor performing actions to achieve their goal
                                      it can be from something as simple as a thermometer that measures the inside temperature to adjust an HVAC system to remain within a given range (the actions being turning on/off the heat/cooling & the goal being to achieve a given temperature), to as complex as a human

                                      1 Reply Last reply
                                      0
                                      • beatpoet13@mastodon.socialB beatpoet13@mastodon.social

                                        @benjamineskola @SheRaPantsuit
                                        hm not being near fluent in tech, kindly elaborate on what this means 'cause I do live on a curiosity>confusion dynamic ...

                                        benjamineskola@hachyderm.ioB This user is from outside of this forum
                                        benjamineskola@hachyderm.ioB This user is from outside of this forum
                                        benjamineskola@hachyderm.io
                                        wrote last edited by
                                        #55

                                        @Beatpoet13 @SheRaPantsuit what we’re saying is that they’re not lying; they can’t lie because they don’t think.

                                        They’re just producing random text, basically, and sometimes that’s ‘true’ and sometimes it isn’t. But there’s no mind behind it.

                                        (But the human brain tends to assume there’s intelligence present even when it’s just a mechanical process, and — I think — the way they’re presented magnifies that tendency.)

                                        sherapantsuit@mastodon.socialS beatpoet13@mastodon.socialB 2 Replies Last reply
                                        0
                                        • solonovamax@tech.lgbtS solonovamax@tech.lgbt

                                          @MissConstrue @complexmath @nelson @benjamineskola hypothetically it is possible for an artificial agent (read: "AI") to be capable of accurately modeling the world and "thinking", however it seems that this is not currently even remotely the case.

                                          missconstrue@mefi.socialM This user is from outside of this forum
                                          missconstrue@mefi.socialM This user is from outside of this forum
                                          missconstrue@mefi.social
                                          wrote last edited by
                                          #56

                                          @solonovamax @complexmath @nelson @benjamineskola

                                          Theoretically, yes. But I point you towards one of my favorite long term AI projects, Cyc. (Whom I haven’t checked up on since Doug died.). I knew Doug since the 80s, and sort of stayed in the loop with what they were doing, because trying to develop an ontological system fascinates me. I want it to happen, I just don’t think we have the compute yet. I’m not sure it can be done with anything less than quantum.

                                          But Doug’s vision of AI had nothing really in common with current LLM, especially since nobody has come up with a way for it to be commercially viable, because that wasn’t the vision. Knowledge was the dream. As electric dreams go, it was a pretty groovy one.

                                          solonovamax@tech.lgbtS 1 Reply Last reply
                                          0
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Login or register to search.
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • World
                                          • Users
                                          • Groups