Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. LLM advocates still don’t seem to be able to comprehend that ordering the machine not to ‘make stuff up’ doesn’t help.

LLM advocates still don’t seem to be able to comprehend that ordering the machine not to ‘make stuff up’ doesn’t help.

Scheduled Pinned Locked Moved Uncategorized
79 Posts 30 Posters 0 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

    LLM advocates still don’t seem to be able to comprehend that ordering the machine not to ‘make stuff up’ doesn’t help. It doesn’t know when it’s making stuff up, and it couldn’t change that even if you told it to. (In fact it’s always just making stuff up, and is only ever true by chance.)

    Part of why I’m so negative about them is that their advocates simply do not understand how they work and do not seem to want to.

    Link Preview Image
    Dare Obasanjo (@carnage4life@mas.to)

    Attached: 1 image Hallucinations are the bane of my existence when using Claude Code and that has significantly improved after adding the following instructions to my Claude .MD file. Sharing for anyone else who uses Claude as a daily driver for analysis and writing but has gotten frustrated by it making things up.

    favicon

    mas.to (mas.to)

    gbargoud@masto.nycG This user is from outside of this forum
    gbargoud@masto.nycG This user is from outside of this forum
    gbargoud@masto.nyc
    wrote last edited by
    #25

    @benjamineskola

    Any LLM output that takes more than a few seconds of low mental effort to check (so anything more than a line or two of tab complete in a strongly typed coding language in an IDE with built in error underlining) makes me want to throw my computer out of the window.

    I don't know how some people use it to generate a whole paragraph let alone multiple.

    1 Reply Last reply
    0
    • michaelgemar@cosocial.caM michaelgemar@cosocial.ca

      @solonovamax @benjamineskola Exactly — IT DOESN’T THINK. It’s not a mind, but just a huge statistical matrix.

      (And in top of that, don’t folks like this feel ashamed that their job has devolved to pleading with a mindless idiot genie, rather than writing deterministic code?)

      mdione@en.osm.townM This user is from outside of this forum
      mdione@en.osm.townM This user is from outside of this forum
      mdione@en.osm.town
      wrote last edited by
      #26

      @michaelgemar @solonovamax @benjamineskola I'm trying to convert my suddenly boss obsessed with our overlords pushing AI down our throats to at least use AI not as process but to at least write crappy but deterministic code 😞

      1 Reply Last reply
      0
      • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

        @michaelgemar @solonovamax Yes precisely. An argument that a huge statistical matrix is useful for certain tasks is valid.

        (I disagree. It seems bad at all the tasks people want to use it for, as well as wasteful and soul-destroying. But it’s at least valid.)

        But pretending it’s doing something that it isn’t undermines any possibility of actual usefulness.

        michaelgemar@cosocial.caM This user is from outside of this forum
        michaelgemar@cosocial.caM This user is from outside of this forum
        michaelgemar@cosocial.ca
        wrote last edited by
        #27

        @benjamineskola @solonovamax I’m sure that there is genuine utility in LLMs and the newer approaches to AI in general. But too many people are being led astray by LLMs intentional appearance of mentality so that they interact with them as people, as having understanding. That is wrong and risks terrible errors in the final product.

        1 Reply Last reply
        0
        • aedius@lavraievie.socialA aedius@lavraievie.social

          @benjamineskola

          That's funny when you explain to people how it really works, if they are not LLM advocates, they see exactly why it can't work.

          It's literally created with to make stuff up.

          sherapantsuit@mastodon.socialS This user is from outside of this forum
          sherapantsuit@mastodon.socialS This user is from outside of this forum
          sherapantsuit@mastodon.social
          wrote last edited by
          #28

          @Aedius @benjamineskola language consists of two parts, the form and the sign

          the form is the "tangible" part of the language, e.g. this text, or whatever soundwave physics bullshit is happening when we talk

          the sign is the meaning, what you might visualize in your head when you read the word "cat"

          LLMs only have access to form, so when the meaning of text is important (read: always), LLMs are not very useful

          to this, promptfondlers always reply "but today is the worst it's ever gonna be"

          aedius@lavraievie.socialA 1 Reply Last reply
          0
          • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

            @prietschka I do recall that a few weeks back he was complaining that LLM advocates get made to feel unwelcome on the fediverse. (OK? I don’t care. It’s nobody’s job to make people feel good about their bad opinions.)

            And then just a couple of days ago he was posting something critical, and like … yes this is what we’ve been saying all along.

            prietschka@mastodon.socialP This user is from outside of this forum
            prietschka@mastodon.socialP This user is from outside of this forum
            prietschka@mastodon.social
            wrote last edited by
            #29

            @benjamineskola The problem with Obasanjo is he's utterly unprincipled and just chasing engagement/self-aggrandizement. His purpose for being in social spaces like Masto/Bluesky/X is to stroke his ego, so everything he does is just an act of public masturbation.

            He's interested in self-aggrandizement and self-promotion, nothing more.

            Which is why I use the descriptor "piece of shit" with regard to him.

            1 Reply Last reply
            0
            • junkman@mastodon.socialJ junkman@mastodon.social

              @benjamineskola I don't partake on the LLM stuff but they've already proved some degree of usefulness.

              I don't think people in general would trust them with sensitive (health, well being) stuff but from what I hear, some people are very good at wielding it as a tool.

              I don't know how my car works, I'd be a terrible mechanic and racer. Still, it's very handy tool for me to move around.

              mushroom_man@infosec.exchangeM This user is from outside of this forum
              mushroom_man@infosec.exchangeM This user is from outside of this forum
              mushroom_man@infosec.exchange
              wrote last edited by
              #30

              @junkman I see your car analogy and I raise you a slavery analogy: that, too, had “proved some degree of usefulness” for “some people very good at wielding it as a tool”.

              And I bet you’d rather not know how it worked (works) if you were the one finding it handy for the benefits it provided you. Saying you don’t partake while loudly proclaiming its usefulness is not fooling anyone.

              junkman@mastodon.socialJ 1 Reply Last reply
              1
              0
              • R relay@relay.infosec.exchange shared this topic
              • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

                LLM advocates still don’t seem to be able to comprehend that ordering the machine not to ‘make stuff up’ doesn’t help. It doesn’t know when it’s making stuff up, and it couldn’t change that even if you told it to. (In fact it’s always just making stuff up, and is only ever true by chance.)

                Part of why I’m so negative about them is that their advocates simply do not understand how they work and do not seem to want to.

                Link Preview Image
                Dare Obasanjo (@carnage4life@mas.to)

                Attached: 1 image Hallucinations are the bane of my existence when using Claude Code and that has significantly improved after adding the following instructions to my Claude .MD file. Sharing for anyone else who uses Claude as a daily driver for analysis and writing but has gotten frustrated by it making things up.

                favicon

                mas.to (mas.to)

                beatpoet13@mastodon.socialB This user is from outside of this forum
                beatpoet13@mastodon.socialB This user is from outside of this forum
                beatpoet13@mastodon.social
                wrote last edited by
                #31

                @benjamineskola
                perhaps renaming them little lying malware might help ...

                benjamineskola@hachyderm.ioB 1 Reply Last reply
                0
                • beatpoet13@mastodon.socialB beatpoet13@mastodon.social

                  @benjamineskola
                  perhaps renaming them little lying malware might help ...

                  benjamineskola@hachyderm.ioB This user is from outside of this forum
                  benjamineskola@hachyderm.ioB This user is from outside of this forum
                  benjamineskola@hachyderm.io
                  wrote last edited by
                  #32

                  @Beatpoet13 in all seriousness I don’t like the terminology of ‘lying’ here either. It implies intent.

                  It’s not a lie for the same reason that it’s not a hallucination; there’s no difference from the LLM’s perspective. It’s not capable of evaluating the truth-value of its output, much less intentionally producing untrue (or true) statements. It’s mere probability.

                  Responsible usage of these tools would involve mechanisms to increase the probability of the desired output, but pretending it’s capable of evaluating that itself will not help at all.

                  sherapantsuit@mastodon.socialS beatpoet13@mastodon.socialB 2 Replies Last reply
                  0
                  • nelson@wetdry.worldN nelson@wetdry.world

                    @benjamineskola @solonovamax yeah it's just good at knowing what the next word is, so it can string something mathematically coherent

                    kind of like how "ai art" tends to be extremely generic looking because it quite literally aims to pick the most average in its dataset for a specific prompt

                    linkplay@biplus.socialL This user is from outside of this forum
                    linkplay@biplus.socialL This user is from outside of this forum
                    linkplay@biplus.social
                    wrote last edited by
                    #33

                    @nelson @benjamineskola @solonovamax
                    yeah, i think my take from about a year ago still mostly holds up https://biplus.social/@linkplay/114828181247605258

                    1 Reply Last reply
                    0
                    • nelson@wetdry.worldN nelson@wetdry.world

                      @solonovamax @benjamineskola i feel like it's more that it just wasn't built to give out answers, it was trained not to answer truthfully and "understand" but to just come up with something that sounds kind of convincing

                      complexmath@hachyderm.ioC This user is from outside of this forum
                      complexmath@hachyderm.ioC This user is from outside of this forum
                      complexmath@hachyderm.io
                      wrote last edited by
                      #34

                      @nelson @solonovamax @benjamineskola For better and worse, ML is an optimization algorithm designed to provide statistically close-to-ideal responses (with some jitter to break out of bad loops) to arbitrary input based on training (historic data). It's fantastic for, say, industrial control systems that want to keep a chemical reaction under control, but the nature of the math is that you can train it on any sequence of values, and this includes words. The problem is that language has contextual meaning, and the human brain is very much built to see patterns and meaning in things, even when they aren't there. Like how we see faces in clouds, for example. This technology is the faces in clouds engine.

                      complexmath@hachyderm.ioC missconstrue@mefi.socialM 2 Replies Last reply
                      0
                      • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

                        LLM advocates still don’t seem to be able to comprehend that ordering the machine not to ‘make stuff up’ doesn’t help. It doesn’t know when it’s making stuff up, and it couldn’t change that even if you told it to. (In fact it’s always just making stuff up, and is only ever true by chance.)

                        Part of why I’m so negative about them is that their advocates simply do not understand how they work and do not seem to want to.

                        Link Preview Image
                        Dare Obasanjo (@carnage4life@mas.to)

                        Attached: 1 image Hallucinations are the bane of my existence when using Claude Code and that has significantly improved after adding the following instructions to my Claude .MD file. Sharing for anyone else who uses Claude as a daily driver for analysis and writing but has gotten frustrated by it making things up.

                        favicon

                        mas.to (mas.to)

                        montgomerygator@fouroclockfarms.clubM This user is from outside of this forum
                        montgomerygator@fouroclockfarms.clubM This user is from outside of this forum
                        montgomerygator@fouroclockfarms.club
                        wrote last edited by
                        #35

                        @benjamineskola It never stopped being this, just a version of this that has reduced errors. It's a corrective algorithm being fed noise and direction to correct towards. It has no sense of self, reality, or anything like that. Just an overgrown version of your noise canceling headphones algorithm, where the outside noise is it's starting point and your music is the prompt it tried to acheve.

                        Link Preview Image
                        benjamineskola@hachyderm.ioB 1 Reply Last reply
                        0
                        • montgomerygator@fouroclockfarms.clubM montgomerygator@fouroclockfarms.club

                          @benjamineskola It never stopped being this, just a version of this that has reduced errors. It's a corrective algorithm being fed noise and direction to correct towards. It has no sense of self, reality, or anything like that. Just an overgrown version of your noise canceling headphones algorithm, where the outside noise is it's starting point and your music is the prompt it tried to acheve.

                          Link Preview Image
                          benjamineskola@hachyderm.ioB This user is from outside of this forum
                          benjamineskola@hachyderm.ioB This user is from outside of this forum
                          benjamineskola@hachyderm.io
                          wrote last edited by
                          #36

                          @MontgomeryGator I don’t think I said otherwise.

                          montgomerygator@fouroclockfarms.clubM 1 Reply Last reply
                          0
                          • complexmath@hachyderm.ioC complexmath@hachyderm.io

                            @nelson @solonovamax @benjamineskola For better and worse, ML is an optimization algorithm designed to provide statistically close-to-ideal responses (with some jitter to break out of bad loops) to arbitrary input based on training (historic data). It's fantastic for, say, industrial control systems that want to keep a chemical reaction under control, but the nature of the math is that you can train it on any sequence of values, and this includes words. The problem is that language has contextual meaning, and the human brain is very much built to see patterns and meaning in things, even when they aren't there. Like how we see faces in clouds, for example. This technology is the faces in clouds engine.

                            complexmath@hachyderm.ioC This user is from outside of this forum
                            complexmath@hachyderm.ioC This user is from outside of this forum
                            complexmath@hachyderm.io
                            wrote last edited by
                            #37

                            @nelson @solonovamax @benjamineskola If you're at all interested in some of the historic discourse regarding this sort of technology, google John Searle's Chinese room argument.

                            1 Reply Last reply
                            0
                            • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

                              LLM advocates still don’t seem to be able to comprehend that ordering the machine not to ‘make stuff up’ doesn’t help. It doesn’t know when it’s making stuff up, and it couldn’t change that even if you told it to. (In fact it’s always just making stuff up, and is only ever true by chance.)

                              Part of why I’m so negative about them is that their advocates simply do not understand how they work and do not seem to want to.

                              Link Preview Image
                              Dare Obasanjo (@carnage4life@mas.to)

                              Attached: 1 image Hallucinations are the bane of my existence when using Claude Code and that has significantly improved after adding the following instructions to my Claude .MD file. Sharing for anyone else who uses Claude as a daily driver for analysis and writing but has gotten frustrated by it making things up.

                              favicon

                              mas.to (mas.to)

                              cxj@phpc.socialC This user is from outside of this forum
                              cxj@phpc.socialC This user is from outside of this forum
                              cxj@phpc.social
                              wrote last edited by
                              #38

                              @benjamineskola Thank you for putting it so clearly.

                              1 Reply Last reply
                              0
                              • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

                                @MontgomeryGator I don’t think I said otherwise.

                                montgomerygator@fouroclockfarms.clubM This user is from outside of this forum
                                montgomerygator@fouroclockfarms.clubM This user is from outside of this forum
                                montgomerygator@fouroclockfarms.club
                                wrote last edited by
                                #39

                                @benjamineskola Oh, I wasn't arguing with you. I was reinforcing your point with an engineering perspective. Literally why GenAI can never have hallucinations taken out, since its all just a controlled hallucination from the start.

                                benjamineskola@hachyderm.ioB 1 Reply Last reply
                                0
                                • montgomerygator@fouroclockfarms.clubM montgomerygator@fouroclockfarms.club

                                  @benjamineskola Oh, I wasn't arguing with you. I was reinforcing your point with an engineering perspective. Literally why GenAI can never have hallucinations taken out, since its all just a controlled hallucination from the start.

                                  benjamineskola@hachyderm.ioB This user is from outside of this forum
                                  benjamineskola@hachyderm.ioB This user is from outside of this forum
                                  benjamineskola@hachyderm.io
                                  wrote last edited by
                                  #40

                                  @MontgomeryGator ah, sorry, I didn’t realise that ‘this’ referred to the image and interpreted it as a disagreement with something I was saying.

                                  No, you’re right, and that’s the problem with terminology like ‘hallucination’ or ‘lying’: the implication that there’s any distinction between the types of output it produces other than how much the user subjectively values them.

                                  1 Reply Last reply
                                  0
                                  • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

                                    @Beatpoet13 in all seriousness I don’t like the terminology of ‘lying’ here either. It implies intent.

                                    It’s not a lie for the same reason that it’s not a hallucination; there’s no difference from the LLM’s perspective. It’s not capable of evaluating the truth-value of its output, much less intentionally producing untrue (or true) statements. It’s mere probability.

                                    Responsible usage of these tools would involve mechanisms to increase the probability of the desired output, but pretending it’s capable of evaluating that itself will not help at all.

                                    sherapantsuit@mastodon.socialS This user is from outside of this forum
                                    sherapantsuit@mastodon.socialS This user is from outside of this forum
                                    sherapantsuit@mastodon.social
                                    wrote last edited by
                                    #41

                                    @benjamineskola @Beatpoet13 it can be helpful to replace an llm with "repeatedly pressing the suggested word on your phone keyboard."

                                    If it spits out "I am a funny hamster", you wouldn't say it lied.

                                    Humans are just not conditioned --- not wired, frankly --- to engage with machine generated, syntactically valid text. We suck at it. The ELIZA Effect wins every time.

                                    benjamineskola@hachyderm.ioB beatpoet13@mastodon.socialB 2 Replies Last reply
                                    0
                                    • sherapantsuit@mastodon.socialS sherapantsuit@mastodon.social

                                      @benjamineskola @Beatpoet13 it can be helpful to replace an llm with "repeatedly pressing the suggested word on your phone keyboard."

                                      If it spits out "I am a funny hamster", you wouldn't say it lied.

                                      Humans are just not conditioned --- not wired, frankly --- to engage with machine generated, syntactically valid text. We suck at it. The ELIZA Effect wins every time.

                                      benjamineskola@hachyderm.ioB This user is from outside of this forum
                                      benjamineskola@hachyderm.ioB This user is from outside of this forum
                                      benjamineskola@hachyderm.io
                                      wrote last edited by
                                      #42

                                      @SheRaPantsuit @Beatpoet13 Yeah, there’s probably something to be said about UI affordances or something like that, where the chat interface guides people into assuming intentionality where there is none, and where some other presentation might be more objectively received.

                                      datenwolf@chaos.socialD beatpoet13@mastodon.socialB 2 Replies Last reply
                                      0
                                      • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

                                        LLM advocates still don’t seem to be able to comprehend that ordering the machine not to ‘make stuff up’ doesn’t help. It doesn’t know when it’s making stuff up, and it couldn’t change that even if you told it to. (In fact it’s always just making stuff up, and is only ever true by chance.)

                                        Part of why I’m so negative about them is that their advocates simply do not understand how they work and do not seem to want to.

                                        Link Preview Image
                                        Dare Obasanjo (@carnage4life@mas.to)

                                        Attached: 1 image Hallucinations are the bane of my existence when using Claude Code and that has significantly improved after adding the following instructions to my Claude .MD file. Sharing for anyone else who uses Claude as a daily driver for analysis and writing but has gotten frustrated by it making things up.

                                        favicon

                                        mas.to (mas.to)

                                        transbian_arsonists@catwithaclari.netT This user is from outside of this forum
                                        transbian_arsonists@catwithaclari.netT This user is from outside of this forum
                                        transbian_arsonists@catwithaclari.net
                                        wrote last edited by
                                        #43

                                        @benjamineskola@hachyderm.io well yeah if they knew how it worked they wouldnt advocate for it

                                        - posted by Seraphine
                                        Headmate Hopper

                                        1 Reply Last reply
                                        0
                                        • nelson@wetdry.worldN nelson@wetdry.world

                                          @solonovamax @benjamineskola i feel like it's more that it just wasn't built to give out answers, it was trained not to answer truthfully and "understand" but to just come up with something that sounds kind of convincing

                                          solonovamax@tech.lgbtS This user is from outside of this forum
                                          solonovamax@tech.lgbtS This user is from outside of this forum
                                          solonovamax@tech.lgbt
                                          wrote last edited by
                                          #44

                                          @nelson @benjamineskola agents (the general word for entities performing actions to achieve their goal, not talking about necessarily "AI agents", this word even applies to people, and even something like a thermostat that controls temperature) that wish to achieve their goals should be able to accurately model the real world
                                          their ability to model the real world is directly correlated with their ability to achieve their goals. so, an agent which can accurately model the real world is able to achieve its goal much more easily that one that cannot accurately model the real world

                                          and, people generally call an accurate model of the real world "truth"

                                          hypothetically, the transformer architecture should be able to scale to human-level intelligence as it is turing-complete.
                                          so, how it was trained doesn't necessarily matter, it's just that it is not capable of modeling the real world, so it cannot evaluate the truthiness of a statement

                                          1 Reply Last reply
                                          0
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Login or register to search.
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • World
                                          • Users
                                          • Groups