We knew, but the proof is nice.

    davidaugust@mastodon.online wrote:

    We knew, but the proof is nice.

    "Apple just proved that AI models cannot do math. Not advanced math. Grade school math. The kind a 10-year-old solves"

    The guess-the-next-words machines don’t actually understand anything.

    (nitter.poast.org)

    #math #ai

    brucemirken@mas.to wrote (#2):

    @davidaugust And yet large companies are firing actual reasoning, thinking humans to replace them with these dumb-ass machines. Staggering.

      davidaugust@mastodon.online wrote (#3):

      @BruceMirken staggering, stupefying and stupid.

      I think they’ll come to regret doing so. Many already have.

        brucemirken@mas.to wrote (#4):

        @davidaugust And so will investors when the AI bubble implodes, which it inevitably will. Humans have short memories.

          lemgandi@mastodon.social wrote (#5):

          @davidaugust

          In other shocking news:

          Water is Wet
          Without air you will die

            joriki@infosec.exchange wrote (#6):

            @davidaugust

            not new, here's the 2024 paper referenced:

            GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models (arXiv paper 2410.05229, arxiv.org)

              smartmanapps@dotnet.social wrote (#7):

              @davidaugust
              I have a whole thread of proof 😂
              https://dotnet.social/@SmartmanApps/116000100388648367

                glitzersachen@hachyderm.io wrote (#8):

                @davidaugust

                Don't let @scottjenson catch you disseminating defeatist news on AI.

                It's utterly your fault that we have this bad reputation on the Fedi with respect to AI.

                @xdydx

                  davidaugust@mastodon.online wrote (#9):

                  @glitzersachen @scottjenson @xdydx guessing you are joking. But also suspect it may be an inside joke with not a lot of folks on the inside.

                    xdydx@mastodon.social wrote (#10):

                    @davidaugust @glitzersachen

                    Actually, this particular joke has the attention of quite a few people.

                    Scott Jenson (@scottjenson@social.coop)

                    OK, this is going even MORE sideways so I need to make a few things clear:
                    1. I took a complex point and made it poorly
                    2. My goal was to ask for more inclusiveness
                    3. I am sickened by what happened to BlackTwitter and I don't want it to recur
                    4. But I can't speak for BlackTwitter nor should I
                    5. I apologize to black mastodon users for making such a poor comparison
                    6. I'm not endorsing "AI Slop"; they were a foil to make my point
                    7. I'm certainly NOT trying to compare AI bros to Black twitter (but, as I said, I can see how people made that connection. I'm trying to correct that here)

                    social.coop (social.coop)

                    @scottjenson

                      monniauxd@social.sciences.re wrote (#11):

                      @davidaugust Well, there have actually been successes from connecting LLMs to proof assistants and computer algebra programs. As this post rightly puts it, the LLM is not capable in itself of performing computations reliably, but it can write commands to send to a computer algebra program, or proof candidates to send to a proof assistant. The proof assistant can answer that a proof is incorrect, and the process repeats until a correct proof is produced.

                      See also uses by pro mathematicians:
                      https://bsky.app/profile/wildverzweigt.bsky.social/post/3miua4ulxhk2f

                      Also see Terence Tao
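
The propose-and-check loop described in this post can be sketched roughly as below. This is only a toy illustration, not how any real system is wired up: `toy_proposer` is a hypothetical stand-in for the LLM (it just walks through a fixed list of guesses), and `is_root` is a stand-in for the proof assistant or computer algebra program (it checks a candidate root of x^2 - 5x + 6 with exact integer arithmetic). All names here are assumptions made for the example.

```python
from typing import Callable, List, Optional, Tuple


def propose_and_verify(
    propose: Callable[[List[int]], int],
    check: Callable[[int], bool],
    max_rounds: int = 10,
) -> Tuple[Optional[int], List[int]]:
    """Ask the proposer for candidates until the trusted checker accepts one.

    Rejected candidates are fed back to the proposer on each round.
    Returns (accepted candidate or None, list of rejected candidates).
    """
    rejected: List[int] = []
    for _ in range(max_rounds):
        candidate = propose(rejected)
        if check(candidate):
            return candidate, rejected
        rejected.append(candidate)
    return None, rejected


def toy_proposer(rejected: List[int]) -> int:
    """Hypothetical stand-in for an LLM: unreliable guesses, but it avoids
    repeating anything the checker has already rejected."""
    for guess in [1, 4, 3, 2]:
        if guess not in rejected:
            return guess
    raise RuntimeError("proposer ran out of guesses")


def is_root(x: int) -> bool:
    """Stand-in for the trusted checker: exact test of x^2 - 5x + 6 == 0."""
    return x * x - 5 * x + 6 == 0


accepted, rejected = propose_and_verify(toy_proposer, is_root)
print(accepted, rejected)  # 3 [1, 4]
```

The point of the design is that correctness rests entirely on the checker: the proposer can be wrong as often as it likes, as long as something reliable decides what gets accepted.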

                        sobex@social.sciences.re wrote (#12):

                        @davidaugust Direct link to the paper https://arxiv.org/pdf/2410.05229 (presented at ICLR 2025).

                        So it doesn't seem to be very recent news.

                          karen5lund@mastodon.social wrote (#13):

                          @davidaugust In about 80 years we've gone from a room full of computers the size of refrigerators that were good at crunching numbers but not much else to computers the size of corporate office parks that can draw almost-convincing pictures of people with five fingers (and thumbs, too!) but can't do elementary school math.

                          And some people call this progress.

                            davidaugust@mastodon.online wrote (#14):

                            @Sobex it’s from August.

                              davidaugust@mastodon.online wrote (#15):

                              @drifthood yes, there does seem to be a threshold that, in some respects, only humans cross.

                              I see that sort of begging in a dog. He wants the treat, so instead of just doing the desired behavior the human command is asking for, he tries every response that has ever gotten him a treat until he “unlocks” the treat. Humans can and do do this too from time to time, but humans _also_ actually communicate and understand from time to time as well.

                                davidaugust@mastodon.online wrote (#16):

                                @joriki it’s from August.

                                  audioflyer79@mstdn.social wrote (#17):

                                  @davidaugust Ecosia AI gets it right. It looks like the paper referenced was published in 2025, so the research was conducted before then. The models are all much better now. I’m no AI apologist, but I think any argument of “AI sucks because it’s not good at _____” is on tenuous ground and will be proven wrong as the models continue to improve. @Ecosia

                                    alisynthesis@io.waxandleather.com wrote (#18):

                                    @audioflyer79 @davidaugust I mean, it's worth noting that the LLMs have ingested that paper by now. : /

                                      audioflyer79@mstdn.social wrote (#19):

                                      @alisynthesis @davidaugust fair enough. I changed up the problem completely and added some reasoning and it did pretty well. It appears to be generating code to solve the math. The only thing it missed is that very unripe bananas are green, not yellow.

                                      James picks 40 apples on Monday. Then he picks 35 lemons on Tuesday. On Wednesday, he picks half as many bananas as he did apples, but five of them were very unripe. How many yellow fruits does James have?
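
For comparison, the arithmetic in this problem can be worked by hand. A minimal Python sketch follows, under assumptions that are mine rather than stated in the problem: lemons and ripe bananas count as yellow, apples do not, and the five very unripe bananas are green. The variable names are only illustrative.

```python
# Quantities taken directly from the word problem.
apples = 40                      # picked Monday; assumed NOT yellow
lemons = 35                      # picked Tuesday; assumed yellow
bananas = apples // 2            # "half as many bananas as apples" -> 20
unripe_bananas = 5               # very unripe, so assumed green

# Only ripe bananas count toward the yellow total.
yellow_bananas = bananas - unripe_bananas    # 20 - 5 = 15
yellow_fruits = lemons + yellow_bananas      # 35 + 15 = 50

print(yellow_fruits)  # 50
```

Counting the unripe bananas as yellow, which is the slip described above, would give 55 instead.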

                                        pikesley@mastodon.me.uk wrote (#20):

                                        @davidaugust

                                        Amo Bishop Rodent (@pikesley@mastodon.me.uk)

                                        "We made the computers, the notoriously accurate calculating machines, worse at arithmetic. This is surely progress along the path to creating Computer God"

                                        mastodon.me.uk (mastodon.me.uk)

                                          ozzelot@mstdn.social wrote (#21):

                                          @lemgandi
                                          The wetness of water has been hotly debated, as to some wet means "covered with or soaked in water", and it's questioned whether water is covered with itself.
                                          @davidaugust
