Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. I heard a reporter from Axios interviewed on NPR the other day (Marketplace Tech, I think) talking about how the tech companies are putting out new models every 6 months to 1 year and how each model is more "powerful" than the previous.

I heard a reporter from Axios interviewed on NPR the other day (Marketplace Tech, I think) talking about how the tech companies are putting out new models every 6 months to 1 year and how each model is more "powerful" than the previous.

Scheduled Pinned Locked Moved Uncategorized
27 Posts 8 Posters 0 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • emilymbender@dair-community.socialE emilymbender@dair-community.social

    And then finally there is the sense of powerful like an engine that generates power --- this is, I think, the deepest fantasy: you, the user (or better yet owner), have access to the "raw power" of the model.

    So I urge journalists and others who are tempted to describe models as "powerful" (or transcribe the PR copy of the companies that call them "powerful") to reflect on what you think that means, and what evidence you have that it is true.

    /fin

    joe@beige.partyJ This user is from outside of this forum
    joe@beige.partyJ This user is from outside of this forum
    joe@beige.party
    wrote last edited by
    #6

    @emilymbender not to argue - but Google I think is showing in some ways power in how some of its proof of concept tools do planning and action sequences. Generating text is the first iteration, then the chat response model/training, and the combination of tool use and planning seems to be a leap forward. Claude code, open claw and others are versions of AutoGPT - just kind of ripping off agile workflows for execution. I would say when the model can “consider” what to do, then execute using external tools - that’s a version of power. It’s the aspect that should excite and worry us. It does it with confidence, and not poorly in some limited cases.

    These kinds of tools tend to amaze me. They are a step beyond. I think as they train on more tokens they are just seeing a side effect of things like this working better, being able to read documents and file tax forms. Agreed to be very skeptical because it’s marketing hype and Claude has clogged the internet with paid astroturfing. It’s disgusting.

    Link Preview Image
    Stitch - Design with AI

    Stitch generates UIs for mobile and web applications, making design ideation fast and easy.

    favicon

    Stitch (stitch.withgoogle.com)

    joe@beige.partyJ emilymbender@dair-community.socialE fartnuggets@jorts.horseF 3 Replies Last reply
    0
    • emilymbender@dair-community.socialE emilymbender@dair-community.social

      And then finally there is the sense of powerful like an engine that generates power --- this is, I think, the deepest fantasy: you, the user (or better yet owner), have access to the "raw power" of the model.

      So I urge journalists and others who are tempted to describe models as "powerful" (or transcribe the PR copy of the companies that call them "powerful") to reflect on what you think that means, and what evidence you have that it is true.

      /fin

      mathaetaes@infosec.exchangeM This user is from outside of this forum
      mathaetaes@infosec.exchangeM This user is from outside of this forum
      mathaetaes@infosec.exchange
      wrote last edited by
      #7

      @emilymbender You're far more an expert on this than me, so I will defer to your experience.

      But in all these examples, "powerful" describes utility. A more powerful engine can do more things than a weaker one. A more advanced spreadsheet software can help users calculate/track more things than a less advanced one.

      By that definition, wouldn't "powerful" for LLMs just mean less wrong, or wrong less often?

      At the end of the day, a model is just predicting text. A model can't use a tool, but it can predict the text required to use a tool. A model can't write code, but it can predict the text that a compiler will turn into a program. We keep building integrations that allow tools to be driven by text, which allows text prediction models to 'use' them... but really it's still just predicting text.

      The only real metrics that apply to an LLM are size, speed, and accuracy. For frontier models, size and speed is always compensated by throwing more hardware at it, so users never see it. Thus, the only reasonable measure of power, for the journalistic contexts you're talking about, is the accuracy of the text it's predicting.

      Thus, a "more powerful AI model" is just one that is less wrong than the previous generations. No?

      That said, I do agree with your points that journalists are doing the PR firms' jobs for them when they use "more powerful" as a stand-in for "less wrong."

      1 Reply Last reply
      0
      • joe@beige.partyJ joe@beige.party

        @emilymbender not to argue - but Google I think is showing in some ways power in how some of its proof of concept tools do planning and action sequences. Generating text is the first iteration, then the chat response model/training, and the combination of tool use and planning seems to be a leap forward. Claude code, open claw and others are versions of AutoGPT - just kind of ripping off agile workflows for execution. I would say when the model can “consider” what to do, then execute using external tools - that’s a version of power. It’s the aspect that should excite and worry us. It does it with confidence, and not poorly in some limited cases.

        These kinds of tools tend to amaze me. They are a step beyond. I think as they train on more tokens they are just seeing a side effect of things like this working better, being able to read documents and file tax forms. Agreed to be very skeptical because it’s marketing hype and Claude has clogged the internet with paid astroturfing. It’s disgusting.

        Link Preview Image
        Stitch - Design with AI

        Stitch generates UIs for mobile and web applications, making design ideation fast and easy.

        favicon

        Stitch (stitch.withgoogle.com)

        joe@beige.partyJ This user is from outside of this forum
        joe@beige.partyJ This user is from outside of this forum
        joe@beige.party
        wrote last edited by
        #8

        @emilymbender sorry! I agree, there can be limited use of the word power - but being able to generate fingers better isn’t it. You need leaps.

        1 Reply Last reply
        0
        • joe@beige.partyJ joe@beige.party

          @emilymbender not to argue - but Google I think is showing in some ways power in how some of its proof of concept tools do planning and action sequences. Generating text is the first iteration, then the chat response model/training, and the combination of tool use and planning seems to be a leap forward. Claude code, open claw and others are versions of AutoGPT - just kind of ripping off agile workflows for execution. I would say when the model can “consider” what to do, then execute using external tools - that’s a version of power. It’s the aspect that should excite and worry us. It does it with confidence, and not poorly in some limited cases.

          These kinds of tools tend to amaze me. They are a step beyond. I think as they train on more tokens they are just seeing a side effect of things like this working better, being able to read documents and file tax forms. Agreed to be very skeptical because it’s marketing hype and Claude has clogged the internet with paid astroturfing. It’s disgusting.

          Link Preview Image
          Stitch - Design with AI

          Stitch generates UIs for mobile and web applications, making design ideation fast and easy.

          favicon

          Stitch (stitch.withgoogle.com)

          emilymbender@dair-community.socialE This user is from outside of this forum
          emilymbender@dair-community.socialE This user is from outside of this forum
          emilymbender@dair-community.social
          wrote last edited by
          #9

          @joe "Not to argue but" ... argue argue argue google PR argue argue

          joe@beige.partyJ 1 Reply Last reply
          0
          • emilymbender@dair-community.socialE emilymbender@dair-community.social

            And then finally there is the sense of powerful like an engine that generates power --- this is, I think, the deepest fantasy: you, the user (or better yet owner), have access to the "raw power" of the model.

            So I urge journalists and others who are tempted to describe models as "powerful" (or transcribe the PR copy of the companies that call them "powerful") to reflect on what you think that means, and what evidence you have that it is true.

            /fin

            joshg@mathstodon.xyzJ This user is from outside of this forum
            joshg@mathstodon.xyzJ This user is from outside of this forum
            joshg@mathstodon.xyz
            wrote last edited by
            #10

            @emilymbender yeah. I really suspect it's mostly the consumption.= aspect. it *must* be more powerful because it consumes so much power, like revving your loud gas-guzzling V8 hemi to show off

            1 Reply Last reply
            0
            • emilymbender@dair-community.socialE emilymbender@dair-community.social

              And then finally there is the sense of powerful like an engine that generates power --- this is, I think, the deepest fantasy: you, the user (or better yet owner), have access to the "raw power" of the model.

              So I urge journalists and others who are tempted to describe models as "powerful" (or transcribe the PR copy of the companies that call them "powerful") to reflect on what you think that means, and what evidence you have that it is true.

              /fin

              fuzzy@beige.partyF This user is from outside of this forum
              fuzzy@beige.partyF This user is from outside of this forum
              fuzzy@beige.party
              wrote last edited by
              #11

              @emilymbender which one word would you use?

              emilymbender@dair-community.socialE 1 Reply Last reply
              0
              • fuzzy@beige.partyF fuzzy@beige.party

                @emilymbender which one word would you use?

                emilymbender@dair-community.socialE This user is from outside of this forum
                emilymbender@dair-community.socialE This user is from outside of this forum
                emilymbender@dair-community.social
                wrote last edited by
                #12

                @fuzzy Why do you think there should be one word? Specificity is what is required here.

                fuzzy@beige.partyF 1 Reply Last reply
                0
                • emilymbender@dair-community.socialE emilymbender@dair-community.social

                  @fuzzy Why do you think there should be one word? Specificity is what is required here.

                  fuzzy@beige.partyF This user is from outside of this forum
                  fuzzy@beige.partyF This user is from outside of this forum
                  fuzzy@beige.party
                  wrote last edited by
                  #13

                  @emilymbender I don't believe that there should be a single word, but which one word would you use (instead of "powerful")?

                  emilymbender@dair-community.socialE 1 Reply Last reply
                  0
                  • emilymbender@dair-community.socialE emilymbender@dair-community.social

                    I heard a reporter from Axios interviewed on NPR the other day (Marketplace Tech, I think) talking about how the tech companies are putting out new models every 6 months to 1 year and how each model is more "powerful" than the previous.

                    This got me thinking about what it means when we describe technology and this technology in particular as "powerful".

                    🧵>>

                    bltpizza@mastodon.socialB This user is from outside of this forum
                    bltpizza@mastodon.socialB This user is from outside of this forum
                    bltpizza@mastodon.social
                    wrote last edited by
                    #14

                    @emilymbender Marketplace Tech segments have been enthusiastically promoting AI for over two years. I don't recall anything approaching a tough question for any guests or unbiased journalism.

                    1 Reply Last reply
                    0
                    • fuzzy@beige.partyF fuzzy@beige.party

                      @emilymbender I don't believe that there should be a single word, but which one word would you use (instead of "powerful")?

                      emilymbender@dair-community.socialE This user is from outside of this forum
                      emilymbender@dair-community.socialE This user is from outside of this forum
                      emilymbender@dair-community.social
                      wrote last edited by
                      #15

                      @fuzzy Which property of the models are you trying to label? .My thread was about how that word is being used with insidious ambiguity.

                      1 Reply Last reply
                      0
                      • joe@beige.partyJ joe@beige.party

                        @emilymbender not to argue - but Google I think is showing in some ways power in how some of its proof of concept tools do planning and action sequences. Generating text is the first iteration, then the chat response model/training, and the combination of tool use and planning seems to be a leap forward. Claude code, open claw and others are versions of AutoGPT - just kind of ripping off agile workflows for execution. I would say when the model can “consider” what to do, then execute using external tools - that’s a version of power. It’s the aspect that should excite and worry us. It does it with confidence, and not poorly in some limited cases.

                        These kinds of tools tend to amaze me. They are a step beyond. I think as they train on more tokens they are just seeing a side effect of things like this working better, being able to read documents and file tax forms. Agreed to be very skeptical because it’s marketing hype and Claude has clogged the internet with paid astroturfing. It’s disgusting.

                        Link Preview Image
                        Stitch - Design with AI

                        Stitch generates UIs for mobile and web applications, making design ideation fast and easy.

                        favicon

                        Stitch (stitch.withgoogle.com)

                        fartnuggets@jorts.horseF This user is from outside of this forum
                        fartnuggets@jorts.horseF This user is from outside of this forum
                        fartnuggets@jorts.horse
                        wrote last edited by
                        #16

                        @joe @emilymbender bro, you need to slow tf down, and learn a bit more about whom you're correcting. Tuck your ego back in your pants, and don't try to "not to argue" argue your talking points.

                        Knock it off.

                        joe@beige.partyJ 1 Reply Last reply
                        0
                        • fartnuggets@jorts.horseF fartnuggets@jorts.horse

                          @joe @emilymbender bro, you need to slow tf down, and learn a bit more about whom you're correcting. Tuck your ego back in your pants, and don't try to "not to argue" argue your talking points.

                          Knock it off.

                          joe@beige.partyJ This user is from outside of this forum
                          joe@beige.partyJ This user is from outside of this forum
                          joe@beige.party
                          wrote last edited by
                          #17

                          @fartnuggets @emilymbender lol I knew it, you can’t discuss anything on social media. Gender is more important. Thanks for putting me in my place.

                          joe@beige.partyJ fartnuggets@jorts.horseF 3 Replies Last reply
                          0
                          • joe@beige.partyJ joe@beige.party

                            @fartnuggets @emilymbender lol I knew it, you can’t discuss anything on social media. Gender is more important. Thanks for putting me in my place.

                            joe@beige.partyJ This user is from outside of this forum
                            joe@beige.partyJ This user is from outside of this forum
                            joe@beige.party
                            wrote last edited by
                            #18

                            @fartnuggets @emilymbender literally didn’t “correct” anyone. This is peak social media. Love when men attack men for existing in spaces.

                            joe@beige.partyJ svenouille@mastodon.socialS 2 Replies Last reply
                            0
                            • em0nm4stodon@infosec.exchangeE em0nm4stodon@infosec.exchange shared this topic
                              R relay@relay.infosec.exchange shared this topic
                            • emilymbender@dair-community.socialE emilymbender@dair-community.social

                              @joe "Not to argue but" ... argue argue argue google PR argue argue

                              joe@beige.partyJ This user is from outside of this forum
                              joe@beige.partyJ This user is from outside of this forum
                              joe@beige.party
                              wrote last edited by
                              #19

                              @emilymbender you’re right, being objective is silly.

                              joe@beige.partyJ 1 Reply Last reply
                              0
                              • joe@beige.partyJ joe@beige.party

                                @emilymbender you’re right, being objective is silly.

                                joe@beige.partyJ This user is from outside of this forum
                                joe@beige.partyJ This user is from outside of this forum
                                joe@beige.party
                                wrote last edited by
                                #20

                                @emilymbender you used the word evidence… it’s wild. Sorry I forgot I was wrong anyways. I can’t possibly be right.

                                1 Reply Last reply
                                0
                                • joe@beige.partyJ joe@beige.party

                                  @fartnuggets @emilymbender lol I knew it, you can’t discuss anything on social media. Gender is more important. Thanks for putting me in my place.

                                  fartnuggets@jorts.horseF This user is from outside of this forum
                                  fartnuggets@jorts.horseF This user is from outside of this forum
                                  fartnuggets@jorts.horse
                                  wrote last edited by
                                  #21

                                  @joe @emilymbender Not to argue, but thinking before you open your mouth is a good idea. Correcting an expert with a profound expertise, based on a misread of their posts, however, is not.

                                  Gender only comes into it because men do this all the fucking time.

                                  You'd be just as much of an ignorant moron correcting a man, and you'd fail to learn anything just the same. It's just got that special sprinkle of privilege on top when you do it to Dr Bender

                                  joe@beige.partyJ 1 Reply Last reply
                                  0
                                  • joe@beige.partyJ joe@beige.party

                                    @fartnuggets @emilymbender literally didn’t “correct” anyone. This is peak social media. Love when men attack men for existing in spaces.

                                    joe@beige.partyJ This user is from outside of this forum
                                    joe@beige.partyJ This user is from outside of this forum
                                    joe@beige.party
                                    wrote last edited by
                                    #22

                                    @fartnuggets @emilymbender I was said to giving PR. I think I’m supposed to be mansplaining. Just tell me to kill my self. This is what social media is about - making each other feel bad.

                                    1 Reply Last reply
                                    0
                                    • fartnuggets@jorts.horseF fartnuggets@jorts.horse

                                      @joe @emilymbender Not to argue, but thinking before you open your mouth is a good idea. Correcting an expert with a profound expertise, based on a misread of their posts, however, is not.

                                      Gender only comes into it because men do this all the fucking time.

                                      You'd be just as much of an ignorant moron correcting a man, and you'd fail to learn anything just the same. It's just got that special sprinkle of privilege on top when you do it to Dr Bender

                                      joe@beige.partyJ This user is from outside of this forum
                                      joe@beige.partyJ This user is from outside of this forum
                                      joe@beige.party
                                      wrote last edited by
                                      #23

                                      @fartnuggets @emilymbender I am sorry please give me more orders

                                      joe@beige.partyJ 1 Reply Last reply
                                      0
                                      • joe@beige.partyJ joe@beige.party

                                        @fartnuggets @emilymbender lol I knew it, you can’t discuss anything on social media. Gender is more important. Thanks for putting me in my place.

                                        fartnuggets@jorts.horseF This user is from outside of this forum
                                        fartnuggets@jorts.horseF This user is from outside of this forum
                                        fartnuggets@jorts.horse
                                        wrote last edited by
                                        #24

                                        @joe @emilymbender it's also fascinating that you're the one bringing gender into it. Bravo, you've somehow embodied the stereotype even more profoundly.

                                        1 Reply Last reply
                                        0
                                        • joe@beige.partyJ joe@beige.party

                                          @fartnuggets @emilymbender I am sorry please give me more orders

                                          joe@beige.partyJ This user is from outside of this forum
                                          joe@beige.partyJ This user is from outside of this forum
                                          joe@beige.party
                                          wrote last edited by
                                          #25

                                          @fartnuggets @emilymbender I didn’t correct her. I tried to talk to someone on social media. This is insanity.

                                          joe@beige.partyJ 1 Reply Last reply
                                          0
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Login or register to search.
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • World
                                          • Users
                                          • Groups