Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. gemma 4 e4b isn't half shabby, but i didn't think it would run in llama.cpp-vulkan in ubuntu on this lenovo yoga laptop with an AMD Radeon 860M GPU.

gemma 4 e4b isn't half shabby, but i didn't think it would run in llama.cpp-vulkan in ubuntu on this lenovo yoga laptop with an AMD Radeon 860M GPU.

Scheduled Pinned Locked Moved Uncategorized
s0up
56 Posts 7 Posters 0 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • allo@chaos.socialA allo@chaos.social

    @lritter I've once read they are a multiplier. Making the dumb people dumber and the clever people more clever.

    Like you can outsource things and blindly believe the output and fail hard, or you know exactly how to use them and speed up your work a lot.

    Another interesting aspect: First people reported burnout from using LLMs, because they are much more productive, and that led to doing much more in a day than they would when doing things themselves, while the work is still mentally straining.

    allo@chaos.socialA This user is from outside of this forum
    allo@chaos.socialA This user is from outside of this forum
    allo@chaos.social
    wrote last edited by
    #33

    @lritter
    The AI assisted 10x engineer, I guess.

    lritter@mastodon.gamedev.placeL 1 Reply Last reply
    0
    • allo@chaos.socialA allo@chaos.social

      @lritter
      The AI assisted 10x engineer, I guess.

      lritter@mastodon.gamedev.placeL This user is from outside of this forum
      lritter@mastodon.gamedev.placeL This user is from outside of this forum
      lritter@mastodon.gamedev.place
      wrote last edited by
      #34

      @allo all this sounds more like mythbuilding to me than truth.

      allo@chaos.socialA 1 Reply Last reply
      0
      • lritter@mastodon.gamedev.placeL lritter@mastodon.gamedev.place

        @allo i know of that aspect.

        > Making the dumb people dumber and the clever people more clever.

        yes but which of the two am i!

        allo@chaos.socialA This user is from outside of this forum
        allo@chaos.socialA This user is from outside of this forum
        allo@chaos.social
        wrote last edited by
        #35

        @lritter
        Be the zero, its not affected by multipliers! 🙂

        1 Reply Last reply
        0
        • lritter@mastodon.gamedev.placeL lritter@mastodon.gamedev.place

          @allo all this sounds more like mythbuilding to me than truth.

          allo@chaos.socialA This user is from outside of this forum
          allo@chaos.socialA This user is from outside of this forum
          allo@chaos.social
          wrote last edited by
          #36

          @lritter
          No idea, butI think it is plausibel that doing more even with a tool is more stressful than doing less by hand. I think it was particularly about coding work.

          lritter@mastodon.gamedev.placeL 1 Reply Last reply
          0
          • allo@chaos.socialA allo@chaos.social

            @lritter
            No idea, butI think it is plausibel that doing more even with a tool is more stressful than doing less by hand. I think it was particularly about coding work.

            lritter@mastodon.gamedev.placeL This user is from outside of this forum
            lritter@mastodon.gamedev.placeL This user is from outside of this forum
            lritter@mastodon.gamedev.place
            wrote last edited by
            #37

            @allo well it turns you into a bit of a CEO. so it would be logical that you get the same problems as one. which predicts an eventual coke habit 😉

            1 Reply Last reply
            0
            • lritter@mastodon.gamedev.placeL lritter@mastodon.gamedev.place

              jarvis, err i mean gemma can now do the original example i proposed.

              i added tools to:
              * get date and time
              * write to file in a special bucket dir
              * append to file in the bucket dir
              * read files (completely)
              * change directory
              * list directory

              it was pretty useless in understanding my language projects. i asked it to write a tutorial for nudl and despite seeing several examples, it used tokens from C++ and python.

              the future - today!

              #s0up

              Link Preview Image
              stompyrobot@mastodon.gamedev.placeS This user is from outside of this forum
              stompyrobot@mastodon.gamedev.placeS This user is from outside of this forum
              stompyrobot@mastodon.gamedev.place
              wrote last edited by
              #38

              @lritter Gemma is a very small model.
              Did you try asking opus to write a tutorial in the same repository?
              And, because it's computers, then ask it to verify and correct itself?
              (That's currently the state of the art in how to get useful stuff out. Why can't it do it automatically? IDK!)

              lritter@mastodon.gamedev.placeL 1 Reply Last reply
              0
              • stompyrobot@mastodon.gamedev.placeS stompyrobot@mastodon.gamedev.place

                @lritter Gemma is a very small model.
                Did you try asking opus to write a tutorial in the same repository?
                And, because it's computers, then ask it to verify and correct itself?
                (That's currently the state of the art in how to get useful stuff out. Why can't it do it automatically? IDK!)

                lritter@mastodon.gamedev.placeL This user is from outside of this forum
                lritter@mastodon.gamedev.placeL This user is from outside of this forum
                lritter@mastodon.gamedev.place
                wrote last edited by
                #39

                @StompyRobot but you see the problem in asking a politician to investigate their own dealings yes?

                stompyrobot@mastodon.gamedev.placeS 1 Reply Last reply
                0
                • lritter@mastodon.gamedev.placeL lritter@mastodon.gamedev.place

                  @StompyRobot but you see the problem in asking a politician to investigate their own dealings yes?

                  stompyrobot@mastodon.gamedev.placeS This user is from outside of this forum
                  stompyrobot@mastodon.gamedev.placeS This user is from outside of this forum
                  stompyrobot@mastodon.gamedev.place
                  wrote last edited by
                  #40

                  @lritter
                  Models aren't conscious, don't have volition, and aren't trained to have self preservation behavior. They are surprisingly OK at diagnosing their own output when given specific instructions!

                  Programming them is a whole new way of thinking, but they *can* be made into a useful part of a useful system.

                  As you note, we're still being much in a "batteries not included" early stage, despite boosters claiming it's all done.

                  lritter@mastodon.gamedev.placeL 1 Reply Last reply
                  0
                  • stompyrobot@mastodon.gamedev.placeS stompyrobot@mastodon.gamedev.place

                    @lritter
                    Models aren't conscious, don't have volition, and aren't trained to have self preservation behavior. They are surprisingly OK at diagnosing their own output when given specific instructions!

                    Programming them is a whole new way of thinking, but they *can* be made into a useful part of a useful system.

                    As you note, we're still being much in a "batteries not included" early stage, despite boosters claiming it's all done.

                    lritter@mastodon.gamedev.placeL This user is from outside of this forum
                    lritter@mastodon.gamedev.placeL This user is from outside of this forum
                    lritter@mastodon.gamedev.place
                    wrote last edited by
                    #41

                    @StompyRobot did you just lazily outsource your rebuttal to the machine? 😉

                    you know what i mean. if the machine makes mistakes generating, it will make mistakes verifying (whose output is also generation)

                    stompyrobot@mastodon.gamedev.placeS 1 Reply Last reply
                    0
                    • lritter@mastodon.gamedev.placeL lritter@mastodon.gamedev.place

                      @StompyRobot did you just lazily outsource your rebuttal to the machine? 😉

                      you know what i mean. if the machine makes mistakes generating, it will make mistakes verifying (whose output is also generation)

                      stompyrobot@mastodon.gamedev.placeS This user is from outside of this forum
                      stompyrobot@mastodon.gamedev.placeS This user is from outside of this forum
                      stompyrobot@mastodon.gamedev.place
                      wrote last edited by
                      #42

                      @lritter
                      What I'm saying is that that's not at all as certain as with people.
                      Or, to put another way, the prompt is a hash function into one of billions of possible programs stored in the model, and you'll get different bugs with a different prompt.
                      Getting the same model to work on the same problem in three different ways absolutely increases the rate of correctness, especially if you make a "best two of three" kind of setup.
                      It's really quite counter intuitive that it should work!

                      lritter@mastodon.gamedev.placeL 1 Reply Last reply
                      0
                      • stompyrobot@mastodon.gamedev.placeS stompyrobot@mastodon.gamedev.place

                        @lritter
                        What I'm saying is that that's not at all as certain as with people.
                        Or, to put another way, the prompt is a hash function into one of billions of possible programs stored in the model, and you'll get different bugs with a different prompt.
                        Getting the same model to work on the same problem in three different ways absolutely increases the rate of correctness, especially if you make a "best two of three" kind of setup.
                        It's really quite counter intuitive that it should work!

                        lritter@mastodon.gamedev.placeL This user is from outside of this forum
                        lritter@mastodon.gamedev.placeL This user is from outside of this forum
                        lritter@mastodon.gamedev.place
                        wrote last edited by
                        #43

                        @StompyRobot and this is supposed to be good?

                        stompyrobot@mastodon.gamedev.placeS 1 Reply Last reply
                        0
                        • lritter@mastodon.gamedev.placeL lritter@mastodon.gamedev.place

                          @StompyRobot and this is supposed to be good?

                          stompyrobot@mastodon.gamedev.placeS This user is from outside of this forum
                          stompyrobot@mastodon.gamedev.placeS This user is from outside of this forum
                          stompyrobot@mastodon.gamedev.place
                          wrote last edited by
                          #44

                          @lritter for Gemma, no idea!
                          For the frontier models, yes, it can actually be good.
                          Think of it as an intern you can foist off certain tasks to when you specify them well, while you do higher thought work.

                          Creating docs and tutorials are a great test case actually -- I'd expect the good models to do above average-human quality on that.

                          lritter@mastodon.gamedev.placeL 1 Reply Last reply
                          0
                          • stompyrobot@mastodon.gamedev.placeS stompyrobot@mastodon.gamedev.place

                            @lritter for Gemma, no idea!
                            For the frontier models, yes, it can actually be good.
                            Think of it as an intern you can foist off certain tasks to when you specify them well, while you do higher thought work.

                            Creating docs and tutorials are a great test case actually -- I'd expect the good models to do above average-human quality on that.

                            lritter@mastodon.gamedev.placeL This user is from outside of this forum
                            lritter@mastodon.gamedev.placeL This user is from outside of this forum
                            lritter@mastodon.gamedev.place
                            wrote last edited by
                            #45

                            @StompyRobot i have no use for a technology that emulates the flaws of people

                            lritter@mastodon.gamedev.placeL 1 Reply Last reply
                            0
                            • lritter@mastodon.gamedev.placeL lritter@mastodon.gamedev.place

                              @StompyRobot i have no use for a technology that emulates the flaws of people

                              lritter@mastodon.gamedev.placeL This user is from outside of this forum
                              lritter@mastodon.gamedev.placeL This user is from outside of this forum
                              lritter@mastodon.gamedev.place
                              wrote last edited by
                              #46

                              @StompyRobot if you have a subscription, you go and let them write docs for my stuff. let's see if you're not exaggerating.

                              but no cheating!

                              1 Reply Last reply
                              0
                              • lritter@mastodon.gamedev.placeL lritter@mastodon.gamedev.place

                                my impression so far is that a lot of infrastructurd is being built on top the assumption that transformer llm's will eventually be replaced by something that actually works and learns. all of this has tech demo quality. i feel sorry for everyone forced by their boss to argue with the machine like they are in a douglas adams novel.

                                #s0up

                                lritter@mastodon.gamedev.placeL This user is from outside of this forum
                                lritter@mastodon.gamedev.placeL This user is from outside of this forum
                                lritter@mastodon.gamedev.place
                                wrote last edited by
                                #47

                                apparently MCP servers are now the replacement for openai tools protocol. you can sort of convert the existing scripts for it. wrote one, ran it with a thing called fastmcp (a cornucopia of 986175 dependencies), connected to it in llama.cpp: doesn't work. these cardboard & sharpie solutions are begining to annoy me.

                                i notice that most users seem to be happy *when* it works, never asking many questions about *how* it works which is how all these abysmal security failures happen.

                                #s0up

                                dunkelstern@corteximplant.comD neo@soc.psynet.meN lritter@mastodon.gamedev.placeL 3 Replies Last reply
                                0
                                • lritter@mastodon.gamedev.placeL lritter@mastodon.gamedev.place

                                  apparently MCP servers are now the replacement for openai tools protocol. you can sort of convert the existing scripts for it. wrote one, ran it with a thing called fastmcp (a cornucopia of 986175 dependencies), connected to it in llama.cpp: doesn't work. these cardboard & sharpie solutions are begining to annoy me.

                                  i notice that most users seem to be happy *when* it works, never asking many questions about *how* it works which is how all these abysmal security failures happen.

                                  #s0up

                                  dunkelstern@corteximplant.comD This user is from outside of this forum
                                  dunkelstern@corteximplant.comD This user is from outside of this forum
                                  dunkelstern@corteximplant.com
                                  wrote last edited by
                                  #48

                                  @lritter that’s absolutely the same thing i am feeling on this. I have been tasked at work to build a copilot/teams “Agent”. The official SDK from microsoft is version “alpha 50”, documentation is completely wrong and the “usecase” is built on hope and prayer and does not work in 60% of all cases. Copilot apparently does so much “Reasoning” in the background it burns through the token context window in no time and then starts to do weird things. Tool calls work in 80% of the cases but in the 20% they do not work it does hilarious things and wrecks the workflow completely. Doesn’t help that copilot has at least 4 ways to build “Agents” where 1 is basically just a prompt, 1 does not work and 2 need copilot pro subscriptions and ms developer accounts which cost you an arm and a leg.

                                  I completely do not understand why anyone wants to use this or how this should replace workers…

                                  lritter@mastodon.gamedev.placeL 1 Reply Last reply
                                  0
                                  • lritter@mastodon.gamedev.placeL lritter@mastodon.gamedev.place

                                    apparently MCP servers are now the replacement for openai tools protocol. you can sort of convert the existing scripts for it. wrote one, ran it with a thing called fastmcp (a cornucopia of 986175 dependencies), connected to it in llama.cpp: doesn't work. these cardboard & sharpie solutions are begining to annoy me.

                                    i notice that most users seem to be happy *when* it works, never asking many questions about *how* it works which is how all these abysmal security failures happen.

                                    #s0up

                                    neo@soc.psynet.meN This user is from outside of this forum
                                    neo@soc.psynet.meN This user is from outside of this forum
                                    neo@soc.psynet.me
                                    wrote last edited by
                                    #49

                                    @lritter Hehe. 😄 Here is my fleet of MCP servers. Six of them with about 200 tools in total I vibecoded over a couple of days using https://github.com/modelcontextprotocol/python-sdk. 🙈

                                    My current challenge is to tune the tool descriptions in a way to nudge agents into using the correct tools more intuitively (since I like to write rather abstract queries that often require a combination of tools from different MCPs to be used). Weird way of software development, but pretty much how imagined the future to be. 😂

                                    Link Preview Image
                                    lritter@mastodon.gamedev.placeL 2 Replies Last reply
                                    0
                                    • neo@soc.psynet.meN neo@soc.psynet.me

                                      @lritter Hehe. 😄 Here is my fleet of MCP servers. Six of them with about 200 tools in total I vibecoded over a couple of days using https://github.com/modelcontextprotocol/python-sdk. 🙈

                                      My current challenge is to tune the tool descriptions in a way to nudge agents into using the correct tools more intuitively (since I like to write rather abstract queries that often require a combination of tools from different MCPs to be used). Weird way of software development, but pretty much how imagined the future to be. 😂

                                      Link Preview Image
                                      lritter@mastodon.gamedev.placeL This user is from outside of this forum
                                      lritter@mastodon.gamedev.placeL This user is from outside of this forum
                                      lritter@mastodon.gamedev.place
                                      wrote last edited by
                                      #50

                                      @neo what can i say. the mcp runs, the port is 8000, the protocol is http, llama.cpp connects to http://localhost:8000 but then complains about errors in the protocol itself. i haven't even gotten to taskIng the model yet.

                                      i'll try again with the docs here. there are details the other example didn't have.

                                      1 Reply Last reply
                                      0
                                      • dunkelstern@corteximplant.comD dunkelstern@corteximplant.com

                                        @lritter that’s absolutely the same thing i am feeling on this. I have been tasked at work to build a copilot/teams “Agent”. The official SDK from microsoft is version “alpha 50”, documentation is completely wrong and the “usecase” is built on hope and prayer and does not work in 60% of all cases. Copilot apparently does so much “Reasoning” in the background it burns through the token context window in no time and then starts to do weird things. Tool calls work in 80% of the cases but in the 20% they do not work it does hilarious things and wrecks the workflow completely. Doesn’t help that copilot has at least 4 ways to build “Agents” where 1 is basically just a prompt, 1 does not work and 2 need copilot pro subscriptions and ms developer accounts which cost you an arm and a leg.

                                        I completely do not understand why anyone wants to use this or how this should replace workers…

                                        lritter@mastodon.gamedev.placeL This user is from outside of this forum
                                        lritter@mastodon.gamedev.placeL This user is from outside of this forum
                                        lritter@mastodon.gamedev.place
                                        wrote last edited by
                                        #51

                                        @dunkelstern all "knitted with a hot needle" as we say around here

                                        1 Reply Last reply
                                        0
                                        • neo@soc.psynet.meN neo@soc.psynet.me

                                          @lritter Hehe. 😄 Here is my fleet of MCP servers. Six of them with about 200 tools in total I vibecoded over a couple of days using https://github.com/modelcontextprotocol/python-sdk. 🙈

                                          My current challenge is to tune the tool descriptions in a way to nudge agents into using the correct tools more intuitively (since I like to write rather abstract queries that often require a combination of tools from different MCPs to be used). Weird way of software development, but pretty much how imagined the future to be. 😂

                                          Link Preview Image
                                          lritter@mastodon.gamedev.placeL This user is from outside of this forum
                                          lritter@mastodon.gamedev.placeL This user is from outside of this forum
                                          lritter@mastodon.gamedev.place
                                          wrote last edited by
                                          #52

                                          @neo *looking at the list* fascinating that the question "what am i doing with my life" is not coming up once for you - with or without bots.

                                          i guess i could do this for money. in a "row row row your boat" kind of way. i would have to be careful to pretend to take this seriously and not see it as the cube-esque dream logic that it is.

                                          i mean it's one thing when life itself is this way; that can't be helped. but living inside a sentient bureaucratic monster takes new levels of acceptance.

                                          neo@soc.psynet.meN 1 Reply Last reply
                                          0
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Login or register to search.
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • World
                                          • Users
                                          • Groups