Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. gemma 4 e4b isn't half shabby, but i didn't think it would run in llama.cpp-vulkan in ubuntu on this lenovo yoga laptop with an AMD Radeon 860M GPU.

gemma 4 e4b isn't half shabby, but i didn't think it would run in llama.cpp-vulkan in ubuntu on this lenovo yoga laptop with an AMD Radeon 860M GPU.

Scheduled Pinned Locked Moved Uncategorized
s0up
56 Posts 7 Posters 0 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • lritter@mastodon.gamedev.placeL lritter@mastodon.gamedev.place

    my impression so far is that a lot of infrastructurd is being built on top the assumption that transformer llm's will eventually be replaced by something that actually works and learns. all of this has tech demo quality. i feel sorry for everyone forced by their boss to argue with the machine like they are in a douglas adams novel.

    #s0up

    lritter@mastodon.gamedev.placeL This user is from outside of this forum
    lritter@mastodon.gamedev.placeL This user is from outside of this forum
    lritter@mastodon.gamedev.place
    wrote last edited by
    #47

    apparently MCP servers are now the replacement for openai tools protocol. you can sort of convert the existing scripts for it. wrote one, ran it with a thing called fastmcp (a cornucopia of 986175 dependencies), connected to it in llama.cpp: doesn't work. these cardboard & sharpie solutions are begining to annoy me.

    i notice that most users seem to be happy *when* it works, never asking many questions about *how* it works which is how all these abysmal security failures happen.

    #s0up

    dunkelstern@corteximplant.comD neo@soc.psynet.meN lritter@mastodon.gamedev.placeL 3 Replies Last reply
    0
    • lritter@mastodon.gamedev.placeL lritter@mastodon.gamedev.place

      apparently MCP servers are now the replacement for openai tools protocol. you can sort of convert the existing scripts for it. wrote one, ran it with a thing called fastmcp (a cornucopia of 986175 dependencies), connected to it in llama.cpp: doesn't work. these cardboard & sharpie solutions are begining to annoy me.

      i notice that most users seem to be happy *when* it works, never asking many questions about *how* it works which is how all these abysmal security failures happen.

      #s0up

      dunkelstern@corteximplant.comD This user is from outside of this forum
      dunkelstern@corteximplant.comD This user is from outside of this forum
      dunkelstern@corteximplant.com
      wrote last edited by
      #48

      @lritter that’s absolutely the same thing i am feeling on this. I have been tasked at work to build a copilot/teams “Agent”. The official SDK from microsoft is version “alpha 50”, documentation is completely wrong and the “usecase” is built on hope and prayer and does not work in 60% of all cases. Copilot apparently does so much “Reasoning” in the background it burns through the token context window in no time and then starts to do weird things. Tool calls work in 80% of the cases but in the 20% they do not work it does hilarious things and wrecks the workflow completely. Doesn’t help that copilot has at least 4 ways to build “Agents” where 1 is basically just a prompt, 1 does not work and 2 need copilot pro subscriptions and ms developer accounts which cost you an arm and a leg.

      I completely do not understand why anyone wants to use this or how this should replace workers…

      lritter@mastodon.gamedev.placeL 1 Reply Last reply
      0
      • lritter@mastodon.gamedev.placeL lritter@mastodon.gamedev.place

        apparently MCP servers are now the replacement for openai tools protocol. you can sort of convert the existing scripts for it. wrote one, ran it with a thing called fastmcp (a cornucopia of 986175 dependencies), connected to it in llama.cpp: doesn't work. these cardboard & sharpie solutions are begining to annoy me.

        i notice that most users seem to be happy *when* it works, never asking many questions about *how* it works which is how all these abysmal security failures happen.

        #s0up

        neo@soc.psynet.meN This user is from outside of this forum
        neo@soc.psynet.meN This user is from outside of this forum
        neo@soc.psynet.me
        wrote last edited by
        #49

        @lritter Hehe. 😄 Here is my fleet of MCP servers. Six of them with about 200 tools in total I vibecoded over a couple of days using https://github.com/modelcontextprotocol/python-sdk. 🙈

        My current challenge is to tune the tool descriptions in a way to nudge agents into using the correct tools more intuitively (since I like to write rather abstract queries that often require a combination of tools from different MCPs to be used). Weird way of software development, but pretty much how imagined the future to be. 😂

        Link Preview Image
        lritter@mastodon.gamedev.placeL 2 Replies Last reply
        0
        • neo@soc.psynet.meN neo@soc.psynet.me

          @lritter Hehe. 😄 Here is my fleet of MCP servers. Six of them with about 200 tools in total I vibecoded over a couple of days using https://github.com/modelcontextprotocol/python-sdk. 🙈

          My current challenge is to tune the tool descriptions in a way to nudge agents into using the correct tools more intuitively (since I like to write rather abstract queries that often require a combination of tools from different MCPs to be used). Weird way of software development, but pretty much how imagined the future to be. 😂

          Link Preview Image
          lritter@mastodon.gamedev.placeL This user is from outside of this forum
          lritter@mastodon.gamedev.placeL This user is from outside of this forum
          lritter@mastodon.gamedev.place
          wrote last edited by
          #50

          @neo what can i say. the mcp runs, the port is 8000, the protocol is http, llama.cpp connects to http://localhost:8000 but then complains about errors in the protocol itself. i haven't even gotten to taskIng the model yet.

          i'll try again with the docs here. there are details the other example didn't have.

          1 Reply Last reply
          0
          • dunkelstern@corteximplant.comD dunkelstern@corteximplant.com

            @lritter that’s absolutely the same thing i am feeling on this. I have been tasked at work to build a copilot/teams “Agent”. The official SDK from microsoft is version “alpha 50”, documentation is completely wrong and the “usecase” is built on hope and prayer and does not work in 60% of all cases. Copilot apparently does so much “Reasoning” in the background it burns through the token context window in no time and then starts to do weird things. Tool calls work in 80% of the cases but in the 20% they do not work it does hilarious things and wrecks the workflow completely. Doesn’t help that copilot has at least 4 ways to build “Agents” where 1 is basically just a prompt, 1 does not work and 2 need copilot pro subscriptions and ms developer accounts which cost you an arm and a leg.

            I completely do not understand why anyone wants to use this or how this should replace workers…

            lritter@mastodon.gamedev.placeL This user is from outside of this forum
            lritter@mastodon.gamedev.placeL This user is from outside of this forum
            lritter@mastodon.gamedev.place
            wrote last edited by
            #51

            @dunkelstern all "knitted with a hot needle" as we say around here

            1 Reply Last reply
            0
            • neo@soc.psynet.meN neo@soc.psynet.me

              @lritter Hehe. 😄 Here is my fleet of MCP servers. Six of them with about 200 tools in total I vibecoded over a couple of days using https://github.com/modelcontextprotocol/python-sdk. 🙈

              My current challenge is to tune the tool descriptions in a way to nudge agents into using the correct tools more intuitively (since I like to write rather abstract queries that often require a combination of tools from different MCPs to be used). Weird way of software development, but pretty much how imagined the future to be. 😂

              Link Preview Image
              lritter@mastodon.gamedev.placeL This user is from outside of this forum
              lritter@mastodon.gamedev.placeL This user is from outside of this forum
              lritter@mastodon.gamedev.place
              wrote last edited by
              #52

              @neo *looking at the list* fascinating that the question "what am i doing with my life" is not coming up once for you - with or without bots.

              i guess i could do this for money. in a "row row row your boat" kind of way. i would have to be careful to pretend to take this seriously and not see it as the cube-esque dream logic that it is.

              i mean it's one thing when life itself is this way; that can't be helped. but living inside a sentient bureaucratic monster takes new levels of acceptance.

              neo@soc.psynet.meN 1 Reply Last reply
              0
              • lritter@mastodon.gamedev.placeL lritter@mastodon.gamedev.place

                @neo *looking at the list* fascinating that the question "what am i doing with my life" is not coming up once for you - with or without bots.

                i guess i could do this for money. in a "row row row your boat" kind of way. i would have to be careful to pretend to take this seriously and not see it as the cube-esque dream logic that it is.

                i mean it's one thing when life itself is this way; that can't be helped. but living inside a sentient bureaucratic monster takes new levels of acceptance.

                neo@soc.psynet.meN This user is from outside of this forum
                neo@soc.psynet.meN This user is from outside of this forum
                neo@soc.psynet.me
                wrote last edited by
                #53

                @lritter It's called a "job". Some people have to do that to earn money so they can pay their bills. 😉

                Also it's not that bad. Actually shipping something can be quite satisfying. You should try it one day. 😜

                lritter@mastodon.gamedev.placeL 1 Reply Last reply
                0
                • neo@soc.psynet.meN neo@soc.psynet.me

                  @lritter It's called a "job". Some people have to do that to earn money so they can pay their bills. 😉

                  Also it's not that bad. Actually shipping something can be quite satisfying. You should try it one day. 😜

                  lritter@mastodon.gamedev.placeL This user is from outside of this forum
                  lritter@mastodon.gamedev.placeL This user is from outside of this forum
                  lritter@mastodon.gamedev.place
                  wrote last edited by
                  #54

                  @neo you're not wrong.

                  1 Reply Last reply
                  0
                  • lritter@mastodon.gamedev.placeL lritter@mastodon.gamedev.place

                    apparently MCP servers are now the replacement for openai tools protocol. you can sort of convert the existing scripts for it. wrote one, ran it with a thing called fastmcp (a cornucopia of 986175 dependencies), connected to it in llama.cpp: doesn't work. these cardboard & sharpie solutions are begining to annoy me.

                    i notice that most users seem to be happy *when* it works, never asking many questions about *how* it works which is how all these abysmal security failures happen.

                    #s0up

                    lritter@mastodon.gamedev.placeL This user is from outside of this forum
                    lritter@mastodon.gamedev.placeL This user is from outside of this forum
                    lritter@mastodon.gamedev.place
                    wrote last edited by
                    #55

                    aha. the reason why the llama.cpp webfrontend can't reach the MCP server seems to be that for reasons beyond me, the communication with the server is attempted in the browser client rather than in the backend, and neither firefox nor chrome seem to support this out of the box (?)

                    Link Preview Image
                    Does not work with browser based MCP clients due to lack of CORS support · Issue #840 · PrefectHQ/fastmcp

                    Description Does not work with browser based MCP clients due to lack of CORS support Example Code Version Information Should allow for CORS header config to support browser clients Additional Context No response

                    favicon

                    GitHub (github.com)

                    according to this, you have to install something called "starlette" and then modify the mcp server so it works and i am already exhausted by this fuck shit stack.

                    #s0up

                    lritter@mastodon.gamedev.placeL 1 Reply Last reply
                    0
                    • lritter@mastodon.gamedev.placeL lritter@mastodon.gamedev.place

                      aha. the reason why the llama.cpp webfrontend can't reach the MCP server seems to be that for reasons beyond me, the communication with the server is attempted in the browser client rather than in the backend, and neither firefox nor chrome seem to support this out of the box (?)

                      Link Preview Image
                      Does not work with browser based MCP clients due to lack of CORS support · Issue #840 · PrefectHQ/fastmcp

                      Description Does not work with browser based MCP clients due to lack of CORS support Example Code Version Information Should allow for CORS header config to support browser clients Additional Context No response

                      favicon

                      GitHub (github.com)

                      according to this, you have to install something called "starlette" and then modify the mcp server so it works and i am already exhausted by this fuck shit stack.

                      #s0up

                      lritter@mastodon.gamedev.placeL This user is from outside of this forum
                      lritter@mastodon.gamedev.placeL This user is from outside of this forum
                      lritter@mastodon.gamedev.place
                      wrote last edited by
                      #56

                      okay. there is an experimental llama.cpp flag "--webui-mcp-proxy", not to be used in production, which puts a proxy in front so the CORS interface (?) that the browser needs is available.

                      you also have to check "use llama-server proxy" in the webui, and then it connects properly.

                      #s0up

                      Link Preview Image
                      1 Reply Last reply
                      0
                      • R relay@relay.infosec.exchange shared this topic
                      Reply
                      • Reply as topic
                      Log in to reply
                      • Oldest to Newest
                      • Newest to Oldest
                      • Most Votes


                      • Login

                      • Login or register to search.
                      • First post
                        Last post
                      0
                      • Categories
                      • Recent
                      • Tags
                      • Popular
                      • World
                      • Users
                      • Groups