Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. now that i am... writing my own agentic LLM framework thing... because if you're going to have a shitposting IRC bot you may as well go completely overkill, i have Opinions on the state of the world.

now that i am... writing my own agentic LLM framework thing... because if you're going to have a shitposting IRC bot you may as well go completely overkill, i have Opinions on the state of the world.

Scheduled Pinned Locked Moved Uncategorized
51 Posts 18 Posters 0 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • ariadne@social.treehouse.systemsA This user is from outside of this forum
    ariadne@social.treehouse.systemsA This user is from outside of this forum
    ariadne@social.treehouse.systems
    wrote last edited by
    #1

    now that i am... writing my own agentic LLM framework thing... because if you're going to have a shitposting IRC bot you may as well go completely overkill, i have Opinions on the state of the world.

    openclaw, especially, seems to be hot garbage, actually, because i was able to teach my LLM (which i trained from scratch on the highest quality artisanal IRC logs, 2003 to present, so i can assure you it is not a very good LLM) to use tools in the context of my own framework quite easily.

    dysfun@social.treehouse.systemsD thomholwerda@exquisite.socialT alys@selfy.armyA dan@discuss.systemsD slyecho@mdon.eeS 11 Replies Last reply
    0
    • ariadne@social.treehouse.systemsA ariadne@social.treehouse.systems

      now that i am... writing my own agentic LLM framework thing... because if you're going to have a shitposting IRC bot you may as well go completely overkill, i have Opinions on the state of the world.

      openclaw, especially, seems to be hot garbage, actually, because i was able to teach my LLM (which i trained from scratch on the highest quality artisanal IRC logs, 2003 to present, so i can assure you it is not a very good LLM) to use tools in the context of my own framework quite easily.

      dysfun@social.treehouse.systemsD This user is from outside of this forum
      dysfun@social.treehouse.systemsD This user is from outside of this forum
      dysfun@social.treehouse.systems
      wrote last edited by
      #2

      @ariadne say it's not true

      1 Reply Last reply
      0
      • ariadne@social.treehouse.systemsA ariadne@social.treehouse.systems

        now that i am... writing my own agentic LLM framework thing... because if you're going to have a shitposting IRC bot you may as well go completely overkill, i have Opinions on the state of the world.

        openclaw, especially, seems to be hot garbage, actually, because i was able to teach my LLM (which i trained from scratch on the highest quality artisanal IRC logs, 2003 to present, so i can assure you it is not a very good LLM) to use tools in the context of my own framework quite easily.

        thomholwerda@exquisite.socialT This user is from outside of this forum
        thomholwerda@exquisite.socialT This user is from outside of this forum
        thomholwerda@exquisite.social
        wrote last edited by
        #3

        @ariadne A shitpost bot trained on IRC logs?

        Holy fucking shit you found a valid use for "AI".

        ariadne@social.treehouse.systemsA 1 Reply Last reply
        0
        • ariadne@social.treehouse.systemsA ariadne@social.treehouse.systems

          now that i am... writing my own agentic LLM framework thing... because if you're going to have a shitposting IRC bot you may as well go completely overkill, i have Opinions on the state of the world.

          openclaw, especially, seems to be hot garbage, actually, because i was able to teach my LLM (which i trained from scratch on the highest quality artisanal IRC logs, 2003 to present, so i can assure you it is not a very good LLM) to use tools in the context of my own framework quite easily.

          alys@selfy.armyA This user is from outside of this forum
          alys@selfy.armyA This user is from outside of this forum
          alys@selfy.army
          wrote last edited by
          #4

          @ariadne as someone who trained an llm on the 1913 Webster's Dictionary, "training an llm on tiny corpuses" is among the only kinds of llm experiment i'm interested in hearing about.

          1 Reply Last reply
          0
          • ariadne@social.treehouse.systemsA ariadne@social.treehouse.systems

            now that i am... writing my own agentic LLM framework thing... because if you're going to have a shitposting IRC bot you may as well go completely overkill, i have Opinions on the state of the world.

            openclaw, especially, seems to be hot garbage, actually, because i was able to teach my LLM (which i trained from scratch on the highest quality artisanal IRC logs, 2003 to present, so i can assure you it is not a very good LLM) to use tools in the context of my own framework quite easily.

            dan@discuss.systemsD This user is from outside of this forum
            dan@discuss.systemsD This user is from outside of this forum
            dan@discuss.systems
            wrote last edited by
            #5

            @ariadne many years ago, I trained a Markov model on a decade or two of my IRC utterances to see if I could get it to replace me.

            Now I'm realizing I could have described that as an early AI agent and run off with a huge pile of VC money.

            1 Reply Last reply
            1
            0
            • ariadne@social.treehouse.systemsA ariadne@social.treehouse.systems

              now that i am... writing my own agentic LLM framework thing... because if you're going to have a shitposting IRC bot you may as well go completely overkill, i have Opinions on the state of the world.

              openclaw, especially, seems to be hot garbage, actually, because i was able to teach my LLM (which i trained from scratch on the highest quality artisanal IRC logs, 2003 to present, so i can assure you it is not a very good LLM) to use tools in the context of my own framework quite easily.

              slyecho@mdon.eeS This user is from outside of this forum
              slyecho@mdon.eeS This user is from outside of this forum
              slyecho@mdon.ee
              wrote last edited by
              #6

              @ariadne They are all quite bad and not really production-ready. Maybe support Docker at the minimum, but of course local volume mounts with mutable files. But imagine if it could scale workloads in Kubernetes, save to a database and use S3 storage.

              1 Reply Last reply
              0
              • thomholwerda@exquisite.socialT thomholwerda@exquisite.social

                @ariadne A shitpost bot trained on IRC logs?

                Holy fucking shit you found a valid use for "AI".

                ariadne@social.treehouse.systemsA This user is from outside of this forum
                ariadne@social.treehouse.systemsA This user is from outside of this forum
                ariadne@social.treehouse.systems
                wrote last edited by
                #7

                @thomholwerda i trained it from scratch, this is peak IRC

                thomholwerda@exquisite.socialT 1 Reply Last reply
                0
                • ariadne@social.treehouse.systemsA ariadne@social.treehouse.systems

                  now that i am... writing my own agentic LLM framework thing... because if you're going to have a shitposting IRC bot you may as well go completely overkill, i have Opinions on the state of the world.

                  openclaw, especially, seems to be hot garbage, actually, because i was able to teach my LLM (which i trained from scratch on the highest quality artisanal IRC logs, 2003 to present, so i can assure you it is not a very good LLM) to use tools in the context of my own framework quite easily.

                  dvshkn@social.treehouse.systemsD This user is from outside of this forum
                  dvshkn@social.treehouse.systemsD This user is from outside of this forum
                  dvshkn@social.treehouse.systems
                  wrote last edited by
                  #8

                  @ariadne Did you pull in a tool use data set to fine tune on, or was this accomplished entirely through prompting? I've always been interested in how lean the models can get.

                  ariadne@social.treehouse.systemsA 1 Reply Last reply
                  0
                  • dvshkn@social.treehouse.systemsD dvshkn@social.treehouse.systems

                    @ariadne Did you pull in a tool use data set to fine tune on, or was this accomplished entirely through prompting? I've always been interested in how lean the models can get.

                    ariadne@social.treehouse.systemsA This user is from outside of this forum
                    ariadne@social.treehouse.systemsA This user is from outside of this forum
                    ariadne@social.treehouse.systems
                    wrote last edited by
                    #9

                    @dvshkn i generated a bunch of examples of valid and invalid JSON document fragments and then prompted it with "reply in JSON" and then a spec on what it can do.

                    the hardest thing has been convincing it to shut the fuck up actually.

                    dvshkn@social.treehouse.systemsD 1 Reply Last reply
                    0
                    • ariadne@social.treehouse.systemsA ariadne@social.treehouse.systems

                      @dvshkn i generated a bunch of examples of valid and invalid JSON document fragments and then prompted it with "reply in JSON" and then a spec on what it can do.

                      the hardest thing has been convincing it to shut the fuck up actually.

                      dvshkn@social.treehouse.systemsD This user is from outside of this forum
                      dvshkn@social.treehouse.systemsD This user is from outside of this forum
                      dvshkn@social.treehouse.systems
                      wrote last edited by
                      #10

                      @ariadne It might not be well received by everyone, but would read a blog post if you do write one

                      ariadne@social.treehouse.systemsA 1 Reply Last reply
                      0
                      • ariadne@social.treehouse.systemsA ariadne@social.treehouse.systems

                        @thomholwerda i trained it from scratch, this is peak IRC

                        thomholwerda@exquisite.socialT This user is from outside of this forum
                        thomholwerda@exquisite.socialT This user is from outside of this forum
                        thomholwerda@exquisite.social
                        wrote last edited by
                        #11

                        @ariadne If there are plans to make its... Musings available outside of IRC, I'm bookmarking that.

                        ariadne@social.treehouse.systemsA 1 Reply Last reply
                        0
                        • thomholwerda@exquisite.socialT thomholwerda@exquisite.social

                          @ariadne If there are plans to make its... Musings available outside of IRC, I'm bookmarking that.

                          ariadne@social.treehouse.systemsA This user is from outside of this forum
                          ariadne@social.treehouse.systemsA This user is from outside of this forum
                          ariadne@social.treehouse.systems
                          wrote last edited by
                          #12

                          @thomholwerda i have no idea how to grant it the level of autonomy that would allow it to go full bcachefs

                          thomholwerda@exquisite.socialT 1 Reply Last reply
                          0
                          • dvshkn@social.treehouse.systemsD dvshkn@social.treehouse.systems

                            @ariadne It might not be well received by everyone, but would read a blog post if you do write one

                            ariadne@social.treehouse.systemsA This user is from outside of this forum
                            ariadne@social.treehouse.systemsA This user is from outside of this forum
                            ariadne@social.treehouse.systems
                            wrote last edited by
                            #13

                            @dvshkn *shrug* i think my opinions on commercial AI are well understood by now (namely that i am quite skeptical of it)

                            ariadne@social.treehouse.systemsA 1 Reply Last reply
                            0
                            • ariadne@social.treehouse.systemsA ariadne@social.treehouse.systems

                              @dvshkn *shrug* i think my opinions on commercial AI are well understood by now (namely that i am quite skeptical of it)

                              ariadne@social.treehouse.systemsA This user is from outside of this forum
                              ariadne@social.treehouse.systemsA This user is from outside of this forum
                              ariadne@social.treehouse.systems
                              wrote last edited by
                              #14

                              @dvshkn and, if anything, this exercise has only made me *more* skeptical

                              1 Reply Last reply
                              0
                              • ariadne@social.treehouse.systemsA ariadne@social.treehouse.systems

                                now that i am... writing my own agentic LLM framework thing... because if you're going to have a shitposting IRC bot you may as well go completely overkill, i have Opinions on the state of the world.

                                openclaw, especially, seems to be hot garbage, actually, because i was able to teach my LLM (which i trained from scratch on the highest quality artisanal IRC logs, 2003 to present, so i can assure you it is not a very good LLM) to use tools in the context of my own framework quite easily.

                                beaiouns@is.nota.liveB This user is from outside of this forum
                                beaiouns@is.nota.liveB This user is from outside of this forum
                                beaiouns@is.nota.live
                                wrote last edited by
                                #15

                                @ariadne I have suspected this but never possessed the patience (and possibly the skill) to actually implement it. props

                                1 Reply Last reply
                                0
                                • ariadne@social.treehouse.systemsA ariadne@social.treehouse.systems

                                  @thomholwerda i have no idea how to grant it the level of autonomy that would allow it to go full bcachefs

                                  thomholwerda@exquisite.socialT This user is from outside of this forum
                                  thomholwerda@exquisite.socialT This user is from outside of this forum
                                  thomholwerda@exquisite.social
                                  wrote last edited by
                                  #16

                                  @ariadne The world is not ready for that.

                                  1 Reply Last reply
                                  0
                                  • ariadne@social.treehouse.systemsA ariadne@social.treehouse.systems

                                    now that i am... writing my own agentic LLM framework thing... because if you're going to have a shitposting IRC bot you may as well go completely overkill, i have Opinions on the state of the world.

                                    openclaw, especially, seems to be hot garbage, actually, because i was able to teach my LLM (which i trained from scratch on the highest quality artisanal IRC logs, 2003 to present, so i can assure you it is not a very good LLM) to use tools in the context of my own framework quite easily.

                                    ariadne@social.treehouse.systemsA This user is from outside of this forum
                                    ariadne@social.treehouse.systemsA This user is from outside of this forum
                                    ariadne@social.treehouse.systems
                                    wrote last edited by
                                    #17

                                    first of all, when i began i was quite skeptical on commercial AI.

                                    this exercise has only made me more skeptical, for a few reasons:

                                    first: you actually can hit the "good enough" point for text prediction with very little data. 80GB of low-quality (but ethically sourced from $HOME/logs) training data yielded a bot that can compose english and french prose reasonably well. if i additionally trained it on a creative commons licensed source like a wikipedia dump, it would probably be *way* more than enough. i don't have the compute power to do that though.

                                    second: reasoning models seem to largely be "mixture of experts" which are just more LLMs bolted on to each other. there's some cool consensus stuff going on, but that's all there is. this could possibly be considered a form of "thinking" in the framing of minsky's society of mind, but i don't think there is enough here that i would want to invest in companies doing this long term.

                                    third: from my own experiences teaching my LLM how to use tools, i can tell you that claude code and openai codex are just chatbots with a really well-written system prompt backed by a "mixture of experts" model. it is like that one scene where neo unlocks god mode in the matrix, i see how all this bullshit works now. (there is still a lot i do not know about the specifics, but i'm a person who works on the fuzzy side of things so it does not matter).

                                    fourth: i built my own LLM with a threadripper, some IRC logs gathered from various hard drives, a $10k GPU, a look at the qwen3 training scripts (i have Opinions on py3-transformers) and few days of training. it is pretty capable of generating plausible text. what is the big intellectual property asset that OpenAI has that the little guys can't duplicate? if i can do it in my condo, a startup can certainly compete with OpenAI.

                                    given these things, I really just don't understand how it is justifiable for all of this AI stuff to be some double-digit % of global GDP.

                                    if anything, i just have stronger conviction in that now.

                                    dysfun@social.treehouse.systemsD dvshkn@social.treehouse.systemsD mirth@mastodon.sdf.orgM dngrs@chaos.socialD goakam@mastodon.socialG 7 Replies Last reply
                                    0
                                    • ariadne@social.treehouse.systemsA ariadne@social.treehouse.systems

                                      first of all, when i began i was quite skeptical on commercial AI.

                                      this exercise has only made me more skeptical, for a few reasons:

                                      first: you actually can hit the "good enough" point for text prediction with very little data. 80GB of low-quality (but ethically sourced from $HOME/logs) training data yielded a bot that can compose english and french prose reasonably well. if i additionally trained it on a creative commons licensed source like a wikipedia dump, it would probably be *way* more than enough. i don't have the compute power to do that though.

                                      second: reasoning models seem to largely be "mixture of experts" which are just more LLMs bolted on to each other. there's some cool consensus stuff going on, but that's all there is. this could possibly be considered a form of "thinking" in the framing of minsky's society of mind, but i don't think there is enough here that i would want to invest in companies doing this long term.

                                      third: from my own experiences teaching my LLM how to use tools, i can tell you that claude code and openai codex are just chatbots with a really well-written system prompt backed by a "mixture of experts" model. it is like that one scene where neo unlocks god mode in the matrix, i see how all this bullshit works now. (there is still a lot i do not know about the specifics, but i'm a person who works on the fuzzy side of things so it does not matter).

                                      fourth: i built my own LLM with a threadripper, some IRC logs gathered from various hard drives, a $10k GPU, a look at the qwen3 training scripts (i have Opinions on py3-transformers) and few days of training. it is pretty capable of generating plausible text. what is the big intellectual property asset that OpenAI has that the little guys can't duplicate? if i can do it in my condo, a startup can certainly compete with OpenAI.

                                      given these things, I really just don't understand how it is justifiable for all of this AI stuff to be some double-digit % of global GDP.

                                      if anything, i just have stronger conviction in that now.

                                      dysfun@social.treehouse.systemsD This user is from outside of this forum
                                      dysfun@social.treehouse.systemsD This user is from outside of this forum
                                      dysfun@social.treehouse.systems
                                      wrote last edited by
                                      #18

                                      @ariadne it was never justifiable, but investors don't have your ability to just go play.

                                      1 Reply Last reply
                                      0
                                      • ariadne@social.treehouse.systemsA ariadne@social.treehouse.systems

                                        first of all, when i began i was quite skeptical on commercial AI.

                                        this exercise has only made me more skeptical, for a few reasons:

                                        first: you actually can hit the "good enough" point for text prediction with very little data. 80GB of low-quality (but ethically sourced from $HOME/logs) training data yielded a bot that can compose english and french prose reasonably well. if i additionally trained it on a creative commons licensed source like a wikipedia dump, it would probably be *way* more than enough. i don't have the compute power to do that though.

                                        second: reasoning models seem to largely be "mixture of experts" which are just more LLMs bolted on to each other. there's some cool consensus stuff going on, but that's all there is. this could possibly be considered a form of "thinking" in the framing of minsky's society of mind, but i don't think there is enough here that i would want to invest in companies doing this long term.

                                        third: from my own experiences teaching my LLM how to use tools, i can tell you that claude code and openai codex are just chatbots with a really well-written system prompt backed by a "mixture of experts" model. it is like that one scene where neo unlocks god mode in the matrix, i see how all this bullshit works now. (there is still a lot i do not know about the specifics, but i'm a person who works on the fuzzy side of things so it does not matter).

                                        fourth: i built my own LLM with a threadripper, some IRC logs gathered from various hard drives, a $10k GPU, a look at the qwen3 training scripts (i have Opinions on py3-transformers) and few days of training. it is pretty capable of generating plausible text. what is the big intellectual property asset that OpenAI has that the little guys can't duplicate? if i can do it in my condo, a startup can certainly compete with OpenAI.

                                        given these things, I really just don't understand how it is justifiable for all of this AI stuff to be some double-digit % of global GDP.

                                        if anything, i just have stronger conviction in that now.

                                        dvshkn@social.treehouse.systemsD This user is from outside of this forum
                                        dvshkn@social.treehouse.systemsD This user is from outside of this forum
                                        dvshkn@social.treehouse.systems
                                        wrote last edited by
                                        #19

                                        @ariadne I think your question in the fourth point is answered by your first point. A lot of the secret sauce is just hoarding compute.

                                        ariadne@social.treehouse.systemsA 1 Reply Last reply
                                        0
                                        • ariadne@social.treehouse.systemsA ariadne@social.treehouse.systems

                                          now that i am... writing my own agentic LLM framework thing... because if you're going to have a shitposting IRC bot you may as well go completely overkill, i have Opinions on the state of the world.

                                          openclaw, especially, seems to be hot garbage, actually, because i was able to teach my LLM (which i trained from scratch on the highest quality artisanal IRC logs, 2003 to present, so i can assure you it is not a very good LLM) to use tools in the context of my own framework quite easily.

                                          schrotthaufen@mastodon.socialS This user is from outside of this forum
                                          schrotthaufen@mastodon.socialS This user is from outside of this forum
                                          schrotthaufen@mastodon.social
                                          wrote last edited by
                                          #20

                                          @ariadne If you market it right*, you too can sell for a fuck ton of money to Meta.

                                          * Shitposts better than any LLM on Moltbook 🙊

                                          1 Reply Last reply
                                          0
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Login or register to search.
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • World
                                          • Users
                                          • Groups