now that i am... writing my own agentic LLM framework thing... because if you're going to have a shitposting IRC bot you may as well go completely overkill, i have Opinions on the state of the world.

ariadne@social.treehouse.systems

@dngrs I wanted something cooler than a Markov bot, and was already researching SLM (small language model, e.g. language strictly as I/O) technology for a Siri-like thing anyway.

ariadne@social.treehouse.systems

@mirth the question is why compete with them at all? it has same energy as the unix wars. large, proprietary models that lock people in. I would rather see a world of small, modular libre models that anyone with a weekend and a GPU can reproduce.

ariadne@social.treehouse.systems

@mirth interesting. what I've built is a modular pipeline which takes language input, converts it into structured data, enriches that structured data with other relevant information, processes the final query into a plan (which is also structured data) and then uses that plan to formulate a response

mirth@mastodon.sdf.org

@ariadne To me it's a question of sufficient output quality, the strongest models available just barely function enough to do a little bit of general purpose instructed information processing unreliably. That will improve over time but the current stuff is very early.

The reason I'm a bit skeptical of a proliferation of weekend-sized models is that that size sacrifices the key ingredient enabling the whole LLM craze: the magical-looking ability to run plain language instructions.

ariadne@social.treehouse.systems

@mirth i mean, i don't think that necessarily holds *if* you have the ability to build whatever you need with legos.

in many cases simply translating natural language to a specification for an expert system is enough

mirth@mastodon.sdf.org

@ariadne I'm not sure if there's a common name in the research but I think that kind of multi-step system that put the whole gloopy mess of linear algebra on some kind of rails is inevitably going to be necessary to make these things reliable. Even the smartest and most highly trained human specialists still rely on lookup tables and checklists and so forth to do their jobs.

ariadne@social.treehouse.systems

@mirth back in the earlier AI wars, these were called "expert systems"

my idea is basically SLMs for I/O with other small models and tools governed by a user-generated expert system

mirth@mastodon.sdf.org

@ariadne Going back to "reasoning" models, they are generally trained with reinforcement learning towards some goal rather than pure supervised prediction. What the biggest labs do is somewhat secret sauce but a technique called "GRPO" was made famous by DeepSeek and I think it or something much like it is what's used to post-train models to code and so forth.

$Link Preview Image$

Post Training Qwen3 for Math Reasoning Using GRPO - PyImageSearch

Fine-tuning Qwen3 for advanced math reasoning using GRPO: boosting precision, structure, and problem-solving accuracy post-training.

PyImageSearch (pyimagesearch.com)

pixx@merveilles.town

@ariadne
I'm kinda wondering if only using your logs is actually an advantage.

I'm sure there's dumb stuff in there but you've filtered out _so much_ of the dumbness on the internet that it might actually be a step up

mirth@mastodon.sdf.org

@ariadne I think there's a lot of merit to that idea although I don't understand how to build it. As models get more powerful the harnesses required to make them write coherent code or whatever aren't getting any simpler, so I think that's a strong argument for the "small pieces in a structured formation" kind of arrangement. Big LLMs have the attracting property that a user can start with a small description and see something happen right away, I wonder how to replicate that.

pinskia@hachyderm.io

@mirth @ariadne This here explains why the US companies are so upset with China here.

ariadne@social.treehouse.systems

@pinskia @mirth yep they broke the illusion.

IMO the real reason OpenAI reserved all of this RAM and shit is to prevent competitors from buying it

jannem@fosstodon.org

@ariadne @pinskia @mirth
What they are doing is forcing competitors to do more with less. Smaller models with a clever architecture, not huge monoliths trained by brute force. Might come back to bite them sooner or later.

I'd like to see more hybrid models, where the LLM largely sticks to being the language module, and other models (possibly not even NN) specialize in other functions.

ariadne@social.treehouse.systems

@jannem @pinskia @mirth yes, this is what i eventually want to build. a set of libre building blocks for building ethical, libre and personal agentic systems that are self-contained.

the shit Big AI is doing is not interesting to me, but SLMs and other specialized neural models legitimately provide a useful set of tools to have in the toolbox.

today, however, I just want to prove the ideas out by shitposting in IRC

ariadne@social.treehouse.systems

@jannem @pinskia @mirth that said, i think that OpenAI and other hardware/resource hoarders need to be called out on the fact that they don't need all of this to ship product

there really is no need to destroy the climate or make professional GPUs cost as much as a recent vintage used car

pixx@merveilles.town

@ariadne
Yeah, one thing I've wondered is how much simpler a system that, instead of processing code, took the plain english "refactor this to blah blah" and just processed the language and figured out what to tell the IDE and etc for everything else, could be.

Run a calculator instead of being one - and you have a much simpler problem to solve.

Could the reliability and ethical problems all be solved -- maybe, i dunno, but - yet another case of "tech could be cool if the harmful parts go away..."

@mirth

ariadne@social.treehouse.systems

@pixx @mirth i think small LLMs do not really have an ethical problem: i trained a 1.3B parameter LLM off of my own personal data in my apartment by simply being patient enough to wait. no copyright violations, no boiling oceans, just patience and a professional workstation GPU with 96GB RAM.

the ethical problem is with the Big AI companies who feel that the only path forward is to make bigger and bigger and bigger monolithic prediction models rather than properly engineer the damn thing.

that same ethical problem is driving the hoarding, because companies are buying the hardware to prevent their competitors from having it IMO.

goakam@mastodon.social

ngl this matches what ive seen running small ops. the hype is way disconnected from whats actually useful day to day. the real value isnt some magic in the model, its finding what problem it actually solves for your specific situation. most companies just buying in because theyre afraid of missing out.

iswyrm@mastodon.uno

@ariadne I do not talk as an educated in the field, but my wild guess, the AI craze is like the evolution of cloud computing business model that some corporations are running from a decade or more.
A way to move workflow into their services even when this workflow could be done offline.

pixx@merveilles.town

@ariadne
Yeah the hoarding one seems pretty obvious

I wonder whether openai can affkrd to hold onto so many chips for more than a year orntwo...
@mirth

CIRCLE WITH A DOT

now that i am... writing my own agentic LLM framework thing... because if you're going to have a shitposting IRC bot you may as well go completely overkill, i have Opinions on the state of the world.

Post Training Qwen3 for Math Reasoning Using GRPO - PyImageSearch