Reply to Is the #LLM race actually a race to the bottom? on Thu, 23 Apr 2026 14:19:37 GMT

mamba@mstdn.ca — Thu, 23 Apr 2026 14:19:37 GMT

Since Gemma4 came out, I agree it's been the gold standard for performance vs compute. If SoC is the way forward for local compute (and I think its clear it is) the real jump happens when unified memory architectures can actually handle the token volume an agentic harness needs.

Progress on memory overhead for long-context agents, combined with advancements in unified pool architecture, make this a real possibility in the near future.