<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[Is the #LLM race actually a race to the bottom?]]></title><description><![CDATA[<p>Is the <a href="https://mstdn.ca/tags/LLM" rel="tag">#<span>LLM</span></a> race actually a race to the bottom? In the short time I've been tracking model development, the jump in what's possible on consumer hardware has been impressive. </p><p>Every other week, we see a new model that does more with lighter weights and fewer parameters.</p><p><a href="https://mstdn.ca/tags/AI" rel="tag">#<span>AI</span></a> <a href="https://mstdn.ca/tags/qwen" rel="tag">#<span>qwen</span></a> <a href="https://mstdn.ca/tags/kimi" rel="tag">#<span>kimi</span></a> <a href="https://mstdn.ca/tags/gemma4" rel="tag">#<span>gemma4</span></a> <a href="https://mstdn.ca/tags/selfhosting" rel="tag">#<span>selfhosting</span></a></p>]]></description><link>https://board.circlewithadot.net/topic/210c762e-8499-4a0d-bd84-9809fb2b171c/is-the-llm-race-actually-a-race-to-the-bottom</link><generator>RSS for Node</generator><lastBuildDate>Fri, 15 May 2026 04:41:46 GMT</lastBuildDate><atom:link href="https://board.circlewithadot.net/topic/210c762e-8499-4a0d-bd84-9809fb2b171c.rss" rel="self" type="application/rss+xml"/><pubDate>Thu, 23 Apr 2026 13:58:17 GMT</pubDate><ttl>60</ttl><item><title><![CDATA[Reply to Is the #LLM race actually a race to the bottom? on Thu, 23 Apr 2026 14:19:37 GMT]]></title><description><![CDATA[<p><span><a href="/user/perpetuum_mobile%40mastodon.social">@<span>perpetuum_mobile</span></a></span> </p><p>Since Gemma4 came out, I agree it's been the gold standard for performance vs compute. If SoC is the way forward for local compute (and I think its clear it is) the real jump happens when unified memory architectures can actually handle the token volume an agentic harness needs. </p><p>Progress on memory overhead for long-context agents, combined with advancements in unified pool architecture, make this a real possibility in the near future.</p>]]></description><link>https://board.circlewithadot.net/post/https://mstdn.ca/users/mamba/statuses/116454455867083033</link><guid isPermaLink="true">https://board.circlewithadot.net/post/https://mstdn.ca/users/mamba/statuses/116454455867083033</guid><dc:creator><![CDATA[mamba@mstdn.ca]]></dc:creator><pubDate>Thu, 23 Apr 2026 14:19:37 GMT</pubDate></item></channel></rss>