<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[Damn those Mythos benchmarks seem very promising]]></title><description><![CDATA[<p>Damn those Mythos benchmarks seem very promising</p>]]></description><link>https://board.circlewithadot.net/topic/1729f588-bfc4-4fb4-be07-d51ce437bbdf/damn-those-mythos-benchmarks-seem-very-promising</link><generator>RSS for Node</generator><lastBuildDate>Sat, 02 May 2026 19:22:52 GMT</lastBuildDate><atom:link href="https://board.circlewithadot.net/topic/1729f588-bfc4-4fb4-be07-d51ce437bbdf.rss" rel="self" type="application/rss+xml"/><pubDate>Tue, 07 Apr 2026 20:52:49 GMT</pubDate><ttl>60</ttl><item><title><![CDATA[Reply to Damn those Mythos benchmarks seem very promising on Wed, 08 Apr 2026 13:47:59 GMT]]></title><description><![CDATA[What's the fix for the people behind it explicitly having the goal of replacing the human mind as a tool of thought?<br /><br />CC: <span><a href="/user/justin%40toot.io">@justin@toot.io</a></span><br />]]></description><link>https://board.circlewithadot.net/post/https://hj.9fs.net/ori/p/1775656079.937365</link><guid isPermaLink="true">https://board.circlewithadot.net/post/https://hj.9fs.net/ori/p/1775656079.937365</guid><dc:creator><![CDATA[ori@hj.9fs.net]]></dc:creator><pubDate>Wed, 08 Apr 2026 13:47:59 GMT</pubDate></item><item><title><![CDATA[Reply to Damn those Mythos benchmarks seem very promising on Wed, 08 Apr 2026 13:44:58 GMT]]></title><description><![CDATA[<p><span><a href="/user/pojntfx%40mastodon.social">@<span>pojntfx</span></a></span> nod. it does have me thinking hard about other forms of baked-in safety. i'll admit this is the first point in my career where i've ever taken elixir seriously.</p><p>(well, ok, not really... <span><a href="/user/abnv%40fantastic.earth">@<span>abnv</span></a></span> ran a team at nilenso that did some amazing work with it for an quiz app that ran in parallel to a tv show. but i've never previously been tempted to learn it.)</p>]]></description><link>https://board.circlewithadot.net/post/https://fantastic.earth/users/deobald/statuses/116369384956721332</link><guid isPermaLink="true">https://board.circlewithadot.net/post/https://fantastic.earth/users/deobald/statuses/116369384956721332</guid><dc:creator><![CDATA[deobald@fantastic.earth]]></dc:creator><pubDate>Wed, 08 Apr 2026 13:44:58 GMT</pubDate></item><item><title><![CDATA[Reply to Damn those Mythos benchmarks seem very promising on Wed, 08 Apr 2026 13:41:20 GMT]]></title><description><![CDATA[<p><span><a href="/user/pojntfx%40mastodon.social">@<span>pojntfx</span></a></span> are you using nanobot for hacking or were you just pointing me to the provider section?</p>]]></description><link>https://board.circlewithadot.net/post/https://fantastic.earth/users/deobald/statuses/116369370657032720</link><guid isPermaLink="true">https://board.circlewithadot.net/post/https://fantastic.earth/users/deobald/statuses/116369370657032720</guid><dc:creator><![CDATA[deobald@fantastic.earth]]></dc:creator><pubDate>Wed, 08 Apr 2026 13:41:20 GMT</pubDate></item><item><title><![CDATA[Reply to Damn those Mythos benchmarks seem very promising on Wed, 08 Apr 2026 13:33:34 GMT]]></title><description><![CDATA[<p><span><a href="/user/pojntfx%40mastodon.social">@<span>pojntfx</span></a></span> <span><a href="/user/deobald%40fantastic.earth">@<span>deobald</span></a></span> You found glm 5.1 was better than opus4.6 at coding?? Want to split an h200 ?</p>]]></description><link>https://board.circlewithadot.net/post/https://mastodon.social/users/purpleidea/statuses/116369340146244840</link><guid isPermaLink="true">https://board.circlewithadot.net/post/https://mastodon.social/users/purpleidea/statuses/116369340146244840</guid><dc:creator><![CDATA[purpleidea@mastodon.social]]></dc:creator><pubDate>Wed, 08 Apr 2026 13:33:34 GMT</pubDate></item><item><title><![CDATA[Reply to Damn those Mythos benchmarks seem very promising on Wed, 08 Apr 2026 07:23:27 GMT]]></title><description><![CDATA[<p><span><a href="/user/deobald%40fantastic.earth">@<span>deobald</span></a></span> If you'e like to try for yourself I've documented it here: <a href="https://gist.github.com/pojntfx/5916ceb7ec35eb010010400447e9c034" rel="nofollow noopener"><span>https://</span><span>gist.github.com/pojntfx/5916ce</span><span>b7ec35eb010010400447e9c034</span></a></p>]]></description><link>https://board.circlewithadot.net/post/https://mastodon.social/users/pojntfx/statuses/116367884771239435</link><guid isPermaLink="true">https://board.circlewithadot.net/post/https://mastodon.social/users/pojntfx/statuses/116367884771239435</guid><dc:creator><![CDATA[pojntfx@mastodon.social]]></dc:creator><pubDate>Wed, 08 Apr 2026 07:23:27 GMT</pubDate></item><item><title><![CDATA[Reply to Damn those Mythos benchmarks seem very promising on Wed, 08 Apr 2026 07:22:29 GMT]]></title><description><![CDATA[<p><span><a href="/user/deobald%40fantastic.earth">@<span>deobald</span></a></span> I'm pretty happy about mostly working with higher-level, memory-safe languages</p>]]></description><link>https://board.circlewithadot.net/post/https://mastodon.social/users/pojntfx/statuses/116367880988804739</link><guid isPermaLink="true">https://board.circlewithadot.net/post/https://mastodon.social/users/pojntfx/statuses/116367880988804739</guid><dc:creator><![CDATA[pojntfx@mastodon.social]]></dc:creator><pubDate>Wed, 08 Apr 2026 07:22:29 GMT</pubDate></item><item><title><![CDATA[Reply to Damn those Mythos benchmarks seem very promising on Wed, 08 Apr 2026 07:21:42 GMT]]></title><description><![CDATA[<p><span><a href="/user/deobald%40fantastic.earth">@<span>deobald</span></a></span> And yeah re:Mythos I'll believe it when I see it, but current-gen models except free is already a massive value IMHO. Sonnet etc. is still very useful despite the other models existing</p>]]></description><link>https://board.circlewithadot.net/post/https://mastodon.social/users/pojntfx/statuses/116367877896992197</link><guid isPermaLink="true">https://board.circlewithadot.net/post/https://mastodon.social/users/pojntfx/statuses/116367877896992197</guid><dc:creator><![CDATA[pojntfx@mastodon.social]]></dc:creator><pubDate>Wed, 08 Apr 2026 07:21:42 GMT</pubDate></item><item><title><![CDATA[Reply to Damn those Mythos benchmarks seem very promising on Wed, 08 Apr 2026 07:20:24 GMT]]></title><description><![CDATA[<p><span><a href="/user/deobald%40fantastic.earth">@<span>deobald</span></a></span> Yup, I used Qwen 3.6 with Nanobot via OpenRouter, Alibaba was providing it for free for testing until yesterday. Switched to GLM 5.1 earlier - same thing, beats Opus. GLM's weights are even MIT-licensed</p>]]></description><link>https://board.circlewithadot.net/post/https://mastodon.social/users/pojntfx/statuses/116367872813830921</link><guid isPermaLink="true">https://board.circlewithadot.net/post/https://mastodon.social/users/pojntfx/statuses/116367872813830921</guid><dc:creator><![CDATA[pojntfx@mastodon.social]]></dc:creator><pubDate>Wed, 08 Apr 2026 07:20:24 GMT</pubDate></item><item><title><![CDATA[Reply to Damn those Mythos benchmarks seem very promising on Wed, 08 Apr 2026 07:16:45 GMT]]></title><description><![CDATA[<p><span><a href="/user/pojntfx%40mastodon.social">@<span>pojntfx</span></a></span> have you actually seen qwen perform this well? or are you basing that comment on benchmarks?</p><p>i think the mythos benchmarks only have to be "some amount better" at finding 0days than the current public models to justify them waiting on ga... quite a few maintainers are already swamped.</p>]]></description><link>https://board.circlewithadot.net/post/https://fantastic.earth/users/deobald/statuses/116367858403845787</link><guid isPermaLink="true">https://board.circlewithadot.net/post/https://fantastic.earth/users/deobald/statuses/116367858403845787</guid><dc:creator><![CDATA[deobald@fantastic.earth]]></dc:creator><pubDate>Wed, 08 Apr 2026 07:16:45 GMT</pubDate></item><item><title><![CDATA[Reply to Damn those Mythos benchmarks seem very promising on Wed, 08 Apr 2026 00:34:04 GMT]]></title><description><![CDATA[<p><span><a href="/user/justin%40toot.io">@<span>justin</span></a></span> Idk this argument has been had like a million times on here and at this point it's getting tiring. It's useful in some contexts. Can be the opposite of that in others. It's being used by more and more projects and people every day with pretty good success lately.</p>]]></description><link>https://board.circlewithadot.net/post/https://mastodon.social/users/pojntfx/statuses/116366274998543909</link><guid isPermaLink="true">https://board.circlewithadot.net/post/https://mastodon.social/users/pojntfx/statuses/116366274998543909</guid><dc:creator><![CDATA[pojntfx@mastodon.social]]></dc:creator><pubDate>Wed, 08 Apr 2026 00:34:04 GMT</pubDate></item><item><title><![CDATA[Reply to Damn those Mythos benchmarks seem very promising on Wed, 08 Apr 2026 00:32:33 GMT]]></title><description><![CDATA[<p><span><a href="/user/justin%40toot.io">@<span>justin</span></a></span> Degradation of creativity is a real problem, yes, but "why are you painting a picture of me when you can just take a photo" is nothing new</p>]]></description><link>https://board.circlewithadot.net/post/https://mastodon.social/users/pojntfx/statuses/116366269053731788</link><guid isPermaLink="true">https://board.circlewithadot.net/post/https://mastodon.social/users/pojntfx/statuses/116366269053731788</guid><dc:creator><![CDATA[pojntfx@mastodon.social]]></dc:creator><pubDate>Wed, 08 Apr 2026 00:32:33 GMT</pubDate></item><item><title><![CDATA[Reply to Damn those Mythos benchmarks seem very promising on Wed, 08 Apr 2026 00:32:00 GMT]]></title><description><![CDATA[<p><span><a href="/user/justin%40toot.io">@<span>justin</span></a></span> I don't believe in IP, there is no such thing as "theft" of intellectual "property". Copyleft was a means to get to this at some point and might still be a way to get there but times are changing</p><p>"garbage AI causes FOSS to deal with on a daily basis" - again, something changed here. It's not useless slop AI security reports anymore like a few months ago. systemd uses it, curl uses, Linux uses because it's useful</p>]]></description><link>https://board.circlewithadot.net/post/https://mastodon.social/users/pojntfx/statuses/116366266861217745</link><guid isPermaLink="true">https://board.circlewithadot.net/post/https://mastodon.social/users/pojntfx/statuses/116366266861217745</guid><dc:creator><![CDATA[pojntfx@mastodon.social]]></dc:creator><pubDate>Wed, 08 Apr 2026 00:32:00 GMT</pubDate></item><item><title><![CDATA[Reply to Damn those Mythos benchmarks seem very promising on Wed, 08 Apr 2026 00:30:12 GMT]]></title><description><![CDATA[<p><span><a href="/user/pojntfx%40mastodon.social">@<span>pojntfx</span></a></span> useful doesn't excuse theft, degradation of creativity and the amount of garbage that AI causes FOSS to deal with on a daily basis.</p>]]></description><link>https://board.circlewithadot.net/post/https://toot.io/users/justin/statuses/116366259812371863</link><guid isPermaLink="true">https://board.circlewithadot.net/post/https://toot.io/users/justin/statuses/116366259812371863</guid><dc:creator><![CDATA[justin@toot.io]]></dc:creator><pubDate>Wed, 08 Apr 2026 00:30:12 GMT</pubDate></item><item><title><![CDATA[Reply to Damn those Mythos benchmarks seem very promising on Wed, 08 Apr 2026 00:29:58 GMT]]></title><description><![CDATA[<p><span><a href="/user/justin%40toot.io">@<span>justin</span></a></span> Meh, the abolition of copyright is a nice side effect</p><p>Endless slop polluting clean datasources is a big problem, yes, but not using LLMs for something that is _not_ that won't change it</p>]]></description><link>https://board.circlewithadot.net/post/https://mastodon.social/users/pojntfx/statuses/116366258864807221</link><guid isPermaLink="true">https://board.circlewithadot.net/post/https://mastodon.social/users/pojntfx/statuses/116366258864807221</guid><dc:creator><![CDATA[pojntfx@mastodon.social]]></dc:creator><pubDate>Wed, 08 Apr 2026 00:29:58 GMT</pubDate></item><item><title><![CDATA[Reply to Damn those Mythos benchmarks seem very promising on Wed, 08 Apr 2026 00:29:05 GMT]]></title><description><![CDATA[<p><span><a href="/user/justin%40toot.io">@<span>justin</span></a></span> Something changed, either in the harness or the models idk but something changed ~Nov of last year, maybe ~Feb this year I'm not sure, but it's gone from "useless" to "useful" pretty quickly.</p>]]></description><link>https://board.circlewithadot.net/post/https://mastodon.social/users/pojntfx/statuses/116366255433261013</link><guid isPermaLink="true">https://board.circlewithadot.net/post/https://mastodon.social/users/pojntfx/statuses/116366255433261013</guid><dc:creator><![CDATA[pojntfx@mastodon.social]]></dc:creator><pubDate>Wed, 08 Apr 2026 00:29:05 GMT</pubDate></item><item><title><![CDATA[Reply to Damn those Mythos benchmarks seem very promising on Wed, 08 Apr 2026 00:28:28 GMT]]></title><description><![CDATA[<p><span><a href="/user/pojntfx%40mastodon.social">@<span>pojntfx</span></a></span> AI has far more issues than just energy use.</p>]]></description><link>https://board.circlewithadot.net/post/https://toot.io/users/justin/statuses/116366253022312003</link><guid isPermaLink="true">https://board.circlewithadot.net/post/https://toot.io/users/justin/statuses/116366253022312003</guid><dc:creator><![CDATA[justin@toot.io]]></dc:creator><pubDate>Wed, 08 Apr 2026 00:28:28 GMT</pubDate></item><item><title><![CDATA[Reply to Damn those Mythos benchmarks seem very promising on Wed, 08 Apr 2026 00:27:55 GMT]]></title><description><![CDATA[<p><span><a href="/user/justin%40toot.io">@<span>justin</span></a></span> The fix isn't to not use useful tools it's to a) deregulate clean energy infrastructure so that we expand them China-style and b) make sure that the models are open so you can run them on clean energy right now</p><p>This is the same argument like with EVs "but the grid is dirty" like yes. Fix that. Don't be anti-EV because of it</p>]]></description><link>https://board.circlewithadot.net/post/https://mastodon.social/users/pojntfx/statuses/116366250824772203</link><guid isPermaLink="true">https://board.circlewithadot.net/post/https://mastodon.social/users/pojntfx/statuses/116366250824772203</guid><dc:creator><![CDATA[pojntfx@mastodon.social]]></dc:creator><pubDate>Wed, 08 Apr 2026 00:27:55 GMT</pubDate></item><item><title><![CDATA[Reply to Damn those Mythos benchmarks seem very promising on Wed, 08 Apr 2026 00:21:45 GMT]]></title><description><![CDATA[<p><span><a href="/user/pojntfx%40mastodon.social">@<span>pojntfx</span></a></span> I really don't get the excitement around tech that destroys the earth more than we as humanity have in our history so far?</p>]]></description><link>https://board.circlewithadot.net/post/https://toot.io/users/justin/statuses/116366226589876530</link><guid isPermaLink="true">https://board.circlewithadot.net/post/https://toot.io/users/justin/statuses/116366226589876530</guid><dc:creator><![CDATA[justin@toot.io]]></dc:creator><pubDate>Wed, 08 Apr 2026 00:21:45 GMT</pubDate></item><item><title><![CDATA[Reply to Damn those Mythos benchmarks seem very promising on Tue, 07 Apr 2026 20:53:46 GMT]]></title><description><![CDATA[<p>Qwen 3.6 is essentially the same as Opus 4.6 now so I guess we'll see how the new generation stacks up?</p>]]></description><link>https://board.circlewithadot.net/post/https://mastodon.social/users/pojntfx/statuses/116365408781936419</link><guid isPermaLink="true">https://board.circlewithadot.net/post/https://mastodon.social/users/pojntfx/statuses/116365408781936419</guid><dc:creator><![CDATA[pojntfx@mastodon.social]]></dc:creator><pubDate>Tue, 07 Apr 2026 20:53:46 GMT</pubDate></item><item><title><![CDATA[Reply to Damn those Mythos benchmarks seem very promising on Tue, 07 Apr 2026 20:53:18 GMT]]></title><description><![CDATA[<p>Wild that they don't seem to be making it GA, makes me suspect it's probably actually not as good as they say</p>]]></description><link>https://board.circlewithadot.net/post/https://mastodon.social/users/pojntfx/statuses/116365406933731032</link><guid isPermaLink="true">https://board.circlewithadot.net/post/https://mastodon.social/users/pojntfx/statuses/116365406933731032</guid><dc:creator><![CDATA[pojntfx@mastodon.social]]></dc:creator><pubDate>Tue, 07 Apr 2026 20:53:18 GMT</pubDate></item></channel></rss>