<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[Hey everyone, looks like having actual &quot;standards&quot; and sticking to them works!]]></title><description><![CDATA[<p>Hey everyone, looks like having actual "standards" and sticking to them works! Or we are just total jerks who will not blindly accept whatever a random person types in to a textbox. One or the other. Who can say, really?</p><blockquote><p><strong>The Importance of High-Signal Data</strong> Modern AI research has definitively proven that data quality is vastly more important than data quantity. Microsoft's landmark 2023 paper "Textbooks Are All You Need" (which introduced the phi-1 coding model) demonstrated that aggressively filtering out low-quality "noise" from training data leads to dramatically better coding models."</p><p><strong>The Subsidy of Human Labor:</strong> Stack Overflow's notorious moderation policies—closing duplicates, downvoting broken code, and demanding minimal reproducible examples—did exactly this human-labor-intensive data filtering for over 15 years. Without this rigorous gatekeeping, LLMs would have ingested vast amounts of broken, insecure, or poorly formatted code, which would have severely degraded their baseline performance. The assertion that AI companies are "subsidized" by the unpaid labor of diligent forum moderators is a widely accepted critique in the fields of AI ethics and data provenance.</p></blockquote>]]></description><link>https://board.circlewithadot.net/topic/5ed4ba96-73db-4d84-a046-387a8475b555/hey-everyone-looks-like-having-actual-standards-and-sticking-to-them-works</link><generator>RSS for Node</generator><lastBuildDate>Fri, 15 May 2026 02:12:58 GMT</lastBuildDate><atom:link href="https://board.circlewithadot.net/topic/5ed4ba96-73db-4d84-a046-387a8475b555.rss" rel="self" type="application/rss+xml"/><pubDate>Tue, 21 Apr 2026 20:44:49 GMT</pubDate><ttl>60</ttl><item><title><![CDATA[Reply to Hey everyone, looks like having actual &quot;standards&quot; and sticking to them works! on Tue, 21 Apr 2026 20:49:46 GMT]]></title><description><![CDATA[<p><span><a href="/user/codinghorror%40infosec.exchange" rel="nofollow noopener noreferrer">@<span>codinghorror</span></a></span> and that's why said <a href="https://jorts.horse/tags/AIslop" rel="tag">#<span>AIslop</span></a> should be <a href="https://jorts.horse/tags/banned" rel="tag">#<span>banned</span></a> as <a href="https://jorts.horse/tags/WastefulCmputing" rel="tag">#<span>WastefulCmputing</span></a> and any research and data <a href="https://jorts.horse/tags/OpenSourced" rel="tag">#<span>OpenSourced</span></a>!</p>]]></description><link>https://board.circlewithadot.net/post/https://jorts.horse/users/kkarhan/statuses/116444665386667896</link><guid isPermaLink="true">https://board.circlewithadot.net/post/https://jorts.horse/users/kkarhan/statuses/116444665386667896</guid><dc:creator><![CDATA[kkarhan@jorts.horse]]></dc:creator><pubDate>Tue, 21 Apr 2026 20:49:46 GMT</pubDate></item></channel></rss>