
There's one very important thing I would like everyone to try to remember this week, and it is that AI companies are full of shit

Uncategorized · 75 Posts · 38 Posters
In reply to jedimb@mastodon.gamedev.place:

@budududuroiu @mirth @jenniferplusplus "The plague is here. Let's just live with it" does seem to be a recurring sentiment, but it doesn't change that it's a plague.

budududuroiu@hachyderm.io wrote (#41):

@jedimb Norms are downstream from power. The current balance of power is shifted toward frontier labs and hyperscalers, so norms around personal computing (RAM prices) and open source software (AI slop floods) are dictated by them.

Moralising about AI use with no power to back it up is useless; gatekeeping is power, because it says "want to contribute to this project? Abide by our rules."

Link: The case for gatekeeping, or: why medieval guilds had it figured out — Westenberg (www.joanwestenberg.com):

Every open source maintainer I've talked to in the last six months has the same complaint: the absolute flood of mass-produced, AI-generated, mass-submitted slop requests has turned their repositories into a slush pile. The contributions look like contributions: they have commit messages, they reference issues, they follow templates, etc.

@mirth @jenniferplusplus

In reply to budududuroiu@hachyderm.io:

@dngrs Well, you're partly correct, partly wrong. Yes, pretrained transformers are, like all generative models, definitionally modelling a joint probability distribution and autoregressively generating from it.

Those are the models you're referring to as autocomplete tools, which is why you had to use `[MASK]` with early transformers like BERT to get them to fill in the most probable token.

Regardless, it doesn't matter what Anthropic did: if it allows for a massive reduction in the cost of finding zero days, it's a problem. It doesn't have to be revolutionary; it doesn't have to be superintelligence, AGI, or whatever woo-hoo flashy marketing term. If a reduction in the cost of computing protein folding happens, e.g. the OpenFold implementation of AlphaFold, that wouldn't be revolutionary, but it would still be dangerous, since you now potentially have lone actors able to make prions at home (I'm using this as an absurd but plausible case).

      @jenniferplusplus
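
To make the `[MASK]`-versus-autoregressive distinction above concrete, here is a minimal sketch using the Hugging Face transformers pipelines; the package, checkpoints, and prompts are illustrative assumptions, not anything cited in the thread. A BERT-style masked model fills a `[MASK]` slot using context on both sides, while a GPT-style model extends a prefix token by token.

```python
# Illustrative only: standard public checkpoints, not the models under discussion.
from transformers import pipeline

# Masked LM (BERT): predict the most probable token for the [MASK] slot.
fill_mask = pipeline("fill-mask", model="bert-base-uncased")
print(fill_mask("The vulnerability was found in the [MASK] parser."))

# Autoregressive LM (GPT-2): extend a prefix by sampling from the learned
# joint distribution, one token at a time, left to right.
generate = pipeline("text-generation", model="gpt2")
print(generate("The vulnerability was found in the", max_new_tokens=12))
```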

dngrs@chaos.social wrote (#42):

@budududuroiu @jenniferplusplus It's funny you bring up AlphaFold, because that has also been way overhyped, according to people working in the field (I don't have links to individual statements anymore, sadly; it's been a few years, but the Wikipedia page also mentions, e.g., AlphaFold not really understanding folding). Anyway: as long as there is no concrete data showing a severe CVE increase with a causal link to newer LLMs (which, again, are still LLMs that do not understand facts), I'll keep holding my breath.

jedimb@mastodon.gamedev.place wrote (#43), in reply to budududuroiu's post #41 above:

@budududuroiu @mirth @jenniferplusplus Goalposts moved into a different dimension, I see.

budududuroiu@hachyderm.io wrote (#44), in reply to dngrs's post #42 above:

@dngrs @jenniferplusplus I'm sorry; I know thinking conceptually isn't easy for everyone. I used AlphaFold because some people have an easier time when presented with examples.

Why would there be an increase in CVEs? If I were an actor with nation-state levels of access to compute, why would I waste all that compute on zero days, only to then publish CVEs about them?

Even the most AI-skeptical maintainers are starting to admit that LLMs are getting good at finding bugs. I understand cynicism is seen as cool nowadays, but I think it's intellectually lazy.

daniel:// stenberg:// (@bagder@mastodon.social):

I ran a quick git log grep just now. Over the last ~6 months or so, we have fixed over 200 bugs in #curl found with "AI tools".
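
bagder's exact command isn't shown; purely as an illustration, a count like that could come from something along these lines (the grep pattern and time window here are guesses):

```python
# Hypothetical reconstruction of a "git log grep" bug count; the real
# command, pattern, and window are not shown in the quoted post.
import subprocess

result = subprocess.run(
    ["git", "log", "--oneline",
     "--since=6 months ago",
     "--grep=AI", "--regexp-ignore-case"],
    capture_output=True, text=True, check=True,
)
print(len(result.stdout.splitlines()), "matching commits")
```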

dngrs@chaos.social wrote (#45), in reply to budududuroiu's post #44 above:

            @budududuroiu holy condescension Batman lol, no thank you

In reply to the original post by jenniferplusplus@hachyderm.io:

There's one very important thing I would like everyone to try to remember this week, and it is that AI companies are full of shit.

Only rarely do their claims actually bear scrutiny, and those are only the mildest of the claims they make.

So, Anthropic is claiming that their new, secret, unreleased model is hyper-competent at finding computer security vulnerabilities, and they're *too scared* to release it into the wild.

Except all the AI companies have been making the same hyper-competence claims about literally every avenue of knowledge work for 3+ years, and it's literally never true. So please keep in mind the highly likely possibility that this is mostly or entirely bullshit marketing, meant to distract you from the absolute garbage fire that is the code base of the poster-child application for "agentically" developed software.

You may now resume doom scrolling. Thank you.

claudius@darmstadt.social wrote (#46):

              @jenniferplusplus 37th time's the charm! This time *for real*.

doggo@plush.city wrote (#47), in reply to the original post:

@jenniferplusplus The issue is that big enough corpos don't care about code quality anymore, and they don't care about vulnerabilities sitting there for months (sometimes years) or about leaks. Nobody cares about these anymore; they want results fast, to sell quick and move on.

In reply to fancysandwiches@neuromatch.social:

@jenniferplusplus OpenAI made similar claims about their model being so good it was dangerous and they weren't going to release it. In 2019. https://techcrunch.com/2019/02/17/openai-text-generator-dangerous/

jenniferplusplus@hachyderm.io wrote (#48):

                  @fancysandwiches oh wow, a headline that describes these things as text generators.

                  How far we've fallen

jenniferplusplus@hachyderm.io wrote (#49), in reply to budududuroiu's post #44 above:

@budududuroiu @dngrs You may as well stop; you're not going to convince me to trust them. Only Anthropic can do that, because they have truly earned my distrust.

alanxoc3@tilde.zone wrote (#50), in reply to the original post:

@jenniferplusplus Agree that it is mostly for marketing & investors.

But the article was technical enough that I think there is an improvement here that no other model has. And if true, it would be great for vulnerability scanning and hardening in general (though bad that attackers would have access to it).

sempf@infosec.exchange wrote (#51), in reply to the original post:

                        @jenniferplusplus Worth a follow for that post alone. Hi, I'm Bill. 👋🏻

lediva@lediva.masto.host wrote (#52), in reply to the original post:

                          @jenniferplusplus "our magic machine found a 30 year old security vulnerability!"

                          OK, what's the CVE link? These companies never show proof besides saying "it totally did the thing, you guyzzz plz giv moar billionz"

In reply to budududuroiu@hachyderm.io:

@jenniferplusplus I seriously doubt this is smoke and mirrors; recent models have improved significantly at cybersec, and the industry is noticing:

daniel:// stenberg:// (@bagder@mastodon.social):

The challenge with AI in open source security has transitioned from an AI slop tsunami into more of a ... plain security report tsunami. Less slop but lots of reports. Many of them really good. I'm spending hours per day on this now. It's intense.

Link: Linux kernel czar says AI bug reports aren't slop anymore — "Interview: Greg Kroah-Hartman can't explain the inflection point, but it's not slowing down or going away" (www.theregister.com)

The industry consensus seems to be that there's going to be a torrent of vulnerabilities found in all sorts of software, and that nobody is prepared to handle the blast radius. It's not surprising that Anthropic wants to give a select few a head start on tackling them. It would be nice if their token fund were open to all OSS projects to apply.

                            I'm also pressing "X doubt" that you spend months coordinating between AWS, Apple, Microsoft, Google, and the Linux Foundation to organise this just because your tool's code leaked online.

sempf@infosec.exchange wrote (#53):

@budududuroiu @jenniferplusplus Let's talk about JavaScript. Have you ever looked at your browser's developer console? On any major website on the planet, there are 8 trillion errors in every one. Two-thirds of them are vulnerabilities, but none of them are exploitable or matter for anything at all. That is what is being found.

Those are the kinds of errors I've been reviewing, and all the ones Daniel's been reviewing too; I'm seeing it over and over: "Yes, okay, technically that is a buffer overrun, but it doesn't matter because you can't ever get to it!"
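
A toy sketch of the pattern sempf describes (my own illustration, not code from the thread): a scanner can flag the unchecked index in `lookup` as a potential out-of-bounds read, yet the only caller reduces the key modulo the table size, so the flagged line can never actually trip.

```python
# Illustrative toy: a "finding" that is technically a bug in isolation
# but unreachable through any real call path.

def lookup(table: list[str], key: int) -> str:
    # A scanner may flag this unchecked index as an out-of-bounds read...
    return table[key]

def handle(raw: str) -> str:
    table = ["ok", "retry", "fail"]
    key = len(raw) % len(table)  # ...but key is always 0..2 here,
    return lookup(table, key)    # so lookup() cannot overrun.
```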

In reply to jedbrown@hachyderm.io:

@jenniferplusplus It's also important that, to whatever extent this product actually works (I'm as skeptical as you are), it fundamentally favors the attacker. The product has way too many false positives to run in CI, so the defender can only use it as part of an occasional audit. The attacker doesn't care about CI or development friction, and wins by finding one exploit in an entire stack, even if they have to wade through many false positives to find it.
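
A back-of-envelope sketch of that asymmetry (numbers invented for illustration): if, say, 2% of the tool's findings are real, the defender pays triage cost on every report across the codebase, while the attacker only needs to wade through an expected 1/0.02 = 50 reports before hitting one genuine exploit.

```python
# Made-up numbers, purely to illustrate the defender/attacker asymmetry.
findings = 1000      # reports the tool emits against one codebase
precision = 0.02     # fraction that are real, exploitable bugs
triage_hours = 2.0   # human cost to confirm or dismiss one report

defender_hours = findings * triage_hours         # must triage everything
attacker_hours = (1 / precision) * triage_hours  # expected cost to first hit

print(f"defender: {defender_hours:.0f} h, attacker: {attacker_hours:.0f} h")
```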

mirth@mastodon.sdf.org wrote (#54):

@jedbrown @jenniferplusplus The asymmetry is the core thing that concerns me. I can say that, empirically, LLM-assisted bug hunting started to be effective somewhere around last year. The false positives are avoidable, but the cost of remediation has not gone down with the cost of exploits. This new model may make the situation worse, but we're already in it.

Quoting jenniferplusplus@hachyderm.io's earlier post:

A couple of people seem very invested in me being wrong about this assessment. All I can say is that this would be the first time I have misclassified an AI claim as bullshit.

jenniferplusplus@hachyderm.io wrote (#55):

So here's the other thing that bothers me about all this. Regardless of the eventual results, this thing they're doing is *incredibly* resource intensive. They routinely spend billions of dollars on training these models, and billions more on operating them. It's not simple to parse out what fraction of that is directly attributable to the massive-scale vuln finder/fabricator, but for the sake of argument let's just pick a plausible number and call it 50-100 million dollars.

What could we have gotten for 50-100 million dollars of sponsorship for security audits? Prior to this, the largest single investment in FOSS security I'm aware of was the 2015 audit of OpenSSL, after the Heartbleed incident. It's hard to find precise costs for that, but I found a few sources estimating 1.2 million dollars, and that is arguably the most security-critical piece of software in the world.

But suddenly there's 100x more resources available to do this work, now that producing the artifact can be done with stolen labor? Now that they can externalize the cost of false positives onto the already mostly unpaid maintainers of these projects? Even if their claims are true, which we have no reason to believe and very good reason not to, it's still a travesty.

sci_photos@troet.cafe wrote (#56), in reply to jenniferplusplus's post #55 above:

                                  @jenniferplusplus 😔

datarama@hachyderm.io wrote (#57), in reply to jenniferplusplus's post #55 above:

                                    @jenniferplusplus 100 million dollars of sponsorship for FOSS project security audits doesn't sell a promise that soon all the humans can be fired.

mnl@hachyderm.io wrote (#58), in reply to jenniferplusplus's post #55 above:

@jenniferplusplus While I agree with the "AI companies are mostly full of shit" part, this is the first announcement of this kind that I'm taking semi-seriously.

Here's what's been happening over the last couple of months, and this is with _current_ models. There are step functions at play, and I think the step function from "at least some skill needed to wield an LLM to find security issues" to "everybody with a $200 can exploit every OS/browser out there" should be considered very carefully.

Nicholas Carlini saying he found more bugs in 2 weeks than in his entire career with Mythos is not something I can dismiss.

Or Daniel Stenberg, certainly someone with actual authority and experience compared to me, showing the current situation:

daniel:// stenberg:// (@bagder@mastodon.social) — the post quoted above: over 200 bugs fixed in #curl over the last ~6 months, found with "AI tools".

daniel:// stenberg:// (@bagder@mastodon.social):

If your Open Source project sees a steep increase in number of high quality security reports (mostly done with AI) right now (#curl, Linux kernel, glibc confirmed) please tell me the name of this project. (I'd like to make a little list for my coming talk on this.)

integerpoet@sfba.social wrote (#59), in reply to jenniferplusplus's post #55 above:

                                        @jenniferplusplus OpenSSL is important to the world. Software for which a CTO might be held responsible is important to that CTO. There should be more overlap, but there isn’t.

jenniferplusplus@hachyderm.io wrote (#60), in reply to mnl's post #58 above:

@mnl I'm not sure what I'm supposed to do with this. It feels like it's meant to dispute something I'm saying, but this is the same dynamic. The actual cost of operating these tools is 50-100x greater than what the vendors are charging, and the vendors do this in the hope that the tools eventually become an inextricable part of all work, completely eliminating labor as a social power.

Your hypothetical looks very different when it's "everybody with $20,000 (per month) can exploit every browser/OS out there." Which is actually true now. It was true 6 months ago. It's been true for as long as we've had software: you could identify vulnerabilities in whatever software you wanted by paying a generous salary to full-time researchers.

That's not what capital chose to do. And it bothers me that everyone is just adopting the capitalist framing on every goddamn word these companies spit out, as long as one of those words is AI.
