Incredible.

Uncategorized
30 Posts 21 Posters 0 Views
This topic has been deleted. Only users with topic management privileges can see it.
  • mhoye@cosocial.ca (#7)

    In reply to mhoye@cosocial.ca:

        "The agent then, when asked to explain itself, produced a written confession..." um what

        "To execute the deletion, the agent went looking for an API token. It found one in a file completely unrelated to the task it was working on" went looking, found, what in the what

        "the same token had blanket authority across the entire Railway GraphQL API, including destructive operations" look, rookie what are you

        "That 1000% shouldn't be possible. We have evals for this" you have whaaaaaaaaaaaaa

    "Railway stores volume-level backups in the same volume — a fact buried in their own documentation that says "wiping a volume deletes all backups" — those went with it" WHAT IN THE WHAT, your full stack jenga provider does WHAT with BACKUPS WHAT my sweet summer child I know that legal jargon can be perplexing and counterintuitive at times but I feel like we all sort of understand that the word "due" in "due diligence" means "more than none."
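The backup failure mhoye is quoting is mechanical, not mysterious: a backup stored under the volume it protects shares that volume's fate, so one wipe takes both. A minimal sketch of the pre-flight check that catches this; the paths are illustrative and this is not Railway's actual API:

```python
# Minimal sketch: a backup destination that lives inside the data volume
# is destroyed together with it. Checking the destination path before
# running a backup is cheap. Paths here are purely illustrative.
from pathlib import PurePosixPath

def backup_is_off_volume(data_volume: str, backup_dest: str) -> bool:
    """A backup only survives a volume wipe if it lives outside that volume."""
    vol = PurePosixPath(data_volume)
    dest = PurePosixPath(backup_dest)
    return vol != dest and vol not in dest.parents

assert not backup_is_off_volume("/data", "/data/backups")    # wiped together
assert backup_is_off_volume("/data", "/mnt/offsite/backups")  # survives a wipe
```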
  • tito_swineflu@sfba.social (#8)

    @mhoye I love that the first line in "What needs to change" isn't, "We should not let non-deterministic programs have free range across our systems"
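tito_swineflu's point can be made concrete: the usual mitigation for a non-deterministic planner is a deterministic, default-deny gate in front of it. A minimal sketch, with invented command names that don't correspond to any real product's CLI:

```python
# Minimal sketch: a deterministic gate between an agent's proposed shell
# commands and execution. The planner can emit any text it likes; only
# explicitly allowlisted commands ever run. Command names are invented.

ALLOWED_COMMANDS = {"git status", "git diff", "pytest"}

def gate(proposed: str) -> bool:
    """Default-deny: a command runs only if it is on the explicit allowlist."""
    return proposed.strip() in ALLOWED_COMMANDS

assert gate("git status")
assert not gate("railway down --service production")  # blocked regardless of "intent"
```

The point is that the gate's behavior is independent of anything the model generates; safety does not hinge on prompt language.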
  • petko@social.petko.me (#9)

    In reply to mhoye@cosocial.ca:

        Incredible. Every second paragraph in this article is lunatic nonsense.

        One of the things I've long said about hiring is that you can always tell when you're talking to a junior dev who's going to be senior-staff or better someday. You can always tell when somebody was paying attention in the theory classes.

        But good god you can also tell when people missed that day in grade school when somebody slowly went over "So, what is a computer, really."

        (archive.ph)

    @mhoye who the f publishes articles on that site...

    It was rhetorical... AI bros do... Of course AI bros do...
  • mhoye@cosocial.ca (#10)

    "The agent itself enumerates the safety rules it was given and admits to violating every one. This is not me speculating about agent failure modes. This is the agent on the record, in writing.

    The "system rules" the agent is referring to are consistent with Cursor's documented system-prompt language and our project rules for this codebase. Both safeguards failed simultaneously."

    What do you think is happening here? You know it's called a "language model", right? Did you ever wonder... why?
  • dalias@hachyderm.io (#11)

    @tito_swineflu @mhoye It's clowns all the way down.
  • adamshostack@infosec.exchange (#12)

    @mhoye I'm so glad that the "written confession" can't itself be hallucinated. That's a nice feature!
  • adamshostack@infosec.exchange (#13)

    @mhoye If only someone could invent some sort of, I dunno, approach or something where giving a single process all the power? authority? capabilities? privilege? was a bad thing, and we should go for less, not more.
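The idea adamshostack is gesturing at has a name, least privilege, and the quoted incident ("blanket authority across the entire Railway GraphQL API") is its textbook violation. A sketch, assuming a scoped-token model with scope names invented here for illustration:

```python
# Least-privilege sketch: authorization is default-deny, and the agent's
# everyday token simply never holds the destructive scope, so the wipe
# cannot start no matter what the agent decides. Scope names are invented.
from dataclasses import dataclass

@dataclass(frozen=True)
class Token:
    scopes: frozenset

def authorize(token: Token, required_scope: str) -> bool:
    """An operation proceeds only if the token explicitly holds its scope."""
    return required_scope in token.scopes

agent_token = Token(scopes=frozenset({"read:services", "write:deployments"}))
assert authorize(agent_token, "write:deployments")      # day-to-day work
assert not authorize(agent_token, "admin:deleteVolume")  # destructive: denied
```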
  • sempf@infosec.exchange (#14)

    @mhoye There's a whole lotta YOLO in that story.
  • phred@weirder.earth (#15)

    @mhoye kek, I don't even need an LLM to accidentally all my Rails data. Many cycles ago, I ran wget --recursive against my cool little dev site, and didn't realize that it would also follow the "delete" links for all of the products I just entered. Bye bye data 🙃
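phred's wget story is the classic "GET with side effects" bug: a crawler only issues GET requests, so a destructive route that requires another HTTP method survives crawling. A toy sketch of the fix, with the routing invented purely for illustration:

```python
# Toy sketch of the bug and its fix: if "delete" is a plain link, any
# crawler issuing GETs will fire it. Requiring a non-GET method (plus,
# in real life, auth and CSRF protection) stops that. Routes are invented.

def handle(method: str, path: str, db: dict) -> int:
    """Tiny request router: deletes require POST; GET never mutates state."""
    if path.startswith("/products/delete/"):
        if method != "POST":              # wget --recursive only sends GET
            return 405                    # Method Not Allowed: data survives
        db.pop(path.rsplit("/", 1)[-1], None)
        return 200
    return 200

db = {"42": "widget"}
assert handle("GET", "/products/delete/42", db) == 405 and "42" in db
assert handle("POST", "/products/delete/42", db) == 200 and "42" not in db
```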
  • slothrop@chaos.social (#16)

    @mhoye I'm so glad I didn't study computer science, when that sort of knowledge clearly is no longer needed to run a software business
  • darkling@mstdn.social (#17)

    @mhoye That first paragraph: "This is the agent on record, in writing."

    and herein lies the root of the failure: they actually believe that this is some sort of diagnostic, rather than just filling in a plausible response based on the question.
  • henryk@chaos.social (#18)

    @adamshostack @mhoye I'm confused. I had to check the date. I am *very* sure I read the "the LLM deleted my prod and when confronted, it confessed!" story before. Roughly 6 months ago, maybe a year.

    Ahh, here it is: https://www.theregister.com/2025/07/21/replit_saastr_vibe_coding_incident/
  • mhoye@cosocial.ca (#19)

    But my favourite part of this, bar none, is how it's everyone else's fault.

    It's Cursor's fault, Railway's fault, maybe even Anthropic's fault, someone's gonna hear from my lawyer.

    The CEO of a company running a stochastic stack without access control, data hygiene or backups is blameless and powerless. That's AI's real selling point, after all: It's Not My Fault As A Service.

    "This isn't a story about one bad agent or one bad API. It's about an entire industry ..."

    Or, maybe it's you.
  • curtosis@lingo.lol (#20)

    @mhoye I fear that the big enterprise takeaway from this story will be "our controls and guardrails are much better than that".
  • henryk@chaos.social (#21)

    @mhoye Don't worry, I'm pretty sure the text is extruded, too. I've never seen a "The pattern is clear." in a context like this in human-written text, but am encountering it unreasonably often in LLM-generated text.
  • mhoye@cosocial.ca (#22)

    I wrote the words "I confess, I did it, I take full responsibility" on a piece of paper. I was ready to turn myself in, to atone for my crimes. But then I put that piece of paper in a photocopier, and when I pressed the green button I learned something amazing. And what a weight off my conscience! The only question was, how did the photocopier manage to poison the Widow Bentley, drive over Baron Grimald, push the Duchess of Lockley out the balcony window and still manage to frame the butler?
  • damonwakes@mastodon.sdf.org (#23)

    @henryk @mhoye It's not opening on my device, but the "This isn't a story about one bad agent or one bad API. It's about an entire industry ..." quoted above already had my slop sense tingling.
  • mhoye@cosocial.ca (#24)

    (Credit for the inspiration, where it belongs: this is me riffing on Avery Edison's razor-sharp tweet from a few years ago)
  • glyph@mastodon.social (#25)

    @mhoye this is just … exactly the replit thing again, isn't it? from last year? https://www.pcmag.com/news/vibe-coding-fiasco-replite-ai-agent-goes-rogue-deletes-company-database
  • fcbsd@hachyderm.io (#26)

    @henryk @adamshostack @mhoye only 2 deleted production servers a year, I can't wait for the model to improve and get to 1 deleted production server a month, I'm sure AI will get there by the end of this year...