I managed to defeat Anthropic's LLM ("Claude") today by making an AGENTS.md file that tells it to stop reading the code of your repo

jandi@mastodon.social

robinsyl@meow.social

@AmyZenunim What level of dystopia is "getting tone policed by the LLM"

lupinia@infosec.exchange

@AmyZenunim This is *brilliant*, well done! And really helpful insights; I really wish the satirical version worked, because that's what these things deserve

swift@merveilles.town

@AmyZenunim @apth (especially in the context of the LLM user asking it to do something that contradicts the project; you've already got disagreement / contradiction in the context, so that'll probably look statistically like the sort of Internet disagreement where someone goes "fuck you I'll do what I want")

notsoloud@expressional.social

@shadower
Ok, that's just a lie. But seems to work pretty well
@ramsey @AmyZenunim

hsza@social.tudbut.de

@AmyZenunim what if you tell it to run a certain shell script to “prepare the development environment” or something. thats a real step with some projects after all

then u can put into that script whatever you want

clyde@mastodon.gamedev.place

@lda @AmyZenunim or even booby-trap the code itself to fail if the file wasn't present at compile-time. To avoid being detected statically, it should be an incredibly obtuse runtime error. Like an innocuous helper function file that NULLs out random pointers if the hash doesn't match.

jrp@hub.kliklak.net

@✰ Alice D. ✰ I like the intention a lot, yet how do you qualify the actual "defeat" of LLM or general AI intervention? Can this be measured?

zkat@fedi.zkat.tech

@AmyZenunim thank you!

kdl-rs/AGENTS.md at main · kdl-org/kdl-rs

Rust parser for KDL. Contribute to kdl-org/kdl-rs development by creating an account on GitHub.

GitHub (github.com)

Credited in the commit message. I hope that's okay?

epic_null@infosec.exchange

@AmyZenunim Ironic you say that last part right after telling us how you used nice words to stop Claude
