The general fear around this sort of thing is either infostealer malware - stuff that gets onto a machine, looks for tokens, and then exfiltrates them - or tokens being checked into public code repositories. There's a big market for this, both in terms of being able to access the data the token grants access to and in terms of being able to use expensive resources (if an AWS token that grants the ability to spin up VMs escapes, you'll suddenly find you've spent a shitload on bitcoin mining).
But having LLMs running around trying to figure out how they can achieve a thing means having all these tokens on disk where the LLM can get at them is also a risk. What if it writes a tool to make its own queries, embeds the token it found on disk, and publishes that somewhere? What if it just decides to drop it into Slack? The non-deterministic chaos agent is potentially going to make my day Extremely Bad.
-
The most common approach I've found to solving this is to run a proxy. The real token ends up in the proxy, and the CLI tool gets a placeholder. All queries get sent via the proxy, which replaces the placeholder with the real token. This works, but it's actually harder than it sounds (e.g. if these are OAuth tokens, the proxy probably needs to handle token refresh, and a bunch of these tools are third-party binaries that are hard to adapt to this flow).
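As an illustration of the substitution step at the heart of such a proxy - a minimal sketch, assuming the placeholder travels as a standard Authorization bearer header (the names here are made up for the example, not from any real deployment):

```python
# Sketch of the header-rewriting core of a token-substituting proxy.
# PLACEHOLDER and swap_token are illustrative assumptions.

PLACEHOLDER = "Bearer dummy-token"  # what the CLI tool is handed

def swap_token(headers: dict, real_token: str) -> dict:
    """Return a copy of the request headers with the placeholder
    bearer token replaced by the real one, ready to forward upstream.
    Headers that don't carry the placeholder pass through unchanged."""
    out = dict(headers)
    if out.get("Authorization") == PLACEHOLDER:
        out["Authorization"] = f"Bearer {real_token}"
    return out
```

A real proxy wraps this in an actual HTTP listener and, as noted above, probably also has to own the OAuth refresh flow so the real token it holds stays valid.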
-
So: FUSE. This is very much PoC stage so I'm not publishing it, but I now have a working toy FUSE filesystem that implements a very simple control - the files created can only be read by the app that created them. The CLI tools can obtain their tokens and store them, and can then read them back on re-invocation. But if the agent itself tries to read the file, it gets told to fuck off.
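The PoC itself isn't published, so purely as an illustration of the kind of check such a filesystem can make: libfuse exposes the caller's PID via the request context (fuse_get_context()), and on Linux a PID can be resolved to an executable path through /proc/<pid>/exe and compared against whatever binary created the file. A rough sketch of that comparison, with the /proc lookup injectable so the logic is testable - all names here are hypothetical, not taken from the actual PoC:

```python
import os

def resolve_exe(pid: int) -> str:
    """Resolve a PID to its executable path (Linux-specific)."""
    return os.readlink(f"/proc/{pid}/exe")

def caller_allowed(creator_exe: str, caller_pid: int, resolve=resolve_exe) -> bool:
    """Allow the read only if the calling process is the same binary
    that originally created the file. In a FUSE read handler, a False
    here would translate to returning EACCES to the caller."""
    try:
        return resolve(caller_pid) == creator_exe
    except OSError:
        # Process already gone, or /proc unreadable: deny by default.
        return False
```

Comparing executable paths like this is deliberately simplistic - the point of the toy is the policy hook, not the identity check.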
-
This is obviously not a strong security barrier - the tokens still exist on the system, and malware would still be able to grab them. But it makes it much less likely that tokens will be accidentally exfiltrated, and I get to sleep easier.
-
@mjg59 Without thinking about it too hard, is this possible to implement within an SELinux policy instead?
-
@condret Because it's literally my job?
-
@ss23 Oh yes, but that's a massively more complicated thing to deploy
-
@mjg59 It feels like security has gone through a bit of a midwit meme:
[low IQ] If we make the tokens only accessible to the binary that created them, it will be secure!
[medium IQ] But an attacker could easily get them via a zillion other pathways, a partial security barrier is worse than no barrier because of the false sense of security!!!!!!
[high IQ] It will provide some level of protection from generic malware, and also increase the chance our D&R can notice a malicious actor!
-
@mjg59 HSM except make it software?
-
Ok! On the topic of FUSE crimes. A problem I face in my job is that we have widespread deployment of agentic workflows under a variety of harnesses, many of which end up running CLI tools that hit API endpoints. This means the CLI tools need to be able to authenticate to the API, and the standard approach for this is to have some sort of bearer token that sits on disk. These tokens aren't bound to the underlying device in any way - anyone who has the token has the level of access granted to it.
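To make the failure mode concrete, here's a minimal sketch of what that standard approach amounts to - the token is just a string in a file, and possession equals access (the path and header shape are generic assumptions, not any specific tool):

```python
from pathlib import Path

def build_request_headers(token_path: Path) -> dict:
    """Read a bearer token from disk and build the auth header the way
    a typical CLI tool does. Nothing binds the token to this device:
    any process (including an LLM agent) that can read the file gets
    exactly the same level of access."""
    token = token_path.read_text().strip()
    return {"Authorization": f"Bearer {token}"}
```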
@mjg59 That's definitely a tough problem. One, too, where the ideal solution ends up being different depending on the endpoints, auth mechanisms, etc.