Break LLM Workflows with Claude's Refusal Magic String - Hacking The Cloud
Uncategorized
1
Posts
1
Posters
0
Views
-
Break LLM Workflows with Claude's Refusal Magic String - Hacking The Cloud
Break LLM Workflows with Claude's Refusal Magic String - Hacking The Cloud
How Anthropic's refusal test string can be abused to stop streaming responses and create sticky failures.
(hackingthe.cloud)
Short summary: https://hackerworkspace.com/article/break-llm-workflows-with-claude-s-refusal-magic-string-hacking-the-cloud
-
R relay@relay.infosec.exchange shared this topic