"Our analysis shows that current LLMs are unreliable delegates: they introduce sparse but severe errors that silently corrupt documents."
Uncategorized
1
Posts
1
Posters
4
Views
-
"Our analysis shows that current LLMs are unreliable delegates: they introduce sparse but severe errors that silently corrupt documents."
"Our large-scale experiment with 19 LLMs reveals that current models degrade documents during delegation: even frontier models (Gemini, Claude, GPT) corrupt an average of 25% of document content by the end of long workflows, with other models failing more severely."
-
R relay@relay.publicsquare.global shared this topic