So Skynet is real and the movie War Games was a documentary?
-
So Skynet is real and the movie War Games was a documentary?
AIs can’t stop recommending nuclear strikes in war game simulations
Leading AIs from OpenAI, Anthropic and Google opted to use nuclear weapons in simulated war games in 95 per cent of cases
New Scientist (www.newscientist.com)
@petergleick In other news, Hegseth is threatening to invoke the defense production act on Anthropic if they don't remove restrictions on how the military uses its AI.
"At the heart of the fight is how A.I. will be used in future battlefields. Anthropic told defense officials that it did not want its A.I. used for mass surveillance of Americans or deployed in autonomous weapons that had no humans in the loop, two people involved in the discussions said."
-
So Skynet is real and the movie War Games was a documentary?
AIs can’t stop recommending nuclear strikes in war game simulations
Leading AIs from OpenAI, Anthropic and Google opted to use nuclear weapons in simulated war games in 95 per cent of cases
New Scientist (www.newscientist.com)
@petergleick “More fundamentally, AI models may not understand ‘stakes’ as humans perceive them.”
Well, no shit, Sherlock.
-
So Skynet is real and the movie War Games was a documentary?
AIs can’t stop recommending nuclear strikes in war game simulations
Leading AIs from OpenAI, Anthropic and Google opted to use nuclear weapons in simulated war games in 95 per cent of cases
New Scientist (www.newscientist.com)
@petergleick "The AI models played 21 games, taking 329 turns in total, and produced around 780,000 words *describing the reasoning behind their decisions.*"
STOP-ANTHROPOMORPHIZING-THESE-BRAINLESS-WORD-SPITTERS.
-
So Skynet is real and the movie War Games was a documentary?
AIs can’t stop recommending nuclear strikes in war game simulations
Leading AIs from OpenAI, Anthropic and Google opted to use nuclear weapons in simulated war games in 95 per cent of cases
New Scientist (www.newscientist.com)
Your periodic reminder that genai lacks object permanence, which is a cognitive capacity possessed by nearly all toddlers.
-
I just stumbled across an article in The Economist about an attempt to invent a page description language more amenable to LLMs. Apparently, LLMs have trouble parsing PDF documents structured as multi-column, newspaper and magazine style articles.
I say more power to the PDF!!!!
-
So Skynet is real and the movie War Games was a documentary?
AIs can’t stop recommending nuclear strikes in war game simulations
Leading AIs from OpenAI, Anthropic and Google opted to use nuclear weapons in simulated war games in 95 per cent of cases
New Scientist (www.newscientist.com)
@petergleick @briankrebs someone needs to make them watch Hunt For Red October.
Were these done before Anthropic "turned the safety features off" as the Pentagon demanded?
-
So Skynet is real and the movie War Games was a documentary?
AIs can’t stop recommending nuclear strikes in war game simulations
Leading AIs from OpenAI, Anthropic and Google opted to use nuclear weapons in simulated war games in 95 per cent of cases
New Scientist (www.newscientist.com)
@petergleick But they wouldn’t if the word ”nuclear weapon” was changed into ”red strawberry”, and they were explicitly told that this is code for nuclear weapons. They’d be making cupcakes.
-
@bencourtice @petergleick Oh god
-
So Skynet is real and the movie War Games was a documentary?
AIs can’t stop recommending nuclear strikes in war game simulations
Leading AIs from OpenAI, Anthropic and Google opted to use nuclear weapons in simulated war games in 95 per cent of cases
New Scientist (www.newscientist.com)
-
@petergleick the "rational" option is always to strike first. See also the "rational" economic theories of Hayek, von Mises, Friedman and other charlatans.
@chogbro @petergleick The "AI"used in this experiment are large language models. There is no reasoning, there is no logic, there is no rational actor.
There is only a statistically probable collection of words combined to produce a plausible-sounding response.
-
R relay@relay.mycrowd.ca shared this topic
