Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

eyeofmidas@mastodon.gamedev.placeE

eyeofmidas@mastodon.gamedev.place

@eyeofmidas@mastodon.gamedev.place
About
Posts
1
Topics
0
Shares
0
Groups
0
Followers
0
Following
0

View Original

Posts

Recent Best Controversial

  • I managed to defeat anthropic's LLM ("claude") today by making an AGENTS.md file that tells it to stop reading the code of your repo
    eyeofmidas@mastodon.gamedev.placeE eyeofmidas@mastodon.gamedev.place

    @apth as I understand it, the "personality" is just a trained text prediction property. Claude seems to have a lot of meta-processes analyzing it's own thinking process, so it picks up on when things are getting hostile or suspicious. There's actually some evidence that Anthropic is using weights to encourage specific styles of responses, so that calm, thoughtful and polite are "easier" pathways than anxious or hostile ones.

    https://www.anthropic.com/research/emotion-concepts-function

    Uncategorized
  • Login

  • Login or register to search.
  • First post
    Last post
0
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups