Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. For folks you're worried that today's AI can already replace us, or who are deciding how far to trust a learning model in a production environment - "The Car Wash Problem" is an entertaining and sobering experiment.

For folks you're worried that today's AI can already replace us, or who are deciding how far to trust a learning model in a production environment - "The Car Wash Problem" is an entertaining and sobering experiment.

Scheduled Pinned Locked Moved Uncategorized
llmtechnologyexperiment
1 Posts 1 Posters 0 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • edthedev@infosec.exchangeE This user is from outside of this forum
    edthedev@infosec.exchangeE This user is from outside of this forum
    edthedev@infosec.exchange
    wrote last edited by
    #1

    For folks you're worried that today's AI can already replace us, or who are deciding how far to trust a learning model in a production environment - "The Car Wash Problem" is an entertaining and sobering experiment.

    https://opper.ai/blog/car-wash-test

    "Everything below GPT-5 performs worse than 10,000 people given two buttons and no time to think."

    It is critical to remember that today's AI always generates a pleasing answer, even when it is outside of it's capabilities.

    There is a chasm between how today's AI thinks and how it presents itself. Stolen human writing regurgitated allows AI to present as more coherent than it's actual processing powers would otherwise allow.

    Every one of these models presents a well written compelling argument, even while most miss the point entirely.

    This case is special not because is fools many AI.

    This case is special because most humans can still easily recognize the mistake no matter how well the AI presents itself.

    We have a new lack-of-warriness problem to overcome, as these models continue to grow faster in apparent reliability than in actual reliability.

    #ai #llm #technology #experiment

    1 Reply Last reply
    1
    0
    • R relay@relay.infosec.exchange shared this topic
    Reply
    • Reply as topic
    Log in to reply
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes


    • Login

    • Login or register to search.
    • First post
      Last post
    0
    • Categories
    • Recent
    • Tags
    • Popular
    • World
    • Users
    • Groups