Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. I use AI daily for coding, but I'm confused about one thing.

I use AI daily for coding, but I'm confused about one thing.

Scheduled Pinned Locked Moved Uncategorized
2 Posts 2 Posters 0 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • vitobotta@mastodon.socialV This user is from outside of this forum
    vitobotta@mastodon.socialV This user is from outside of this forum
    vitobotta@mastodon.social
    wrote last edited by
    #1

    I use AI daily for coding, but I'm confused about one thing. If these models just run inference on frozen weights, they aren't really learning, right? It feels like working with an expert who has constant amnesia. Is AGI even possible if the model never truly updates its understanding?

    pointlessone@status.pointless.oneP 1 Reply Last reply
    0
    • vitobotta@mastodon.socialV vitobotta@mastodon.social

      I use AI daily for coding, but I'm confused about one thing. If these models just run inference on frozen weights, they aren't really learning, right? It feels like working with an expert who has constant amnesia. Is AGI even possible if the model never truly updates its understanding?

      pointlessone@status.pointless.oneP This user is from outside of this forum
      pointlessone@status.pointless.oneP This user is from outside of this forum
      pointlessone@status.pointless.one
      wrote last edited by
      #2

      @vitobotta models don’t update on each request but a lot of the requests and responses are collected for the next training round. Not all of it goes into training but it gives signal for what responses are good and what are bad.

      Also I don’t think any of the experts believe transformer architecture (what almost all current LLMs are) is the path to AGI. It mostly CEOs who talk about AGI right behind the corner.

      1 Reply Last reply
      1
      0
      • R relay@relay.mycrowd.ca shared this topic
      Reply
      • Reply as topic
      Log in to reply
      • Oldest to Newest
      • Newest to Oldest
      • Most Votes


      • Login

      • Login or register to search.
      • First post
        Last post
      0
      • Categories
      • Recent
      • Tags
      • Popular
      • World
      • Users
      • Groups