Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. I can't stop thinking about the LLM-generated compiler that passes all the unit tests but emits inner loops that benchmark over 150,000x slower than a gcc debug build.

I can't stop thinking about the LLM-generated compiler that passes all the unit tests but emits inner loops that benchmark over 150,000x slower than a gcc debug build.

Scheduled Pinned Locked Moved Uncategorized
10 Posts 7 Posters 0 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • 0xabad1dea@infosec.exchange0 This user is from outside of this forum
    0xabad1dea@infosec.exchange0 This user is from outside of this forum
    0xabad1dea@infosec.exchange
    wrote last edited by
    #1

    I can't stop thinking about the LLM-generated compiler that passes all the unit tests but emits inner loops that benchmark over 150,000x slower than a gcc debug build. I couldn't possibly have intentionally come up with such a funny demonstration of the point of genuine expertise https://harshanu.space/en/tech/ccc-vs-gcc/

    dysfun@social.treehouse.systemsD lesley@mastodon.gamedev.placeL nina_kali_nina@tech.lgbtN jnpn@mastodon.socialJ thing@plasmatrap.comT 5 Replies Last reply
    0
    • 0xabad1dea@infosec.exchange0 0xabad1dea@infosec.exchange

      I can't stop thinking about the LLM-generated compiler that passes all the unit tests but emits inner loops that benchmark over 150,000x slower than a gcc debug build. I couldn't possibly have intentionally come up with such a funny demonstration of the point of genuine expertise https://harshanu.space/en/tech/ccc-vs-gcc/

      dysfun@social.treehouse.systemsD This user is from outside of this forum
      dysfun@social.treehouse.systemsD This user is from outside of this forum
      dysfun@social.treehouse.systems
      wrote last edited by
      #2

      @0xabad1dea wait what, i missed the 150k slower thing

      1 Reply Last reply
      0
      • dysfun@social.treehouse.systemsD This user is from outside of this forum
        dysfun@social.treehouse.systemsD This user is from outside of this forum
        dysfun@social.treehouse.systems
        wrote last edited by
        #3

        @0xabad1dea amazing

        1 Reply Last reply
        0
        • 0xabad1dea@infosec.exchange0 0xabad1dea@infosec.exchange

          I can't stop thinking about the LLM-generated compiler that passes all the unit tests but emits inner loops that benchmark over 150,000x slower than a gcc debug build. I couldn't possibly have intentionally come up with such a funny demonstration of the point of genuine expertise https://harshanu.space/en/tech/ccc-vs-gcc/

          lesley@mastodon.gamedev.placeL This user is from outside of this forum
          lesley@mastodon.gamedev.placeL This user is from outside of this forum
          lesley@mastodon.gamedev.place
          wrote last edited by
          #4

          @0xabad1dea I have a feeling that this writing replies on LLM way too much

          0xabad1dea@infosec.exchange0 sodiboo@gaysex.cloudS 2 Replies Last reply
          0
          • lesley@mastodon.gamedev.placeL lesley@mastodon.gamedev.place

            @0xabad1dea I have a feeling that this writing replies on LLM way too much

            0xabad1dea@infosec.exchange0 This user is from outside of this forum
            0xabad1dea@infosec.exchange0 This user is from outside of this forum
            0xabad1dea@infosec.exchange
            wrote last edited by
            #5

            @lesley sometimes I feel like the only person in tech who knows how to write three consecutive paragraphs all by herself

            1 Reply Last reply
            0
            • lesley@mastodon.gamedev.placeL lesley@mastodon.gamedev.place

              @0xabad1dea I have a feeling that this writing replies on LLM way too much

              sodiboo@gaysex.cloudS This user is from outside of this forum
              sodiboo@gaysex.cloudS This user is from outside of this forum
              sodiboo@gaysex.cloud
              wrote last edited by
              #6

              @lesley@mastodon.gamedev.place @0xabad1dea@infosec.exchange There's a disclaimer at the bottom of the blog post stating that "The benchmark design, test execution, analysis and writing were done by a human with AI helping where needed."

              1 Reply Last reply
              0
              • 0xabad1dea@infosec.exchange0 0xabad1dea@infosec.exchange

                I can't stop thinking about the LLM-generated compiler that passes all the unit tests but emits inner loops that benchmark over 150,000x slower than a gcc debug build. I couldn't possibly have intentionally come up with such a funny demonstration of the point of genuine expertise https://harshanu.space/en/tech/ccc-vs-gcc/

                nina_kali_nina@tech.lgbtN This user is from outside of this forum
                nina_kali_nina@tech.lgbtN This user is from outside of this forum
                nina_kali_nina@tech.lgbt
                wrote last edited by
                #7

                @0xabad1dea makes two of us. The CCC isn't the flex AI proponents think it is, but there aren't enough people who can understand that it should have been a cautionary tale rather than a sensational headline. 😞

                nina_kali_nina@tech.lgbtN 1 Reply Last reply
                0
                • 0xabad1dea@infosec.exchange0 0xabad1dea@infosec.exchange

                  I can't stop thinking about the LLM-generated compiler that passes all the unit tests but emits inner loops that benchmark over 150,000x slower than a gcc debug build. I couldn't possibly have intentionally come up with such a funny demonstration of the point of genuine expertise https://harshanu.space/en/tech/ccc-vs-gcc/

                  jnpn@mastodon.socialJ This user is from outside of this forum
                  jnpn@mastodon.socialJ This user is from outside of this forum
                  jnpn@mastodon.social
                  wrote last edited by
                  #8

                  @0xabad1dea very interesting read

                  1 Reply Last reply
                  0
                  • nina_kali_nina@tech.lgbtN nina_kali_nina@tech.lgbt

                    @0xabad1dea makes two of us. The CCC isn't the flex AI proponents think it is, but there aren't enough people who can understand that it should have been a cautionary tale rather than a sensational headline. 😞

                    nina_kali_nina@tech.lgbtN This user is from outside of this forum
                    nina_kali_nina@tech.lgbtN This user is from outside of this forum
                    nina_kali_nina@tech.lgbt
                    wrote last edited by
                    #9

                    @0xabad1dea like, I'll bait; great stuff, unsupervised agent produced something that can compile some C code that in a certain definition can be called "working", but absolutely not ready for any sort of production usage.
                    The agent has multiple reference implementations, extensive testing suite, and C is literally based on an extremely well defined standard. AI proponents claim that we're in an era where all we need is to provide a specification, and the agents will just implement the thing for us. This CCC thing is proof that they quite literally can't; it's difficult to think about a commercial software project that would have a specification better defined than the C standard. And a vanilla C compiler isn't all _that_ complicated, it's literally the kind of thing many undergrad SWE students build as a student project (yes yes lots of caveats and simplifications). You'd think Anthropic could improve on their CCC with the agents until they get the compiler working at least as well as the tcc would, but 1/2

                    1 Reply Last reply
                    1
                    0
                    • 0xabad1dea@infosec.exchange0 0xabad1dea@infosec.exchange

                      I can't stop thinking about the LLM-generated compiler that passes all the unit tests but emits inner loops that benchmark over 150,000x slower than a gcc debug build. I couldn't possibly have intentionally come up with such a funny demonstration of the point of genuine expertise https://harshanu.space/en/tech/ccc-vs-gcc/

                      thing@plasmatrap.comT This user is from outside of this forum
                      thing@plasmatrap.comT This user is from outside of this forum
                      thing@plasmatrap.com
                      wrote last edited by
                      #10

                      @0xabad1dea@infosec.exchange claude has a fucking compiler.
                      what the fuck.
                      are we vibecompiling alongside vibecoding now
                      ​​

                      1 Reply Last reply
                      0
                      • R relay@relay.infosec.exchange shared this topic
                      Reply
                      • Reply as topic
                      Log in to reply
                      • Oldest to Newest
                      • Newest to Oldest
                      • Most Votes


                      • Login

                      • Login or register to search.
                      • First post
                        Last post
                      0
                      • Categories
                      • Recent
                      • Tags
                      • Popular
                      • World
                      • Users
                      • Groups