Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. i'm at a loss of words after reading a paper about reformatting code using an ML model that has a measured statistical quantity A_c which says how often the reformatted code behaves the same as the original

i'm at a loss of words after reading a paper about reformatting code using an ML model that has a measured statistical quantity A_c which says how often the reformatted code behaves the same as the original

Scheduled Pinned Locked Moved Uncategorized
140 Posts 61 Posters 0 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • whitequark@social.treehouse.systemsW whitequark@social.treehouse.systems

    i'm at a loss of words after reading a paper about reformatting code using an ML model that has a measured statistical quantity A_c which says how often the reformatted code behaves the same as the original

    the "ideal" (their choice of words) case is 64.2%

    fuzzyfuzzyfungus@cyberplace.socialF This user is from outside of this forum
    fuzzyfuzzyfungus@cyberplace.socialF This user is from outside of this forum
    fuzzyfuzzyfungus@cyberplace.social
    wrote last edited by
    #20

    @whitequark "Code style generally does not interfere
    with the code semantics and executability"; but we present novel methods for it to do so!

    1 Reply Last reply
    0
    • whitequark@social.treehouse.systemsW whitequark@social.treehouse.systems

      this isn't satire, this is real research published by IEEE/ACM

      disorderlyf@todon.euD This user is from outside of this forum
      disorderlyf@todon.euD This user is from outside of this forum
      disorderlyf@todon.eu
      wrote last edited by
      #21

      @whitequark So let me get this straight, IEEE thinks you should count it as a win if rewriting your code by vibing it has less than 15% better odds than a literal coinflip of reproducibility?

      edited for clarity and to fix a typo

      urixturing@hachyderm.ioU sammy@cherrykitten.gayS 2 Replies Last reply
      0
      • whitequark@social.treehouse.systemsW whitequark@social.treehouse.systems

        this isn't satire, this is real research published by IEEE/ACM

        deborahh@cosocial.caD This user is from outside of this forum
        deborahh@cosocial.caD This user is from outside of this forum
        deborahh@cosocial.ca
        wrote last edited by
        #22

        @whitequark @danlyke so … by "reformatted" I assume you mean aesthetically tidied up, with no change in functionality required?

        If I got that right: wtf?

        whitequark@social.treehouse.systemsW mrkeen@mastodon.socialM 2 Replies Last reply
        0
        • whitequark@social.treehouse.systemsW whitequark@social.treehouse.systems

          i'm at a loss of words after reading a paper about reformatting code using an ML model that has a measured statistical quantity A_c which says how often the reformatted code behaves the same as the original

          the "ideal" (their choice of words) case is 64.2%

          ireneista@adhd.irenes.spaceI This user is from outside of this forum
          ireneista@adhd.irenes.spaceI This user is from outside of this forum
          ireneista@adhd.irenes.space
          wrote last edited by
          #23

          @whitequark compare and contrast the Extreme Programming philosophy, in which a code change doesn't count as "refactoring" unless all observable behavior is identical

          krans@mastodon.me.ukK whitequark@social.treehouse.systemsW 2 Replies Last reply
          0
          • deborahh@cosocial.caD deborahh@cosocial.ca

            @whitequark @danlyke so … by "reformatted" I assume you mean aesthetically tidied up, with no change in functionality required?

            If I got that right: wtf?

            whitequark@social.treehouse.systemsW This user is from outside of this forum
            whitequark@social.treehouse.systemsW This user is from outside of this forum
            whitequark@social.treehouse.systems
            wrote last edited by
            #24

            @deborahh @danlyke this is what a reasonable person would understand to be "code style", yes

            nxskok@cupoftea.socialN 1 Reply Last reply
            0
            • whitequark@social.treehouse.systemsW whitequark@social.treehouse.systems

              i'm at a loss of words after reading a paper about reformatting code using an ML model that has a measured statistical quantity A_c which says how often the reformatted code behaves the same as the original

              the "ideal" (their choice of words) case is 64.2%

              lu_leipzig@troet.cafeL This user is from outside of this forum
              lu_leipzig@troet.cafeL This user is from outside of this forum
              lu_leipzig@troet.cafe
              wrote last edited by
              #25

              @whitequark And this is how research money is lit on fire, I guess. Why else conduct research into ML for a task that has had obvious, deterministic, efficient and well-tested solutions for decades?

              whitequark@social.treehouse.systemsW srazkvt@tech.lgbtS 2 Replies Last reply
              0
              • whitequark@social.treehouse.systemsW whitequark@social.treehouse.systems

                @porglezomp you'll love Fig. 6

                fuzzyfuzzyfungus@cyberplace.socialF This user is from outside of this forum
                fuzzyfuzzyfungus@cyberplace.socialF This user is from outside of this forum
                fuzzyfuzzyfungus@cyberplace.social
                wrote last edited by
                #26

                @whitequark @porglezomp This looks like it could join the current crop of "DLSS5 off/DLSS5 on" memes.

                1 Reply Last reply
                0
                • ireneista@adhd.irenes.spaceI ireneista@adhd.irenes.space

                  @whitequark compare and contrast the Extreme Programming philosophy, in which a code change doesn't count as "refactoring" unless all observable behavior is identical

                  krans@mastodon.me.ukK This user is from outside of this forum
                  krans@mastodon.me.ukK This user is from outside of this forum
                  krans@mastodon.me.uk
                  wrote last edited by
                  #27

                  @ireneista TIL that my philosophy is the same as the Extreme Programming philosophy

                  @whitequark

                  ireneista@adhd.irenes.spaceI 1 Reply Last reply
                  0
                  • ireneista@adhd.irenes.spaceI ireneista@adhd.irenes.space

                    @whitequark compare and contrast the Extreme Programming philosophy, in which a code change doesn't count as "refactoring" unless all observable behavior is identical

                    whitequark@social.treehouse.systemsW This user is from outside of this forum
                    whitequark@social.treehouse.systemsW This user is from outside of this forum
                    whitequark@social.treehouse.systems
                    wrote last edited by
                    #28

                    @ireneista i like how it starts with this (left) and ends with "here is a variable we think would be good here. Do you like this" (right)

                    Link Preview ImageLink Preview Image
                    whitequark@social.treehouse.systemsW 1 Reply Last reply
                    0
                    • whitequark@social.treehouse.systemsW whitequark@social.treehouse.systems

                      @ireneista i like how it starts with this (left) and ends with "here is a variable we think would be good here. Do you like this" (right)

                      Link Preview ImageLink Preview Image
                      whitequark@social.treehouse.systemsW This user is from outside of this forum
                      whitequark@social.treehouse.systemsW This user is from outside of this forum
                      whitequark@social.treehouse.systems
                      wrote last edited by
                      #29

                      @ireneista starting with "gotofail bad" and ending with making the problem significantly worse, apparently without ever reflecting on this

                      ireneista@adhd.irenes.spaceI 1 Reply Last reply
                      0
                      • lu_leipzig@troet.cafeL lu_leipzig@troet.cafe

                        @whitequark And this is how research money is lit on fire, I guess. Why else conduct research into ML for a task that has had obvious, deterministic, efficient and well-tested solutions for decades?

                        whitequark@social.treehouse.systemsW This user is from outside of this forum
                        whitequark@social.treehouse.systemsW This user is from outside of this forum
                        whitequark@social.treehouse.systems
                        wrote last edited by
                        #30

                        @lu_leipzig I actually really don't like formatters like black or rustfmt which is why I'm collaborating on research into doing it with ML, but there are ways to do it that never produce a different AST

                        lu_leipzig@troet.cafeL argv_minus_one@mastodon.sdf.orgA 2 Replies Last reply
                        0
                        • whitequark@social.treehouse.systemsW whitequark@social.treehouse.systems

                          i'm at a loss of words after reading a paper about reformatting code using an ML model that has a measured statistical quantity A_c which says how often the reformatted code behaves the same as the original

                          the "ideal" (their choice of words) case is 64.2%

                          going_to_maine@mastodon.socialG This user is from outside of this forum
                          going_to_maine@mastodon.socialG This user is from outside of this forum
                          going_to_maine@mastodon.social
                          wrote last edited by
                          #31

                          @whitequark so excited about astral being acquired...

                          1 Reply Last reply
                          0
                          • whitequark@social.treehouse.systemsW whitequark@social.treehouse.systems

                            i'm at a loss of words after reading a paper about reformatting code using an ML model that has a measured statistical quantity A_c which says how often the reformatted code behaves the same as the original

                            the "ideal" (their choice of words) case is 64.2%

                            theorangetheme@en.osm.townT This user is from outside of this forum
                            theorangetheme@en.osm.townT This user is from outside of this forum
                            theorangetheme@en.osm.town
                            wrote last edited by
                            #32

                            @whitequark That's it, these people lose their computer privileges until they take some undergraduate CS theory classes.

                            whitequark@social.treehouse.systemsW 1 Reply Last reply
                            0
                            • whitequark@social.treehouse.systemsW whitequark@social.treehouse.systems

                              @lu_leipzig I actually really don't like formatters like black or rustfmt which is why I'm collaborating on research into doing it with ML, but there are ways to do it that never produce a different AST

                              lu_leipzig@troet.cafeL This user is from outside of this forum
                              lu_leipzig@troet.cafeL This user is from outside of this forum
                              lu_leipzig@troet.cafe
                              wrote last edited by
                              #33

                              @whitequark oh, interesting, what do you not like about them? I could imagine a ML model would do a decent job deciding between n equivalent deterministically produced ASTs that vary e.g. w.r.t. indentation on multi-line definitions/calls.

                              whitequark@social.treehouse.systemsW 1 Reply Last reply
                              0
                              • theorangetheme@en.osm.townT theorangetheme@en.osm.town

                                @whitequark That's it, these people lose their computer privileges until they take some undergraduate CS theory classes.

                                whitequark@social.treehouse.systemsW This user is from outside of this forum
                                whitequark@social.treehouse.systemsW This user is from outside of this forum
                                whitequark@social.treehouse.systems
                                wrote last edited by
                                #34

                                @theorangetheme both authors are currently full professors i believe

                                1 Reply Last reply
                                0
                                • whitequark@social.treehouse.systemsW whitequark@social.treehouse.systems

                                  @ireneista starting with "gotofail bad" and ending with making the problem significantly worse, apparently without ever reflecting on this

                                  ireneista@adhd.irenes.spaceI This user is from outside of this forum
                                  ireneista@adhd.irenes.spaceI This user is from outside of this forum
                                  ireneista@adhd.irenes.space
                                  wrote last edited by
                                  #35

                                  @whitequark because "the thing we're promoting is incredibly dangerous, and not in fun ways" is not really the thing anyone wants to be cited for

                                  geoffwozniak@masto.hackers.townG 1 Reply Last reply
                                  0
                                  • lu_leipzig@troet.cafeL lu_leipzig@troet.cafe

                                    @whitequark oh, interesting, what do you not like about them? I could imagine a ML model would do a decent job deciding between n equivalent deterministically produced ASTs that vary e.g. w.r.t. indentation on multi-line definitions/calls.

                                    whitequark@social.treehouse.systemsW This user is from outside of this forum
                                    whitequark@social.treehouse.systemsW This user is from outside of this forum
                                    whitequark@social.treehouse.systems
                                    wrote last edited by
                                    #36

                                    @lu_leipzig I view code as art and so any tool that puts determinism strictly above aesthetics is a net negative to my craft

                                    theeclecticdyslexic@mstdn.socialT 1 Reply Last reply
                                    0
                                    • whitequark@social.treehouse.systemsW whitequark@social.treehouse.systems

                                      @lu_leipzig I actually really don't like formatters like black or rustfmt which is why I'm collaborating on research into doing it with ML, but there are ways to do it that never produce a different AST

                                      argv_minus_one@mastodon.sdf.orgA This user is from outside of this forum
                                      argv_minus_one@mastodon.sdf.orgA This user is from outside of this forum
                                      argv_minus_one@mastodon.sdf.org
                                      wrote last edited by
                                      #37

                                      @whitequark

                                      Even if the AST is the same, might a sufficiently bad format mislead humans reading the resulting code?

                                      I'm reminded of the Obfuscated C Contest…

                                      @lu_leipzig

                                      1 Reply Last reply
                                      0
                                      • whitequark@social.treehouse.systemsW whitequark@social.treehouse.systems

                                        i'm at a loss of words after reading a paper about reformatting code using an ML model that has a measured statistical quantity A_c which says how often the reformatted code behaves the same as the original

                                        the "ideal" (their choice of words) case is 64.2%

                                        selinica@social.mechsploitation.orgS This user is from outside of this forum
                                        selinica@social.mechsploitation.orgS This user is from outside of this forum
                                        selinica@social.mechsploitation.org
                                        wrote last edited by
                                        #38

                                        @whitequark@social.treehouse.systems I didn't know the ideal number for code to behave differently was over 30% of the time!
                                        Then again, I like and don't mind working with legacy code and systems so I personally tend to wonder "why even redo a working thing"

                                        1 Reply Last reply
                                        0
                                        • xgranade@wandering.shopX xgranade@wandering.shop

                                          @whitequark @porglezomp I'm spitting out my drink at j++ ­→ j--. Holy shit.

                                          robin@gts.icewind.meR This user is from outside of this forum
                                          robin@gts.icewind.meR This user is from outside of this forum
                                          robin@gts.icewind.me
                                          wrote last edited by
                                          #39

                                          @xgranade
                                          I think the right is the output from running the model on the right code (center being the "desired output"). So it's not changing the semantics of the loop, just not not changing the loop order to match their desired outcome.

                                          Given that loop order can have behavioral impact (and I would never trust an LLM to be able to tell if it did), that seems like the correct behavior to me though
                                          @whitequark @porglezomp

                                          whitequark@social.treehouse.systemsW 1 Reply Last reply
                                          0
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Login or register to search.
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • World
                                          • Users
                                          • Groups