Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. i'm at a loss of words after reading a paper about reformatting code using an ML model that has a measured statistical quantity A_c which says how often the reformatted code behaves the same as the original

i'm at a loss of words after reading a paper about reformatting code using an ML model that has a measured statistical quantity A_c which says how often the reformatted code behaves the same as the original

Scheduled Pinned Locked Moved Uncategorized
140 Posts 61 Posters 0 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • robin@gts.icewind.meR robin@gts.icewind.me

    @xgranade
    I think the right is the output from running the model on the right code (center being the "desired output"). So it's not changing the semantics of the loop, just not not changing the loop order to match their desired outcome.

    Given that loop order can have behavioral impact (and I would never trust an LLM to be able to tell if it did), that seems like the correct behavior to me though
    @whitequark @porglezomp

    whitequark@social.treehouse.systemsW This user is from outside of this forum
    whitequark@social.treehouse.systemsW This user is from outside of this forum
    whitequark@social.treehouse.systems
    wrote last edited by
    #44

    @robin @xgranade @porglezomp oh you're right

    1 Reply Last reply
    0
    • theeclecticdyslexic@mstdn.socialT theeclecticdyslexic@mstdn.social

      @whitequark @lu_leipzig Ideally, I think a formatter that learns how I formatted the rest of the buffer would be the goal.

      Most of the time I like the deterministic formatting. However, I find deterministic formatting fails me around function headers and long function calls / long boolean statements.

      I want it to do the deterministic formatting once, and then if I undo immediately, don't do it again to that area... and preferably learn what I was trying to do.

      whitequark@social.treehouse.systemsW This user is from outside of this forum
      whitequark@social.treehouse.systemsW This user is from outside of this forum
      whitequark@social.treehouse.systems
      wrote last edited by
      #45

      @theeclecticdyslexic @lu_leipzig my goal is to be able to run a command on a patch that formats the added lines "more or less like the rest of the file"

      theeclecticdyslexic@mstdn.socialT 1 Reply Last reply
      0
      • whitequark@social.treehouse.systemsW whitequark@social.treehouse.systems

        i'm at a loss of words after reading a paper about reformatting code using an ML model that has a measured statistical quantity A_c which says how often the reformatted code behaves the same as the original

        the "ideal" (their choice of words) case is 64.2%

        snowyfox@deadinsi.deS This user is from outside of this forum
        snowyfox@deadinsi.deS This user is from outside of this forum
        snowyfox@deadinsi.de
        wrote last edited by
        #46

        Figure in question seems to be about "model performing in its ideal conditions"

        The author's actual opinion is implied in the Results:

        "After inspecting the compilation checking module, we found that DUET CS achieves 55.8% computational accuracy, which is a practical metric for a code generation system. This result shows that more than half of the output code are compilable and implement the same function as the input code. The user can
        use this check as an optional layer of the pipeline to guarantee grammar correctness.
        ...
        We found that even the non-compilable outputs display around 60% similarity to the ground truth, which means even if DUET CS cannot always produce grammar-correct code, it can still provide valuable information to help user to transfer code style.
        ...
        Notice, that generally the task of generating the exact same code as ground truth is very hard, especially when the code length is rather long (˜47 lines)."

        snowyfox@deadinsi.deS 1 Reply Last reply
        0
        • snowyfox@deadinsi.deS snowyfox@deadinsi.de

          Figure in question seems to be about "model performing in its ideal conditions"

          The author's actual opinion is implied in the Results:

          "After inspecting the compilation checking module, we found that DUET CS achieves 55.8% computational accuracy, which is a practical metric for a code generation system. This result shows that more than half of the output code are compilable and implement the same function as the input code. The user can
          use this check as an optional layer of the pipeline to guarantee grammar correctness.
          ...
          We found that even the non-compilable outputs display around 60% similarity to the ground truth, which means even if DUET CS cannot always produce grammar-correct code, it can still provide valuable information to help user to transfer code style.
          ...
          Notice, that generally the task of generating the exact same code as ground truth is very hard, especially when the code length is rather long (˜47 lines)."

          snowyfox@deadinsi.deS This user is from outside of this forum
          snowyfox@deadinsi.deS This user is from outside of this forum
          snowyfox@deadinsi.de
          wrote last edited by
          #47

          That last one is a funny statement because it's laughably easy for a human to maintain the execution of a function after a style refactor. You would reprimand a junior if they couldn't do that

          1 Reply Last reply
          0
          • whitequark@social.treehouse.systemsW whitequark@social.treehouse.systems

            @theeclecticdyslexic @lu_leipzig my goal is to be able to run a command on a patch that formats the added lines "more or less like the rest of the file"

            theeclecticdyslexic@mstdn.socialT This user is from outside of this forum
            theeclecticdyslexic@mstdn.socialT This user is from outside of this forum
            theeclecticdyslexic@mstdn.social
            wrote last edited by
            #48

            @whitequark @lu_leipzig that's a pretty reasonable concept I think.

            I like the idea at least.

            One thing I will say of deterministic formatters is they have changed my habits over time in order to get it to format the way I want. You can take that as both good and bad, but I think most (maybe 60%) of the things they have forced on me have been good.

            Edit: I also get stun locked trying to decide how to format 15 lines of code far less often.

            whitequark@social.treehouse.systemsW 1 Reply Last reply
            0
            • theeclecticdyslexic@mstdn.socialT theeclecticdyslexic@mstdn.social

              @whitequark @lu_leipzig that's a pretty reasonable concept I think.

              I like the idea at least.

              One thing I will say of deterministic formatters is they have changed my habits over time in order to get it to format the way I want. You can take that as both good and bad, but I think most (maybe 60%) of the things they have forced on me have been good.

              Edit: I also get stun locked trying to decide how to format 15 lines of code far less often.

              whitequark@social.treehouse.systemsW This user is from outside of this forum
              whitequark@social.treehouse.systemsW This user is from outside of this forum
              whitequark@social.treehouse.systems
              wrote last edited by
              #49

              @theeclecticdyslexic @lu_leipzig yeah if a formatter requires me to do things I don't want I simply quit using the formatter (and sometimes the codebase)

              burningtyger@nrw.socialB 1 Reply Last reply
              0
              • whitequark@social.treehouse.systemsW whitequark@social.treehouse.systems

                i'm at a loss of words after reading a paper about reformatting code using an ML model that has a measured statistical quantity A_c which says how often the reformatted code behaves the same as the original

                the "ideal" (their choice of words) case is 64.2%

                yvandasilva@hachyderm.ioY This user is from outside of this forum
                yvandasilva@hachyderm.ioY This user is from outside of this forum
                yvandasilva@hachyderm.io
                wrote last edited by
                #50

                @whitequark what the what.

                1 Reply Last reply
                0
                • whitequark@social.treehouse.systemsW whitequark@social.treehouse.systems

                  i'm at a loss of words after reading a paper about reformatting code using an ML model that has a measured statistical quantity A_c which says how often the reformatted code behaves the same as the original

                  the "ideal" (their choice of words) case is 64.2%

                  gudenau@hachyderm.ioG This user is from outside of this forum
                  gudenau@hachyderm.ioG This user is from outside of this forum
                  gudenau@hachyderm.io
                  wrote last edited by
                  #51

                  @whitequark Just like, use one of the tools that already exists? It'll be:
                  - Fast
                  - Cheap
                  - Efficient
                  - Accurate

                  I don't understand any of this "industry" outside of being a massive destructive boondoggle.

                  1 Reply Last reply
                  0
                  • R relay@relay.an.exchange shared this topic
                  • whitequark@social.treehouse.systemsW whitequark@social.treehouse.systems

                    i'm at a loss of words after reading a paper about reformatting code using an ML model that has a measured statistical quantity A_c which says how often the reformatted code behaves the same as the original

                    the "ideal" (their choice of words) case is 64.2%

                    npars01@mstdn.socialN This user is from outside of this forum
                    npars01@mstdn.socialN This user is from outside of this forum
                    npars01@mstdn.social
                    wrote last edited by
                    #52

                    @whitequark

                    Why is AI as a consumer product being rushed out the door so fast? It's obviously not ready for prime time. It's unreliable, inaccurate, and fragile.

                    It's like a car being sent out to car dealerships with only 3 wheels with hasty promises of a future 4th wheel.

                    Possibly the goal isn't a car with 4 wheels but a plan for something else, similar to a Waymo power outage gridlock

                    forbes.com

                    favicon

                    (www.forbes.com)

                    Reliance on AI is a national security risk vulnerable to high fuel prices

                    1/

                    1 Reply Last reply
                    0
                    • whitequark@social.treehouse.systemsW whitequark@social.treehouse.systems

                      i'm at a loss of words after reading a paper about reformatting code using an ML model that has a measured statistical quantity A_c which says how often the reformatted code behaves the same as the original

                      the "ideal" (their choice of words) case is 64.2%

                      budududuroiu@hachyderm.ioB This user is from outside of this forum
                      budududuroiu@hachyderm.ioB This user is from outside of this forum
                      budududuroiu@hachyderm.io
                      wrote last edited by
                      #53

                      @whitequark "92 boosts, 115 favourites" damn

                      I swear to god sometimes Mastodon is just "old person yells at thing".

                      Researchers spend tons of time and money trying to solve Sudoku in polynomial time not because Sudoku is such a important problem to humanity, but because it's a NP-Hard problem, and you can thus reduce all other NP-complete problems to Sudoku and solve them all in polynomial time if you can solve Sudoku in polynomial time.

                      The research challenge is disentangling content from style in a learned embedding space, it's a classic representation learning problem that's genuinely hard: 1) Two functions that do the same thing should have identical content embeddings but different style embeddings, 2) Style must generalise to unseen code patterns, not just pattern-match known rules, 3) It's unsupervised, so there's no labeled (code_A, same_code_in_style_B) training pairs.

                      Code formatting is actually a very good medium to test this hypothesis, because you have an infinite latent space of code that does the exact same thing but is stylistically different.

                      whitequark@social.treehouse.systemsW 1 Reply Last reply
                      0
                      • whitequark@social.treehouse.systemsW whitequark@social.treehouse.systems

                        i'm at a loss of words after reading a paper about reformatting code using an ML model that has a measured statistical quantity A_c which says how often the reformatted code behaves the same as the original

                        the "ideal" (their choice of words) case is 64.2%

                        dch@bsd.networkD This user is from outside of this forum
                        dch@bsd.networkD This user is from outside of this forum
                        dch@bsd.network
                        wrote last edited by
                        #54

                        @whitequark feck that

                        1 Reply Last reply
                        0
                        • whitequark@social.treehouse.systemsW whitequark@social.treehouse.systems

                          @deborahh @danlyke this is what a reasonable person would understand to be "code style", yes

                          nxskok@cupoftea.socialN This user is from outside of this forum
                          nxskok@cupoftea.socialN This user is from outside of this forum
                          nxskok@cupoftea.social
                          wrote last edited by
                          #55

                          @whitequark @deborahh @danlyke ie, the sort of thing a linter does?

                          hennichodernich@radiosocial.deH 1 Reply Last reply
                          0
                          • budududuroiu@hachyderm.ioB budududuroiu@hachyderm.io

                            @whitequark "92 boosts, 115 favourites" damn

                            I swear to god sometimes Mastodon is just "old person yells at thing".

                            Researchers spend tons of time and money trying to solve Sudoku in polynomial time not because Sudoku is such a important problem to humanity, but because it's a NP-Hard problem, and you can thus reduce all other NP-complete problems to Sudoku and solve them all in polynomial time if you can solve Sudoku in polynomial time.

                            The research challenge is disentangling content from style in a learned embedding space, it's a classic representation learning problem that's genuinely hard: 1) Two functions that do the same thing should have identical content embeddings but different style embeddings, 2) Style must generalise to unseen code patterns, not just pattern-match known rules, 3) It's unsupervised, so there's no labeled (code_A, same_code_in_style_B) training pairs.

                            Code formatting is actually a very good medium to test this hypothesis, because you have an infinite latent space of code that does the exact same thing but is stylistically different.

                            whitequark@social.treehouse.systemsW This user is from outside of this forum
                            whitequark@social.treehouse.systemsW This user is from outside of this forum
                            whitequark@social.treehouse.systems
                            wrote last edited by
                            #56

                            @budududuroiu the reason I was reading the paper is because I'm working on the same problem and I think the encoding presented in the paper makes no sense at all to use

                            budududuroiu@hachyderm.ioB 1 Reply Last reply
                            0
                            • whitequark@social.treehouse.systemsW whitequark@social.treehouse.systems

                              @budududuroiu the reason I was reading the paper is because I'm working on the same problem and I think the encoding presented in the paper makes no sense at all to use

                              budududuroiu@hachyderm.ioB This user is from outside of this forum
                              budududuroiu@hachyderm.ioB This user is from outside of this forum
                              budududuroiu@hachyderm.io
                              wrote last edited by
                              #57

                              @whitequark to use for what? It's research, it's not meant to create something for industry use. Academia already suffers from the "File-drawer problem". I also did research on using GANs for Outlier Detection, when most of the time Outlier Detection is a classification problem, not a learned representation problem.

                              whitequark@social.treehouse.systemsW 1 Reply Last reply
                              0
                              • budududuroiu@hachyderm.ioB budududuroiu@hachyderm.io

                                @whitequark to use for what? It's research, it's not meant to create something for industry use. Academia already suffers from the "File-drawer problem". I also did research on using GANs for Outlier Detection, when most of the time Outlier Detection is a classification problem, not a learned representation problem.

                                whitequark@social.treehouse.systemsW This user is from outside of this forum
                                whitequark@social.treehouse.systemsW This user is from outside of this forum
                                whitequark@social.treehouse.systems
                                wrote last edited by
                                #58

                                @budududuroiu yes yes i know you're here because you look at trends and start arguments, now move on to something else and stop wasting my time

                                budududuroiu@hachyderm.ioB 1 Reply Last reply
                                0
                                • whitequark@social.treehouse.systemsW whitequark@social.treehouse.systems

                                  @budududuroiu yes yes i know you're here because you look at trends and start arguments, now move on to something else and stop wasting my time

                                  budududuroiu@hachyderm.ioB This user is from outside of this forum
                                  budududuroiu@hachyderm.ioB This user is from outside of this forum
                                  budududuroiu@hachyderm.io
                                  wrote last edited by
                                  #59

                                  @whitequark lmao, have fun "clowning" on stuff you don't understand

                                  whitequark@social.treehouse.systemsW 1 Reply Last reply
                                  0
                                  • budududuroiu@hachyderm.ioB budududuroiu@hachyderm.io

                                    @whitequark lmao, have fun "clowning" on stuff you don't understand

                                    whitequark@social.treehouse.systemsW This user is from outside of this forum
                                    whitequark@social.treehouse.systemsW This user is from outside of this forum
                                    whitequark@social.treehouse.systems
                                    wrote last edited by
                                    #60

                                    @budududuroiu go take a short walk off a long pier

                                    1 Reply Last reply
                                    0
                                    • whitequark@social.treehouse.systemsW whitequark@social.treehouse.systems

                                      @porglezomp you'll love Fig. 6

                                      rootkitty@yiff.lifeR This user is from outside of this forum
                                      rootkitty@yiff.lifeR This user is from outside of this forum
                                      rootkitty@yiff.life
                                      wrote last edited by
                                      #61

                                      @whitequark "If" right next to "if"

                                      1 Reply Last reply
                                      0
                                      • whitequark@social.treehouse.systemsW whitequark@social.treehouse.systems

                                        i'm at a loss of words after reading a paper about reformatting code using an ML model that has a measured statistical quantity A_c which says how often the reformatted code behaves the same as the original

                                        the "ideal" (their choice of words) case is 64.2%

                                        fibrojedi@gamepad.clubF This user is from outside of this forum
                                        fibrojedi@gamepad.clubF This user is from outside of this forum
                                        fibrojedi@gamepad.club
                                        wrote last edited by
                                        #62

                                        @whitequark if I have understood you correctly, they're saying 64% functional is a satisfactory result?

                                        whitequark@social.treehouse.systemsW 1 Reply Last reply
                                        0
                                        • fibrojedi@gamepad.clubF fibrojedi@gamepad.club

                                          @whitequark if I have understood you correctly, they're saying 64% functional is a satisfactory result?

                                          whitequark@social.treehouse.systemsW This user is from outside of this forum
                                          whitequark@social.treehouse.systemsW This user is from outside of this forum
                                          whitequark@social.treehouse.systems
                                          wrote last edited by
                                          #63

                                          @FibroJedi that's my read of it yeah

                                          fibrojedi@gamepad.clubF 1 Reply Last reply
                                          0
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Login or register to search.
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • World
                                          • Users
                                          • Groups