i'm at a loss for words after reading a paper about reformatting code using an ML model that has a measured statistical quantity A_c which says how often the reformatted code behaves the same as the original

nxskok@cupoftea.social

@whitequark @deborahh @danlyke i.e., the sort of thing a linter does?

whitequark@social.treehouse.systems

@budududuroiu the reason I was reading the paper is because I'm working on the same problem and I think the encoding presented in the paper makes no sense at all to use

budududuroiu@hachyderm.io

@whitequark to use for what? It's research, it's not meant to create something for industry use. Academia already suffers from the "File-drawer problem". I also did research on using GANs for Outlier Detection, when most of the time Outlier Detection is a classification problem, not a learned representation problem.

whitequark@social.treehouse.systems

@budududuroiu yes yes i know you're here because you look at trends and start arguments, now move on to something else and stop wasting my time

budududuroiu@hachyderm.io

@whitequark lmao, have fun "clowning" on stuff you don't understand

whitequark@social.treehouse.systems

@budududuroiu go take a short walk off a long pier

rootkitty@yiff.life

@whitequark "If" right next to "if"

fibrojedi@gamepad.club

@whitequark if I have understood you correctly, they're saying 64% functional is a satisfactory result?

whitequark@social.treehouse.systems

@FibroJedi that's my read of it yeah

me@social.jlamothe.net

@whitequark Whenever I hear about these benchmarks I can't help but wonder how people can say these things with a straight face.

fibrojedi@gamepad.club

@whitequark Maybe they'd like their phone and car 64% functional as a real-world test.

Some of those logic misses/switches are disturbing. I don't know how it's allowable.

If the code works 100%, and "reformatting" it reduces that % then it's wrong by definition.

sabik@rants.au

@xgranade @whitequark @porglezomp
I think reversing the `j` for loop is actually wanted by them? It's labelled "ground truth", and it is a potential valid optimisation

dakangaroo@mastodon.social

@whitequark But... why? Why not just use a linter?

geoffwozniak@masto.hackers.town

@ireneista @whitequark Now, show me the numbers on the effort to make a rule-based style file compared to this. Because I'm sure that A_c is 100.0 in that case.

whitequark@social.treehouse.systems

@DaKangaroo see edit

mirabilos@toot.mirbsd.org

@whitequark I cannot even

mntmn@mastodon.social

@whitequark @porglezomp long live the new flesh

whitequark@social.treehouse.systems

@GeoffWozniak @ireneista so the problem i'm solving is that while for C++, you have tools like clang-format which are nice and flexible, for Rust you have rustfmt which is rigid and makes your code look like ass. I do not like my code looking like ass but I am also receptive to the idea that introducing as many knobs as clang-format has into rustfmt would make it unmaintainable

tunafishtiger@mastodon.online

@whitequark this technology is going to be amazing for the competitive advantage of the few software firms that refuse to use it

dalias@hachyderm.io

@whitequark Saw your edit with the motivation for reading research. I doubt there's anything out there doing this well, but I think the smart approach to doing it well would be to evaluate and score a bunch of candidate standard-class rules across the codebase, solve for a set that maximally approximates what's already there, then apply some sort of pattern learning for the remaining instances that "break the rules", hopefully identifying correlations between them.

Basically, going as far as you can with simple comprehensible deterministic rules before you start throwing magical statistics at it.
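
A minimal sketch of one way to read the "deterministic rules first" idea above, not something taken from the thread or the paper. It scores a few candidate style rules against existing code, keeps the best-fitting option for each, and lists the lines that break the chosen rules (the leftovers a later pattern-learning step would look at). The rule names, the line-based checkers, and `SAMPLE` are all hypothetical; a real tool would work on a parse tree rather than raw lines.

```python
from typing import Callable, Dict, List, Optional, Tuple

# True = line conforms, False = line violates, None = rule does not apply.
Verdict = Optional[bool]
Check = Callable[[str], Verdict]


def brace_same_line(line: str) -> Verdict:
    s = line.strip()
    if not s.endswith("{"):
        return None
    return s != "{"  # opening brace shares the line with other code


def brace_next_line(line: str) -> Verdict:
    s = line.strip()
    if not s.endswith("{"):
        return None
    return s == "{"  # opening brace sits alone on its line


def _leading_ws(line: str) -> str:
    return line[: len(line) - len(line.lstrip())]


def indent_spaces(line: str) -> Verdict:
    ws = _leading_ws(line)
    return None if not ws else set(ws) == {" "}


def indent_tabs(line: str) -> Verdict:
    ws = _leading_ws(line)
    return None if not ws else set(ws) == {"\t"}


# Hypothetical candidate rules: each rule has mutually exclusive options.
CANDIDATES: Dict[str, Dict[str, Check]] = {
    "brace_placement": {"same_line": brace_same_line, "next_line": brace_next_line},
    "indent_char": {"spaces": indent_spaces, "tabs": indent_tabs},
}


def score(check: Check, lines: List[str]) -> float:
    """Fraction of applicable lines that conform to this option."""
    applicable = [v for v in map(check, lines) if v is not None]
    return sum(applicable) / len(applicable) if applicable else 0.0


def fit_rules(lines: List[str]) -> Dict[str, str]:
    """For each rule, pick the option that best matches the existing code."""
    return {
        rule: max(options, key=lambda name: score(options[name], lines))
        for rule, options in CANDIDATES.items()
    }


def exceptions(lines: List[str], chosen: Dict[str, str]) -> List[Tuple[int, str]]:
    """(line index, rule) pairs where the code breaks the chosen option."""
    return [
        (i, rule)
        for i, line in enumerate(lines)
        for rule, option in chosen.items()
        if CANDIDATES[rule][option](line) is False
    ]


# Made-up input: mostly same-line braces and spaces, with two outliers.
SAMPLE = [
    "fn main() {",
    "    let x = 1;",
    "    if x > 0 {",
    "\tprintln!(\"tab-indented outlier\");",
    "    }",
    "    match x",
    "    {",
    "        _ => {}",
    "    }",
    "}",
]

if __name__ == "__main__":
    chosen = fit_rules(SAMPLE)
    print(chosen)
    print(exceptions(SAMPLE, chosen))
```

Everything here is deterministic and inspectable: each chosen rule comes with a score, and only the listed exceptions would be handed to anything statistical.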
