The only code review agent I have ever seen be even remotely good is just Codex xhigh.

15 Posts 5 Posters 0 Views
  • nateberkopec@mastodon.social

    The only code review agent I have ever seen be even remotely good is just Codex xhigh. All the review services (and I've seen at least a dozen at this point) suck so bad that I'm not sure how they make any money at all.

    tsvallender@ruby.social wrote (#3)

    @nateberkopec We have the built-in Copilot review, which spews nonsense ~75% of the time, but in this specific context (code review) it's easy enough to ignore the noise for the value in the few comments left. We're about to migrate to Claude though, which I'm hoping is an improvement.

      pointlessone@status.pointless.one wrote (#4)

      @tsvallender @nateberkopec I’d quit if 3/4 of comments on my PR were useless. And you’re paying for it. Why are you doing it to yourself?

        zenspider@ruby.social wrote (#5)

        @tsvallender @nateberkopec @pointlessone completely agree. I came here to say “do you hear yourself?”. This sounds like Stockholm syndrome.

          nateberkopec@mastodon.social wrote (#6)

          @zenspider @tsvallender @pointlessone it's not an uncommon opinion/situation FWIW among my client base. Drives me absolutely insane, even as an LLM-augmentation booster myself

            tsvallender@ruby.social wrote (#7)

            @nateberkopec @zenspider @pointlessone I really don’t see why. Objectively, it’s prevented bugs shipping and cut down on overall review time by catching some issues before a human review. The cost is a minute or two of the author’s time to scan the comments and quickly resolve the ones that aren’t helpful. I’m not saying it’s perfect; I am saying it has value _in this context_.

              zenspider@ruby.social wrote (#8)

              @nateberkopec @pointlessone @tsvallender were you *that “guy”* on the group project who did zero work, didn’t reveal that until the night before it was due, and then got upset that people were mad at how little you did?

              Otherwise this really sounds like Stockholm syndrome. You’re totally fine with being forcibly paired with an F- student and having to spend your time and effort checking their work instead of being an A student on your own?

                zenspider@ruby.social wrote (#9)

                @nateberkopec @pointlessone @tsvallender (I don’t know how well that translates to the UK. Sorry)

                  tsvallender@ruby.social wrote (#10)

                  @zenspider @nateberkopec @pointlessone Translates fine, but it’s a false analogy. If the F- student made my work better and it took me next to no time to check their work, then that sounds fine. As for your first point, that feels more just like a random insult 🤷

                    pointlessone@status.pointless.one wrote (#11)

                    @tsvallender You say it takes only a minute to scan and resolve the ones that are not helpful, but it still takes time. Ultimately you train yourself to ignore a big chunk of feedback. It’s similar to how you set up monitoring and an overzealous alert that you ignore 3 out of 4 times. It creates noise that you learn to ignore. You think it’s useful once in a while, but you still spend your mental bandwidth on filtering the noise. You may also think that this only happens with AI reviews, but this training totally translates to all other feedback that looks similar, which is all feedback, because it’s in the same place and uses the same UI.

                    @nateberkopec @zenspider

                      tsvallender@ruby.social wrote (#12)

                      @pointlessone @nateberkopec @zenspider

                      If there’s evidence of that I’d be interested, but I don’t think those things are analogous. Alerts are different in that they’re _push_, so they do need to be high value or you will tune them out entirely, agreed. But I _don’t_ think learning to tune out AI noise turns into tuning out humans here; you go into the process with a different headspace (I do, at least). The AI feedback feels like a CI step; the human feedback is a conversation.

                        pointlessone@status.pointless.one wrote (#13)

                        @tsvallender I dunno. Flaky CI doesn’t sound very enticing to me, even if the analogy might be better.

                        I’m curious what kind of feedback you get from AI. Could you give a few examples of feedback you typically ignore and a few of useful comments?

                        @nateberkopec @zenspider

                          tsvallender@ruby.social wrote (#14)

                          @pointlessone

                          Heh, I guess maybe CI isn’t the right analogy either when you put it like that. I’ll try and remember to grab a couple next time I deal with some. I will just reinforce though, I was only arguing that it does have value, not that it’s fantastic!

                            pointlessone@status.pointless.one wrote (#15)

                            @tsvallender I didn’t say it doesn’t. I’m just unconvinced the value is worthwhile.

                            Like, there’s value in asbestos. It’s just that the negative aspects outweigh the benefits. Not saying that AI is full-on asbestos, just a colorful demonstration of the idea.
