Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. The only code review agent I have ever seen be even remotely good is just Codex xhigh.

The only code review agent I have ever seen be even remotely good is just Codex xhigh.

Scheduled Pinned Locked Moved Uncategorized
15 Posts 5 Posters 0 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • nateberkopec@mastodon.socialN nateberkopec@mastodon.social

    The only code review agent I have ever seen be even remotely good is just Codex xhigh. All the review services (and I've seen at least a dozen at this point) suck so bad that I'm not sure how they make any money at all.

    philip@social.simplexity.questP This user is from outside of this forum
    philip@social.simplexity.questP This user is from outside of this forum
    philip@social.simplexity.quest
    wrote last edited by
    #2

    @nateberkopec coderabbit has been worth the money for us. But not by a wide margin.

    1 Reply Last reply
    0
    • nateberkopec@mastodon.socialN nateberkopec@mastodon.social

      The only code review agent I have ever seen be even remotely good is just Codex xhigh. All the review services (and I've seen at least a dozen at this point) suck so bad that I'm not sure how they make any money at all.

      tsvallender@ruby.socialT This user is from outside of this forum
      tsvallender@ruby.socialT This user is from outside of this forum
      tsvallender@ruby.social
      wrote last edited by
      #3

      @nateberkopec We have the built in Copilot review, which spews nonsense ~75% of the time but in this specific context (code review) its easy enough to ignore the noise for the value in the few comments left. We're about to migrate to Claude though, which I'm hoping is an improvement.

      pointlessone@status.pointless.oneP 1 Reply Last reply
      0
      • tsvallender@ruby.socialT tsvallender@ruby.social

        @nateberkopec We have the built in Copilot review, which spews nonsense ~75% of the time but in this specific context (code review) its easy enough to ignore the noise for the value in the few comments left. We're about to migrate to Claude though, which I'm hoping is an improvement.

        pointlessone@status.pointless.oneP This user is from outside of this forum
        pointlessone@status.pointless.oneP This user is from outside of this forum
        pointlessone@status.pointless.one
        wrote last edited by
        #4

        @tsvallender @nateberkopec I’d quit if 3/4 of comments on my PR were useless. And you’re paying for it. Why are you doing it to yourself?

        zenspider@ruby.socialZ 1 Reply Last reply
        0
        • pointlessone@status.pointless.oneP pointlessone@status.pointless.one

          @tsvallender @nateberkopec I’d quit if 3/4 of comments on my PR were useless. And you’re paying for it. Why are you doing it to yourself?

          zenspider@ruby.socialZ This user is from outside of this forum
          zenspider@ruby.socialZ This user is from outside of this forum
          zenspider@ruby.social
          wrote last edited by
          #5

          @tsvallender @nateberkopec @pointlessone completely agree. I came here to say “do you hear yourself?”. This sounds like Stockholm syndrome.

          nateberkopec@mastodon.socialN 1 Reply Last reply
          0
          • zenspider@ruby.socialZ zenspider@ruby.social

            @tsvallender @nateberkopec @pointlessone completely agree. I came here to say “do you hear yourself?”. This sounds like Stockholm syndrome.

            nateberkopec@mastodon.socialN This user is from outside of this forum
            nateberkopec@mastodon.socialN This user is from outside of this forum
            nateberkopec@mastodon.social
            wrote last edited by
            #6

            @zenspider @tsvallender @pointlessone it's not an uncommon opinion/situation FWIW among my client base. Drives me absolutely insane, even as an LLM-augmentation booster myself

            tsvallender@ruby.socialT 1 Reply Last reply
            0
            • nateberkopec@mastodon.socialN nateberkopec@mastodon.social

              @zenspider @tsvallender @pointlessone it's not an uncommon opinion/situation FWIW among my client base. Drives me absolutely insane, even as an LLM-augmentation booster myself

              tsvallender@ruby.socialT This user is from outside of this forum
              tsvallender@ruby.socialT This user is from outside of this forum
              tsvallender@ruby.social
              wrote last edited by
              #7

              @nateberkopec @zenspider @pointlessone I really don’t see why. Objectively, it’s prevented bugs shipping and cut-down on overall review time by catching some issues before a human review. The cost is a minute or two of the author’s time to scan the comments and quickly resolve the ones that aren’t helpful. I’m not saying it’s perfect, I am saying it has value _in this context_.

              zenspider@ruby.socialZ pointlessone@status.pointless.oneP 2 Replies Last reply
              0
              • tsvallender@ruby.socialT tsvallender@ruby.social

                @nateberkopec @zenspider @pointlessone I really don’t see why. Objectively, it’s prevented bugs shipping and cut-down on overall review time by catching some issues before a human review. The cost is a minute or two of the author’s time to scan the comments and quickly resolve the ones that aren’t helpful. I’m not saying it’s perfect, I am saying it has value _in this context_.

                zenspider@ruby.socialZ This user is from outside of this forum
                zenspider@ruby.socialZ This user is from outside of this forum
                zenspider@ruby.social
                wrote last edited by
                #8

                @nateberkopec @pointlessone @tsvallender were you *that “guy”* on the group project who did zero work, didn’t reveal that until the night before it was due, and then got upset that people were mad at how little you did?

                Otherwise this really sounds like Stockholm syndrome. You’re totally fine with being forcibly paired with an F- student and having to spend your time and effort checking their work instead of being an A student on your own?

                zenspider@ruby.socialZ 1 Reply Last reply
                0
                • zenspider@ruby.socialZ zenspider@ruby.social

                  @nateberkopec @pointlessone @tsvallender were you *that “guy”* on the group project who did zero work, didn’t reveal that until the night before it was due, and then got upset that people were mad at how little you did?

                  Otherwise this really sounds like Stockholm syndrome. You’re totally fine with being forcibly paired with an F- student and having to spend your time and effort checking their work instead of being an A student on your own?

                  zenspider@ruby.socialZ This user is from outside of this forum
                  zenspider@ruby.socialZ This user is from outside of this forum
                  zenspider@ruby.social
                  wrote last edited by
                  #9

                  @nateberkopec @pointlessone @tsvallender (I don’t know how well that translates to UK. Sorry)

                  tsvallender@ruby.socialT 1 Reply Last reply
                  0
                  • zenspider@ruby.socialZ zenspider@ruby.social

                    @nateberkopec @pointlessone @tsvallender (I don’t know how well that translates to UK. Sorry)

                    tsvallender@ruby.socialT This user is from outside of this forum
                    tsvallender@ruby.socialT This user is from outside of this forum
                    tsvallender@ruby.social
                    wrote last edited by
                    #10

                    @zenspider @nateberkopec @pointlessone Translates fine, but it’s a false analogy. If the F- student made my work better and it took me next to no time to check their work, then that sounds fine. As for your first point, that feels more just like a random insult 🤷

                    1 Reply Last reply
                    0
                    • tsvallender@ruby.socialT tsvallender@ruby.social

                      @nateberkopec @zenspider @pointlessone I really don’t see why. Objectively, it’s prevented bugs shipping and cut-down on overall review time by catching some issues before a human review. The cost is a minute or two of the author’s time to scan the comments and quickly resolve the ones that aren’t helpful. I’m not saying it’s perfect, I am saying it has value _in this context_.

                      pointlessone@status.pointless.oneP This user is from outside of this forum
                      pointlessone@status.pointless.oneP This user is from outside of this forum
                      pointlessone@status.pointless.one
                      wrote last edited by
                      #11

                      @tsvallender you say it takes only a minute to scan and resolve the ones that are not helpful but it still takes time. Ultimately you train yourself to ignore a big chunk of feedback. It’s similar to how you setup monitoring and an overzealous alert that you ignore 3 out of 4 times. It creates noice that you learn to ignore. You think it’s useful once in a while but you still spend your mental bandwidth on filtering the noice. You also may think that this only happens to AI reviews but this training totally translates to all other feedback that looks similar, which is all feedback because it’s in the same place and uses the same UI.

                      @nateberkopec @zenspider

                      tsvallender@ruby.socialT 1 Reply Last reply
                      0
                      • pointlessone@status.pointless.oneP pointlessone@status.pointless.one

                        @tsvallender you say it takes only a minute to scan and resolve the ones that are not helpful but it still takes time. Ultimately you train yourself to ignore a big chunk of feedback. It’s similar to how you setup monitoring and an overzealous alert that you ignore 3 out of 4 times. It creates noice that you learn to ignore. You think it’s useful once in a while but you still spend your mental bandwidth on filtering the noice. You also may think that this only happens to AI reviews but this training totally translates to all other feedback that looks similar, which is all feedback because it’s in the same place and uses the same UI.

                        @nateberkopec @zenspider

                        tsvallender@ruby.socialT This user is from outside of this forum
                        tsvallender@ruby.socialT This user is from outside of this forum
                        tsvallender@ruby.social
                        wrote last edited by
                        #12

                        @pointlessone @nateberkopec @zenspider

                        If there’s evidence of that I’d be interested, but I don’t think those things are analagous. Alerts are different in that they’re _push_, so they do need to be high value or you will tune them out entirely, agreed. But I _don’t_ think learning to tune out AI noise turns into tuning out humans here, you go into the process with a different headspace (I do, at least). The AI feedback feels like a CI step, the human feedback is a conversation.

                        pointlessone@status.pointless.oneP 1 Reply Last reply
                        0
                        • tsvallender@ruby.socialT tsvallender@ruby.social

                          @pointlessone @nateberkopec @zenspider

                          If there’s evidence of that I’d be interested, but I don’t think those things are analagous. Alerts are different in that they’re _push_, so they do need to be high value or you will tune them out entirely, agreed. But I _don’t_ think learning to tune out AI noise turns into tuning out humans here, you go into the process with a different headspace (I do, at least). The AI feedback feels like a CI step, the human feedback is a conversation.

                          pointlessone@status.pointless.oneP This user is from outside of this forum
                          pointlessone@status.pointless.oneP This user is from outside of this forum
                          pointlessone@status.pointless.one
                          wrote last edited by
                          #13

                          @tsvallender Idunno. Flaky CI doesn’t sound very enticing to me even if analogy might be better.

                          I’m curious what kind of feedback you get from AI. Could you give a few examples of feedback you typically ignore and a few of useful comments?

                          @nateberkopec @zenspider

                          tsvallender@ruby.socialT 1 Reply Last reply
                          0
                          • pointlessone@status.pointless.oneP pointlessone@status.pointless.one

                            @tsvallender Idunno. Flaky CI doesn’t sound very enticing to me even if analogy might be better.

                            I’m curious what kind of feedback you get from AI. Could you give a few examples of feedback you typically ignore and a few of useful comments?

                            @nateberkopec @zenspider

                            tsvallender@ruby.socialT This user is from outside of this forum
                            tsvallender@ruby.socialT This user is from outside of this forum
                            tsvallender@ruby.social
                            wrote last edited by
                            #14

                            @pointlessone

                            Heh, I guess maybe CI isn’t the right analogy either when you put it like that. I’ll try and remember to grab a couple next time I deal with some. I will just reinforce though, I was only arguing that it does have value, not that it’s fantastic!

                            pointlessone@status.pointless.oneP 1 Reply Last reply
                            0
                            • tsvallender@ruby.socialT tsvallender@ruby.social

                              @pointlessone

                              Heh, I guess maybe CI isn’t the right analogy either when you put it like that. I’ll try and remember to grab a couple next time I deal with some. I will just reinforce though, I was only arguing that it does have value, not that it’s fantastic!

                              pointlessone@status.pointless.oneP This user is from outside of this forum
                              pointlessone@status.pointless.oneP This user is from outside of this forum
                              pointlessone@status.pointless.one
                              wrote last edited by
                              #15

                              @tsvallender I didn’t say it doesn’t. I’m just unconvinced the value is worthwhile.

                              Like there’s value in asbestos. It’s just negative aspects outweigh benefits. Not saying that AI is full on asbestos, just a colorful demonstration of the idea.

                              1 Reply Last reply
                              1
                              0
                              • R relay@relay.mycrowd.ca shared this topic
                              Reply
                              • Reply as topic
                              Log in to reply
                              • Oldest to Newest
                              • Newest to Oldest
                              • Most Votes


                              • Login

                              • Login or register to search.
                              • First post
                                Last post
                              0
                              • Categories
                              • Recent
                              • Tags
                              • Popular
                              • World
                              • Users
                              • Groups