Welp, for the first semester ever, SOTA LLMs can do *every single assignment, from scratch (readmes, etc.), and get 100%*.

52 Posts 17 Posters 86 Views
  krismicinski@types.pl wrote:

    Welp, for the first semester ever, SOTA LLMs can do *every single assignment, from scratch (readmes, etc.), and get 100%*. It is frustrating because it would now be quite tough to make it so that LLMs *couldn't* do this without also making it so that the assignments are obfuscated past the point of offering educational value...

    bool@mathstodon.xyz wrote (#5):

    @krismicinski if people don't want to learn, let them; universities are degree factories anyway

      krismicinski@types.pl wrote (#6):

      @bool I hear you on this and mostly I agree with you, but I definitely think we're going to see a big exodus of applicants (we're already seeing this here in CS) when the AI can just do most of the assignments even driven by someone totally unqualified.

        csgordon@discuss.systems wrote (#7):

        @jeremysiek @krismicinski I'm moving all of my classes to required code walks. It has a nice side effect: once you convince your students that the code walk grade doesn't suffer when you find a bug during it, the walk often turns into a design discussion and a conversation about debugging strategies. I think the students also feel like their hard work is more appreciated after they actually get to chat with faculty about it.

        Has worked wonderfully in undergrad compilers and grad distributed systems.

        Hard to scale past 40 students though

          krismicinski@types.pl wrote (#8):

          @jeremysiek fully-automated natural-language-based software engineering is here, in a big, big way. I think it has the potential to be massively powerful in the right circumstances, but I noticed a major shift around December.

            lindsey@recurse.social wrote (#9):

            @csgordon @jeremysiek @krismicinski Yeah, I tried to do this in my PL course with ~150 students, and it was a massive logistical headache and left me and all the course staff exhausted. I would love to do it, but it's not feasible with the enrollments we have.

              csgordon@discuss.systems wrote (#10):

              @lindsey @jeremysiek @krismicinski yeah... our first-year courses (~400 students) are trying a lightweight version of this: 10-minute chats with a random sample of students each week, so I think each student meets twice a term. It seems to be going well, but it's still a stretch even with an additional faculty member volunteering to help with these(!). The deeper material in later courses definitely needs more than 10 minutes.

                jfdm@discuss.systems wrote (#11):

                @csgordon @lindsey @jeremysiek @krismicinski I know it doesn’t help much, but I ask my students to produce a short report alongside the technical submission.

                The idea is that you can eyeball the differences between the written report and the code submission, and identify where more interrogation is required. I know GenAI could produce these reports, but every little helps.

                Submission canaries, for want of a better phrase.

                  csgordon@discuss.systems wrote (#12):

                  @jfdm @lindsey @jeremysiek @krismicinski I've long had students do a form of this with their assignments, but the generated reports got "good" enough a year and a half or so ago that I spent more time waffling on whether something crossed the line, plus basically arguing politely with offended students, than I would have spent just meeting with everyone. (I'm sure this balance point varies depending on how exactly you set things up with the report, but I couldn't figure out a more effective setup.) Meeting with everyone has ended up more equal for students, less frustrating for me, and more positive (i.e., non-adversarial) for students. The tipping point also depends on enrollment.

                    jfdm@discuss.systems wrote (#13):

                    @csgordon @lindsey @jeremysiek @krismicinski There are two routes I have considered:

                    1. learn how to write more open book take home assignments;
                    2. get students to do programming coursework in the lab in exam conditions

                    Either way, we are doomed.

                      smarr@mastodon.acm.org wrote (#14):

                      @jfdm @csgordon @lindsey @jeremysiek @krismicinski at least you could sell 2. as "practice for coding interviews", assuming that's still a thing when they graduate... 🙈

                        shriramk@mastodon.social wrote (#15):

                        @jfdm @csgordon @lindsey @jeremysiek @krismicinski
                        "we are doomed" is an incredibly disappointing take. You should have come to my "GenAI and CS Ed" talk (-:.

                        If our only value-add was "my course was gated behind a needlessly difficult thing", that doesn't say much for the value of our courses.

                          jfdm@discuss.systems wrote (#16):

                          @shriramk @csgordon @lindsey @jeremysiek @krismicinski right, so to be clear, the doom part is because these actions require not-so-trivial changes in how we do things. Within UK academia we are under a lot of pressure, without much time, and without the power and influence that 'full chairs' have in the States.

                          More so,

                          1. Open-book assignments do not exclude the use of GenAI; they can embrace it, and you can guard against or incorporate its use. Such assignments, in my experience, are harder to design, and doing them well requires training.

                          2. Exam conditions are also important as we want students to not rely on GenAI, and to ensure they have the fundamentals down.

                          The argument about GenAI must be how our forefathers thought about pocket calculators... and how their forefathers thought about slide rules, and so on.

                            shriramk@mastodon.social wrote (#17):

                            @jfdm @csgordon @lindsey @jeremysiek @krismicinski
                            Yes, it requires a fair bit of work to re-jig things.

                            I can't speak for UK academia. But I view it as my job to figure out how to upgrade courses for my students.

                            I don't know what power you think "full chairs" have in the US. You may be mistaking us for German Lehrstühle. We aren't! US "assistant professors" are not "assistants" to any "professors", for instance. I may wish I had some; I don't. <-;

                              shriramk@mastodon.social wrote (#18):

                              @krismicinski @jfdm @csgordon @lindsey @jeremysiek
                              Good, you seem to understand the assignment! Now go out and design and run a course that puts that into practice! Listening to me talk about it is 50 fewer minutes you'll have, and which you can't get back, to design your course. (-:

                                va2lam@mastodon.nz wrote (#19):

                                @krismicinski @shriramk @jfdm @csgordon @lindsey @jeremysiek I've just seen a different course proposal, Responsible AI, for designers of AI systems (argues that responsibility must be built into the design).

                                  krismicinski@types.pl wrote (#20):

                                  @va2lam @shriramk @jfdm @csgordon @lindsey @jeremysiek yeah, here our local thing is called "Resilient AI" (since that covers, you know, everything).

                                    shriramk@mastodon.social wrote (#21):

                                    @krismicinski @jfdm @csgordon @lindsey @jeremysiek
                                    I think I *am* interested in a kind of "how to use AI to do CS". I believe it can be done wisely and smartly.

                                      shriramk@mastodon.social wrote (#22):

                                      @krismicinski @jfdm @csgordon @lindsey @jeremysiek
                                      Notably, I'm talking about *novice* CS. For upper-level CS, it's pretty clear there are all kinds of interesting possibilities.

                                        krismicinski@types.pl wrote (#23):

                                        @shriramk @jfdm @csgordon @lindsey @jeremysiek okay, wow--I did not really expect that. Interesting, I will have to think about that.

                                          krismicinski@types.pl wrote (#24):

                                          @shriramk @jfdm @csgordon @lindsey @jeremysiek I think once you trust that the student could in principle write the code (and they're treating it like code the prof gave them, code their coworker wrote, etc.) then what you're saying is right. The concern is: "go through a whole college career and just have Claude Code do every single homework assignment with very little intellectual effort." Of course, many would argue that this is a failure of the curriculum design--but it will inevitably take time to catch up.
