The hidden beauty of vibe coding

nf3xn@mastodon.social

@gerrymcgovern Tired of explaining that more lines of code does not equal more productivity. I don't think there will be a reckoning, they will expect us to get used to cloud outages instead and just accept 'the robot ate my homework' as an excuse.

gatesvp@mstdn.ca

@gerrymcgovern This is an unusual article. It mixes truth and misconceptions in awkward ways.

For example:

"Smiley pointed to a recent attempt to rewrite SQLite in Rust using AI"

This isn't what happened. What was rewritten was a C compiler. A different tester then built SQLite with both the AI-written compiler and the official one. The AI one did worse.

But it did worse for very specific reasons. The AI version was only tested for correctness. It was only given unit tests as a parameter for success. It failed on real world performance tests, because it was never actually given that as a requirement.
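A minimal sketch of that gap, using made-up lookup functions rather than anything from the actual compiler or SQLite test runs. The names `scan_lookup`, `index_lookup`, and `STEP_BUDGET` are hypothetical, and "steps" is a stand-in for real-world cost. Both implementations pass the correctness check; only an explicit performance requirement tells them apart.

```python
# Hypothetical illustration: two lookups that return the same answers
# but do very different amounts of work.

def scan_lookup(rows, key):
    """Linear scan: correct, but inspects rows one at a time."""
    steps = 0
    for k, value in rows:
        steps += 1
        if k == key:
            return value, steps
    return None, steps


def index_lookup(index, key):
    """Hash lookup: same answer, counted as a single step."""
    return index.get(key), 1


rows = [(i, f"row-{i}") for i in range(10_000)]
index = dict(rows)
keys = [0, 4_999, 9_999]

# Correctness-only check: both implementations agree on every key.
correct = all(
    scan_lookup(rows, k)[0] == index_lookup(index, k)[0] for k in keys
)

# Performance requirement (assumed budget: 10 steps per lookup).
STEP_BUDGET = 10 * len(keys)
scan_steps = sum(scan_lookup(rows, k)[1] for k in keys)
index_steps = sum(index_lookup(index, k)[1] for k in keys)

scan_within_budget = scan_steps <= STEP_BUDGET
index_within_budget = index_steps <= STEP_BUDGET

print(correct, scan_within_budget, index_within_budget)
```

If the only success criterion handed over is `correct`, the linear scan counts as done. The budget check has to be stated as a requirement before anyone, human or AI, is held to it.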

"Lines of code, number of [pull requests], these are liabilities. These are not measures of engineering excellence." ... Measures of engineering excellence, said Smiley, include metrics like deployment frequency, lead time to production, change failure rate, mean time to restore, and incident severity.

So these are famously known as the DORA metrics. And they don't measure engineering excellence, ... /1

gatesvp@mstdn.ca

@gerrymcgovern ... they measure the capabilities of the engineering platform along with the expertise of the people using that platform.

There are lots of companies with excellent engineers and crummy DORA scores because they don't have the institutional support to improve those metrics. Nor does the score mean the business is successful. You can have great DORA metrics and still lack for paying customers.
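For anyone unfamiliar with the metrics, here is a rough sketch of how four of them could be computed from a deployment log. The `Deployment` record, its field names, and the sample data are all assumptions made for illustration, not taken from the article or from any DORA tooling. Incident severity is left out because it needs data beyond deploy timestamps, and time to restore is measured here from the failed deploy to the recorded restoration.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Optional


@dataclass
class Deployment:
    """Hypothetical record of one production deployment."""
    committed_at: datetime
    deployed_at: datetime
    failed: bool = False
    restored_at: Optional[datetime] = None


def dora_summary(deploys, period_days):
    """Summarise four metrics over a reporting period of `period_days`."""
    if not deploys:
        raise ValueError("need at least one deployment")
    n = len(deploys)
    failures = [d for d in deploys if d.failed]
    lead_times = [d.deployed_at - d.committed_at for d in deploys]
    restore_times = [
        d.restored_at - d.deployed_at for d in failures if d.restored_at
    ]
    return {
        "deployment_frequency_per_day": n / period_days,
        "mean_lead_time": sum(lead_times, timedelta()) / n,
        "change_failure_rate": len(failures) / n,
        "mean_time_to_restore": (
            sum(restore_times, timedelta()) / len(restore_times)
            if restore_times
            else None
        ),
    }


# Made-up sample data: four deployments over four days, one of which failed.
t0 = datetime(2024, 1, 1, 9, 0)
day, hour = timedelta(days=1), timedelta(hours=1)
deploys = [
    Deployment(t0, t0 + 2 * hour),
    Deployment(t0 + day, t0 + day + 4 * hour, failed=True,
               restored_at=t0 + day + 5 * hour),
    Deployment(t0 + 2 * day, t0 + 2 * day + 6 * hour),
    Deployment(t0 + 3 * day, t0 + 3 * day + 4 * hour),
]
summary = dora_summary(deploys, period_days=4)
print(summary)
```

Every input is a timestamp from the delivery pipeline. Nothing in the calculation looks at code quality or revenue, which is the point being made above.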

"The other challenge here is that the incentives are misaligned,"

But then he proceeds to list a bunch of examples of competing incentives. His examples of "misaligned" are really examples of "I would like to deliver less and get paid more"... /2

gatesvp@mstdn.ca

@gerrymcgovern ...

If there's an incentives problem here, it's that companies have been paying for a lot of BS rituals and they're discovering that the BS generating machine is undermining part of the ritual. Companies have also been getting away with under-specifying success in order to pad results as "good". But Gen AIs will "fill in" the under-specificity with made up data. Or they will fail to deliver anything into the gap that some human was hoping would be filled.

But none of this is "misaligned". It's intentional ambiguity designed to protect business units. The AI is just exposing the BS for what it is.

OP is kind of talking about that BS problem. But he's taking weird micro angles to view subsets of the problem without calling out the greater problem. He's not wrong, but he's also not really right either. //

arclight@oldbytes.space

@gerrymcgovern You love to see it:
"Insurers, he said, are already lobbying state-level insurance regulators to win a carve-out in business insurance liability policies so they are not obligated to cover AI-related workflows. 'That kills the whole system,' Deeks said."

Someone will be left holding the bag when slop kills someone, bankrupts a business, or causes serious damage. Insurers will make sure it isn't them.

uplategeek@bitbang.social

@gerrymcgovern message from management: I didn’t read your test report, but our AI assistant told us we should fire all but one software engineer and push to prod, whatever that means. Let us know how it goes, we’ll be at the money burning party our AI assistant scheduled for us.

curious_carrot@mastodon.social

@dpnash @gerrymcgovern

Do you have a link you would like to share about this 1000 times slower implementation of a SQLite instance?

circus_maximus@social.anoxinon.de

@poprox @PatrickTingen @gerrymcgovern

Requiring that a model trained on GPL code be GPL'ed itself would be fair.
No fun to say, but I think copyright/IP issues have a long way to go until the whole thing is settled, and needless to say, it has to be fair.

steveclough@metalhead.club

@poprox @PatrickTingen @gerrymcgovern The problem is that all coders take from other people, copy code, and vibe coding does this better, in theory.

There is a lot of harm, and there are - IME - better ways of doing this. Or at least, there were, until they were all enshittified.

pmdj@mstdn.social

@curious_carrot @dpnash @gerrymcgovern It's linked from the original Register article: https://medium.com/write-a-catalyst/an-ai-wrote-576-000-lines-to-replace-sqlite-7ea538826d72

mawhrin@circumstances.run

@curious_carrot @dpnash @gerrymcgovern yes. (and you have it too.)

oh, and it's 20000 times slower, actually.

autoiue@mastodon.xn--cfa.xyz

@gerrymcgovern The original article has it at 20000× worse x)

Your LLM Doesn't Write Correct Code. It Writes Plausible Code.

One of the simplest tests you can run on a database:

(blog.katanaquant.com)

shelldozer@oldbytes.space

@gerrymcgovern KLOC as a measurement of productivity has come back into fashion yet again, for the *umpteenth* time. We never learn. Welcome to the 1970s.

AI still doesn't work very well in business, reckoning soon
