Found myself wincing while reading this story about how Ars Technica fired a reporter over fabricated quotations generated by an AI tool.

ct@app.wafrn.net

Two-fold failure here. This guy should have taken a sick day (and possibly was incentivized not to do so? We don't know), and under no circumstances is "using AI to mine sources" an error you get to bounce back from as a journalist. Unforgivable - you understood the risks!

ct@app.wafrn.net

Ars Technica's credibility is forever marred by this event, however fair you think that is. And it's this dude's fault!

alessandro@mstdn.ca

@michael

Also we only use it when we're sick - we'd definitely never do this when we're feeling fine, no sirree.

@briankrebs

screwturn@mastodon.social

@nirak

Wrong in what way?
Yes, in most cases I'm reading the entire text, but sometimes the AI captures something I missed, and other times it confirms what I already got.

Time saving does feature, but the bigger issue is that using it improves validity, because it catches topics I missed.

@briankrebs

screwturn@mastodon.social

@briankrebs

In qualitative research we routinely redo each other's work and our own.
Having an AI do that too increases construct validity and reliability.

@nirak

chicob@mstdn.social

@briankrebs
AI in journalism is Farse Technica

briankrebs@infosec.exchange

@chicob Arse. It was right there, dude.

colo_lee@mstdn.social

@briankrebs well, at least he only did it once.
er, correction ...
he only got *caught* doing it once

stumpythemutt@social.linux.pizza

@screwturn @nirak @briankrebs Even a blind pig will find the occasional acorn.

kennethbousquet@mastodon.social

@briankrebs He's not at fault here. Even after the explanations, he shouldn't have been fired. He did not intentionally take credit for others' work. He should be reinstated and have his job back.

reflex@retrogaming.social

@briankrebs I learned not to trust Ars reporting after the Hacker X story, which they have still declined to retract.

teriradichel@infosec.exchange

@briankrebs you 100% cannot trust it. Like Google search results and Wikipedia. But it still might give you some idea or thought or resource you hadn’t seen yet that you can go research further. It can help you think of new questions and point you in new directions (which can be good or bad). I use it to explore ideas and if I do copy something written by AI I write “from Google AI:” or whatever so people can take it with a grain of salt and can back that up with links to other sources. It’s usually something I know is right but I like the way it wrote it and saves me some time. Sometimes I call out when it is wrong to demonstrate why you can’t always trust it. But I’m researching and writing about AI not the kind of things you write about so it’s a bit different. I generally just cite sources if I’m writing something about a data breach like you (and nowhere near the deep dive you do!)

briankrebs@infosec.exchange

@screwturn @nirak There are some pretty decent and recent studies showing AI substantially misses or misrepresents the point or summary of a story about 40-50 percent of the time.

ct@app.wafrn.net

Even if the success were 95%, as a journalist, consistently using a stochastic method to give sources guarantees you eventually fuck up and let a fabricated quote into print.
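
The arithmetic behind that point can be sketched roughly as follows. This is a minimal illustration, not a measurement: it assumes each AI-assisted lookup succeeds independently with the same probability, and the function name and the figure of 50 uses are hypothetical.

```python
def p_at_least_one_failure(success_rate: float, uses: int) -> float:
    """Chance that at least one of `uses` independent attempts fails.

    Assumes every attempt succeeds with the same probability
    `success_rate`, independently of the others (a simplification).
    """
    if not 0.0 <= success_rate <= 1.0:
        raise ValueError("success_rate must be between 0 and 1")
    if uses < 0:
        raise ValueError("uses must be non-negative")
    # P(no failures at all) = success_rate ** uses, so the complement
    # is the chance of at least one failure.
    return 1.0 - success_rate ** uses


if __name__ == "__main__":
    # Hypothetical workload: 50 AI-assisted lookups at a 95% success rate.
    print(f"{p_at_least_one_failure(0.95, 50):.1%}")
```

Under those assumptions, a 95% per-use success rate leaves roughly a 92% chance of at least one bad result across 50 uses, and the figure keeps climbing toward certainty as the number of uses grows.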

screwturn@mastodon.social

@ct
um... sure, but who is going to be asking the free version of ChatGPT for sources?
That is going to be a very poor use case.

If I am using the AI that is inside my CAQDAS, I am not going to see hallucination, and it internally cites each fact it produces. Reliability and validity are going to vary greatly depending on the environment you use the AI in, and what you are trying to do.

@briankrebs @nirak

screwturn@mastodon.social

@briankrebs
I'm not sure what that means
Like 50% of the time I use it, it will miss at least one point? Sure, but those odds are fine by me if it is also spotting things I missed, and has a reasonable inter-rater reliability with what I saw.

If you mean it gets 50% of the points wrong, then that is probably true of the free versions, but not what I am seeing in practice when I use the AI inside my CAQDAS

@nirak

screwturn@mastodon.social

@StumpyTheMutt
If it finds an acorn that I missed, then it found something of value.

Keep in mind, I'm not using the free version in its wide-open configuration, but rather a tightly configured version inside a research workbench. In three years of use, I have not seen a single case of hallucination by the LLM.

@nirak @briankrebs

ct@app.wafrn.net

I'm curious how your example actually works under the hood.

I have a sneaking suspicion that maybe your personal experience with a research summarization tool was not relevant to this story of a tech journalist, who needs to source current events from myriad sources and not just a limited database of pre-curated published research? I speculate your CAQDAS tool would not have been useful for a current events journalist who may need to quote things like statements from leadership, self-published cybersecurity reports, transcriptions of tech presentations etc… where there's a lot more critical thinking involved in selecting who to source from.

Regardless, I'd love to see how your CAQDAS tool fares against peer-reviewed fact-checking tests. I am very skeptical the failure rate is under 1% just from your testimony.

screwturn@mastodon.social

@ct

I have a sneaking suspicion that you don't know what "goes on under the hood" of qualitative research.
Let's just be clear what we are talking about in praxis.

One would not use the AI to find material, conduct interviews (although that is a distinct future possibility), or to do the discovery part of research, or journalism.
Once you have pulled those texts, transcripts, etc. into a CAQDAS, THEN you would use the AI to summarize, identify topics, match topics, etc.

@briankrebs @nirak

screwturn@mastodon.social

@ct
"peer-reviewed fact-checking tests"

Do you mean inter-rater reliability?
Because several people have found that, with various LLMs in CAQDAS platforms, agreement between the AI and human researchers is on par with agreement between human researchers.

@briankrebs @nirak
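
For readers unfamiliar with the term, inter-rater reliability is often reported as Cohen's kappa, which corrects raw agreement for the agreement two coders would reach by chance. The sketch below is only an illustration of that statistic; the function name and the topic codes are hypothetical and do not come from any CAQDAS product or study mentioned in this thread.

```python
from collections import Counter
from typing import Hashable, Sequence


def cohens_kappa(rater_a: Sequence[Hashable], rater_b: Sequence[Hashable]) -> float:
    """Cohen's kappa for two raters who each coded the same items."""
    if len(rater_a) != len(rater_b) or len(rater_a) == 0:
        raise ValueError("both raters must code the same non-empty set of items")
    n = len(rater_a)
    # Observed agreement: share of items given the same code by both raters.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement: for each code, multiply the two raters' marginal
    # proportions, then sum over codes.
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    expected = sum(counts_a[code] * counts_b[code] for code in counts_a) / (n * n)
    if expected == 1.0:
        # Both raters used one identical code throughout; kappa is
        # undefined here, so treat it as perfect agreement.
        return 1.0
    return (observed - expected) / (1.0 - expected)


if __name__ == "__main__":
    # Hypothetical topic codes assigned to six interview excerpts.
    human = ["cost", "trust", "cost", "risk", "trust", "cost"]
    ai = ["cost", "trust", "risk", "risk", "trust", "cost"]
    print(round(cohens_kappa(human, ai), 2))
```

In this made-up example the two coders agree on five of six excerpts, and kappa comes out at 0.75 once chance agreement is subtracted. A real comparison would need the actual coded data from both the human researchers and the tool.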

Ars Technica Fires Reporter After AI Controversy Involving Fabricated Quotes