this post was submitted on 15 Aug 2023

138 points (87.1% liked)

Technology

74345 readers

2868 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

[email protected]

138

ChatGPT has a style over substance trick that seems to dupe people into thinking it's smart, researchers found (www.businessinsider.com)

submitted 2 years ago by [email protected] to c/[email protected]

20 comments fedilink hide all child comments

ChatGPT has a style over substance trick that seems to dupe people into thinking it's smart, researchers found::Developers often prefer ChatGPT's responses about code to those submitted by humans, despite the bot frequently being wrong, researchers found.

all 23 comments

sorted by: hot top controversial new old

[–] [email protected] 81 points 2 years ago (3 children)

A caveat: This user analysis involved just 12 programmers being asked to assess if they prefer the responses of ChatGPT or those written by humans on Stack Overflow to 2,000 randomly sampled questions.

Nothing to see here.

[–] [email protected] 17 points 2 years ago (1 children)

What they should have done is asked those same 12 programmers to ask a common everyday question on Stack Overflow and then while waiting for a response, ask ChatGPT the same question.

I'd bet 50 bucks almost all of them would get an acceptable answer to their question out of ChatGPT 4 in far less time than it takes the moderators at Stack Overflow to delete the question. I can't imagine any of the questions will actually be answered on SO.

[–] [email protected] 5 points 2 years ago

Right. The problem with SO is that you don't actually get to ask any questions; so reason would suggest anything is at least as good as SO-- even asking a house plant, or Siri, or whatever. Something that actually answers your question would obviously be a better option.

Stack Overflow brought their irrelevance on themselves, I suspect.

[–] [email protected] 12 points 2 years ago

12 programmers? That ChatGPT sample size is way too small to be meaningful IMO

[–] [email protected] 11 points 2 years ago (3 children)

Anyone who has actually needed a correct answer to a question realized this a long time ago.

The problem is that most people don't bother checking the answers.

[–] [email protected] 4 points 2 years ago

If you need a correct answer, you're doing it wrong!

I'm joking of course, but there's a seed of truth: I've found ChatGPT's wrong or incomplete answers to be incredibly helpful as a starting point. Sometimes it will suggest a Python module I didn't even know about that does half my work for me. Or sometimes it has a lot of nonsense but the one line I actually need is correct (or close enough for me to understand).

Nobody should be copying code off Stack Overflow without understanding it, either.

[–] [email protected] 0 points 2 years ago

This hasn't been my experience. Yes, chatgpt gets stuff wrong, and fairly regularly. But I can ask it my question directly, and can include sample code, and I get an answer immediately. Anyone going on stack overflow has to either google around and sift through answers for relevance, or has to post the question and wait for someone to respond.

With either chatgpt or stack you have to check the answer to make sure it works - that's how coding goes. But one I know if it works or not pretty much immediately with fairly low investment of time and effort. And if it doesn't, I just rephrase the question, or literally say "that doesn't seem to work, now I'm getting this error: $error"

[–] [email protected] -3 points 2 years ago* (last edited 2 years ago) (1 children)

I check and triple check every answer. It's rarely incorrect in my experience.

[–] [email protected] 10 points 2 years ago

When you underpay a bunch of gig workers to rate the outputs? Obviously it's going to write in a manner that best BS's a layperson.

Would be too expensive to hire experts in every field to train the AI to actually do good work. Imagine paying software engineers 100k plus benefits to vote on its code outputs, or getting Miss Manners to comment on its etiquette suggestions.

[–] [email protected] 10 points 2 years ago

This was the first thing I've noticed on day one. The way it "speaks" is designed to sound like a polite authority in the field.

[–] [email protected] 7 points 2 years ago (1 children)

BTW, how does the L4sBot decide which articles to post?

[–] [email protected] 18 points 2 years ago

I don't know, but I'm guessing it's style over substance.

[–] [email protected] 2 points 2 years ago (2 children)

It's like crypto, or really any other con job.

It makes idiots feel smart.

Make a mark feel like they're smart, and they'll become attached to the idea and defend it to their death. Because the alternative is they aren't really smart and fell for a scam.

When smart people try to explain that to the idiots, it just makes them defend the scam even harder.

Try to tell people chatgpt isn't great, and they just ramble on about some nonsensical stuff they don't even understand themselves and then claim anyone that disagrees just isn't smart enough to get it.

It's a great business plan if you have zero morals, which is why the method never really goes away, just moves to another product.

[–] [email protected] 6 points 2 years ago

I have seen someone type "tell me how make a million dollar business" into chatgpt. Of course that's not going to work. But LLMs have immediate obvious value that crypto does not, and I think making the comparison reveals a lack of experience with those useful applications. I'm using chatgpt nearly every day as a tool to help with coding. It's not a replacement for a person, but it is like giving a person a forklift.

[–] [email protected] 1 points 2 years ago* (last edited 2 years ago)

Could someone paste link to study?
I live outside US/UK and I cant view article

[–] [email protected] 0 points 2 years ago* (last edited 2 years ago) (1 children)

Certainly it's gotten worse as we've all seen the news probably. When gpt4 came to the API it was impressive at times. A caveat always remained: don't blindly trust it, but that goes for stack overflow replies too.

Ohh cool, a downvote and smug reply. Go back to reddit or something.

Lol https://mastodon.social/@rodhilton/110894818243613681

[–] [email protected] 2 points 2 years ago* (last edited 2 years ago)

I've seen that in the news. I haven't experienced it at all. In fact I'm getting far better results now than I ever did before, though I suspect that's mostly on me - experience using almost any tool will improve the output.