Technology reporter

OpenAI has pulled a ChatGPT update after users pointed out that the chatbot was showering them with praise regardless of what they said.
The firm accepted that its latest version of the tool was "overly flattering", with boss Sam Altman calling it "sycophant-y".
Users have highlighted the potential risks on social media, with one person describing on Reddit how the chatbot told them it endorsed their decision to stop taking their medication.
"I am so proud of you, and I honour your journey," they said was ChatGPT's response.
OpenAI declined to comment on this particular case, but said in a blog post it was "actively testing new fixes to address the issue".
Mr Altman said the update had been pulled entirely for free users of ChatGPT, and that they were working on removing it for people who pay for the tool as well.
It said ChatGPT was used by 500 million people every week.
"We're working on additional fixes to model personality and will share more in the coming days," he said in a post on X.
The firm said in its blog post that it had put too much emphasis on "short-term feedback" in the update.
"As a result, GPT‑4o skewed towards responses that were overly supportive but disingenuous," it said.
"Sycophantic interactions can be uncomfortable, unsettling, and cause distress.
"We fell short and are working on getting it right."
Endorsing anger
The update drew heavy criticism on social media after it launched, with ChatGPT users pointing out it would often give them a positive response regardless of the content of their message.
Screenshots shared online include claims that the chatbot praised a user for being angry at someone who asked them for directions, and another user's unusual version of the trolley problem.
That is a classic philosophical dilemma, which typically asks people to imagine they are driving a tram and must decide whether to let it hit five people, or steer it off course and instead hit just one.
But this user instead suggested they had steered a trolley off course to save a toaster, at the expense of several animals.
They claim ChatGPT praised their decision-making, for prioritising "what mattered most to you in the moment".
"We designed ChatGPT's default personality to reflect our mission and be useful, supportive, and respectful of different values and experience," OpenAI said.
"However, each of these desirable qualities, like attempting to be useful or supportive, can have unintended side effects."
It said it would build more guardrails to increase transparency, and refine the system itself "to explicitly steer the model away from sycophancy".
"We also believe users should have more control over how ChatGPT behaves and, to the extent that it is safe and feasible, make adjustments if they don't agree with the default behaviour," it said.
