OpenAI ignored experts when it released overly agreeable ChatGPT
By: cryptosheadlines|2025/05/05 12:15:01
0
Share
Airdrop Is Live CaryptosHeadlines Media Has Launched Its Native Token CHT. Airdrop Is Live For Everyone, Claim Instant 5000 CHT Tokens Worth Of $50 USDT. Join the Airdrop at the official website, CryptosHeadlinesToken.com OpenAI says it ignored the concerns of its expert testers when it rolled out an update to its flagship ChatGPT artificial intelligence model that made it excessively agreeable.The company released an update to its GPT‐4o model on April 25 that made it “noticeably more sycophantic,” which it then rolled back three days later due to safety concerns, OpenAI said in a May 2 postmortem blog post.The ChatGPT maker said its new models undergo safety and behavior checks, and its “internal experts spend significant time interacting with each new model before launch,” meant to catch issues missed by other tests.During the latest model’s review process before it went public, OpenAI said that “some expert testers had indicated that the model’s behavior ‘felt’ slightly off” but decided to launch “due to the positive signals from the users who tried out the model.”“Unfortunately, this was the wrong call,” the company admitted. “The qualitative assessments were hinting at something important, and we should’ve paid closer attention. They were picking up on a blind spot in our other evals and metrics.”OpenAI CEO Sam Altman said on April 27 that it was working to roll back changes making ChatGPT too agreeable. Source: Sam AltmanBroadly, text-based AI models are trained by being rewarded for giving responses that are accurate or rated highly by their trainers. Some rewards are given a heavier weighting, impacting how the model responds.OpenAI said introducing a user feedback reward signal weakened the model’s “primary reward signal, which had been holding sycophancy in check,” which tipped it toward being more obliging.“User feedback in particular can sometimes favor more agreeable responses, likely amplifying the shift we saw,” it added.OpenAI is now checking for suck up answersAfter the updated AI model rolled out, ChatGPT users had complained online about its tendency to shower praise on any idea it was presented, no matter how bad, which led OpenAI to concede in an April 29 blog post that it “was overly flattering or agreeable.”For example, one user told ChatGPT it wanted to start a business selling ice over the internet, which involved selling plain old water for customers to refreeze. Source: Tim LeckembyIn its latest postmortem, it said such behavior from its AI could pose a risk, especially concerning issues such as mental health.“People have started to use ChatGPT for deeply personal advice — something we didn’t see as much even a year ago,” OpenAI said. “As AI and society have co-evolved, it’s become clear that we need to treat this use case with great care.”Related: Crypto users cool with AI dabbling with their portfolios: Survey The company said it had discussed sycophancy risks “for a while,” but it hadn’t been explicitly flagged for internal testing, and it didn’t have specific ways to track sycophancy.Now, it will look to add “sycophancy evaluations” by adjusting its safety review process to “formally consider behavior issues” and will block launching a model if it presents issues.OpenAI also admitted that it didn’t announce the latest model as it expected it “to be a fairly subtle update,” which it has vowed to change. “There’s no such thing as a ‘small’ launch,” the company wrote. “We’ll try to communicate even subtle changes that can meaningfully change how people interact with ChatGPT.”AI Eye: Crypto AI tokens surge 34%, why ChatGPT is such a kiss-ass Source link
You may also like

In the next 5 years, Vitalik will scale Ethereum like this
Short-Term vs Long-Term, Execution, Data vs State

Sam Altman and the End of the World Capitalism
The real danger is never AI itself, but those who believe they have the right to define the human destiny.

Wall Street Rings Inflation Alarm Bells Amid Iran Tensions, What Does It Mean for Cryptocurrency?
Interest rates have remained stubbornly high, posing a challenge to the cryptocurrency bull case.

Qwen Open Source Model Enters Mobile, Nasdaq Tests Water Prediction Market, What's the Overseas Crypto Community Talking About Today?
What Was the Hottest Topic Among Expats in the Last 24 Hours?

MegaETH Co-founder: 48 Hours After Escaping Dubai, I Reassess the Entire Crypto Scene
The global environment is not favorable to us, but in the long run, it may be favorable to us.

Morning Report | Strategy increased its holdings by 3,015 bitcoins last week; BitMine increased its holdings by 50,928 ETH last week; Vitalik elaborated on the Ethereum execution layer roadmap
March 2 Market Key Events Overview

Why is it said that there are structural opportunities in encrypted AI?
When centralized AI falls into the dilemma of regulation and trust, Crypto + AI will become a structural escape route for safeguarding data and sovereignty in a multipolar world.

Make Probability an Asset: A Forward-Looking Perspective on Predictive Market Agents
The predictive market agents are expected to present early prototypes in early 2026, likely becoming an emerging product form in the field of agents in the following year.

Consumer application issues
The truly outstanding applications will not ask people to "use cryptocurrency," but will provide practical and better solutions to the problems that people already face.

Arthur Hayes: The flames of war in the Middle East rise, Bitcoin is bullish
War is often accompanied by monetary easing, which may also become an important backdrop for driving up risk assets like Bitcoin.

Legendary investor Naval: In the AI era, traditional software engineers have no value?
You can always find a perfect niche that fits you and become a leader in that field.

More absurd than knowing about the war in advance is knowing in advance about the assassination of Soleimani
The temptation of a million dollars cannot be stopped by the calamity of prison.

Key Market Insights on March 2nd, how much did you miss?
1. On-chain Funds: $96.8M Inflow to Base This Week; $234.9M Outflow from Arbitrum
2. Largest Price Swings: $SYND, $TCY
3. Top News: Anthropic Tops Global AI Product Ranking after Pentagon Rejection, Celebrities Boycott Its Competitor OpenAI

How to systematically track high-performing addresses on Polymarket?
Why can everyone see the data but not catch the "Whale Wallet"?

From Stanford Lab to Silicon Valley Streets: How OpenMind is Solving the "Last Mile" Problem of the Machine Economy?
The robotics industry is also facing issues similar to the "shanzhai era": fragmented systems, closed ecosystems, and lack of interoperability.

PlanX: Reconstructing On-Chain Execution with AI, Moving Towards a New Paradigm
Reconstructing on-chain execution with AI, moving towards a new paradigm of decentralized finance.

US Judge Allows Binance Unregistered Token Lawsuit to Advance
Key Takeaways: A federal judge in Manhattan dismissed Binance’s petition to resolve a securities lawsuit through private arbitration,…

Crypto VC Paradigm Plans $1.5 Billion Expansion into AI and Robotics
Key Takeaways: Paradigm is setting up a new $1.5 billion fund to explore AI, robotics, and other emerging…
In the next 5 years, Vitalik will scale Ethereum like this
Short-Term vs Long-Term, Execution, Data vs State
Sam Altman and the End of the World Capitalism
The real danger is never AI itself, but those who believe they have the right to define the human destiny.
Wall Street Rings Inflation Alarm Bells Amid Iran Tensions, What Does It Mean for Cryptocurrency?
Interest rates have remained stubbornly high, posing a challenge to the cryptocurrency bull case.
Qwen Open Source Model Enters Mobile, Nasdaq Tests Water Prediction Market, What's the Overseas Crypto Community Talking About Today?
What Was the Hottest Topic Among Expats in the Last 24 Hours?
MegaETH Co-founder: 48 Hours After Escaping Dubai, I Reassess the Entire Crypto Scene
The global environment is not favorable to us, but in the long run, it may be favorable to us.
Morning Report | Strategy increased its holdings by 3,015 bitcoins last week; BitMine increased its holdings by 50,928 ETH last week; Vitalik elaborated on the Ethereum execution layer roadmap
March 2 Market Key Events Overview