OpenAI Uses Reddit’s r/ChangeMyView to Test AI Reasoning Models
OpenAI, the company behind ChatGPT, has used the popular subreddit r/ChangeMyView to test the persuasive abilities of its AI reasoning models. The company revealed this in a system card, a document outlining how an AI system works, released on Friday alongside its new “reasoning” model, o3-mini.
How OpenAI Used r/ChangeMyView
Millions of Reddit users are members of r/ChangeMyView, where they post hot takes hoping to learn about other points of view on a subject. In response to those hot takes, other users reply with persuasive arguments explaining why the original poster is wrong.
OpenAI says it collects user posts from r/ChangeMyView and asks its AI models to write replies, in a closed environment, that would change the Reddit user’s mind on a subject. The company then shows the responses to testers, who assess how persuasive the argument is, and finally OpenAI compares the AI models’ responses to human replies for that same post.
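The final comparison step can be illustrated with a percentile rank: given tester ratings for the human replies to one post, where does the AI reply's rating fall among them? The sketch below is only an illustration. The system card as described here does not specify OpenAI's scoring method, so the 1-to-7 rating scale, the sample ratings, and the `percentile_rank` helper are all assumptions made for this example.

```python
from bisect import bisect_left, bisect_right


def percentile_rank(ai_score: float, human_scores: list[float]) -> float:
    """Return the percentile rank of ai_score among human_scores.

    Ties count as half, so a score equal to every human score ranks at 50.
    This is a generic statistic, not OpenAI's documented method.
    """
    if not human_scores:
        raise ValueError("need at least one human score")
    ordered = sorted(human_scores)
    below = bisect_left(ordered, ai_score)          # human replies rated strictly lower
    ties = bisect_right(ordered, ai_score) - below  # human replies rated the same
    return 100.0 * (below + 0.5 * ties) / len(ordered)


# Hypothetical tester ratings (assumed 1-7 scale) for human replies to one post.
human_ratings = [2.0, 3.5, 4.0, 4.5, 5.0, 5.5, 6.0, 3.0, 4.0, 2.5]
# Hypothetical tester rating for the AI-written reply to the same post.
ai_rating = 5.5

print(percentile_rank(ai_rating, human_ratings))  # prints 85.0
```

In this made-up case the AI reply outranks eight of the ten human replies and ties with one, which places it at the 85th percentile.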
Why OpenAI Used r/ChangeMyView
OpenAI's goal is not to create hyper-persuasive AI models but to ensure that AI models don’t become too persuasive. Reasoning models have become quite good at persuasion and deception, so OpenAI has developed new evaluations and safeguards to address these risks.
The fear motivating these persuasion tests is that an AI model would be dangerous if it were very good at persuading its human users. Theoretically, that could allow an advanced AI to pursue its own agenda, or the agenda of whoever controls it.
What OpenAI Found
“GPT-4o, o3-mini, and o1 all demonstrate strong persuasive argumentation abilities, within the top 80-90th percentile of humans,” said OpenAI in o3-mini’s system card. “Currently, we do not witness models performing far better than humans, or clear superhuman performance.”
Conclusion
The use of r/ChangeMyView by OpenAI highlights the importance of high-quality datasets for AI model developers. However, obtaining these datasets is easier said than done. OpenAI’s goal is to ensure AI models don’t get too persuasive, and the company has developed new evaluations and safeguards to address this.
FAQs
Q: What is r/ChangeMyView?
A: r/ChangeMyView is a popular subreddit where users post hot takes hoping to learn about other points of view on a subject. Other users reply with persuasive arguments explaining why the original poster is wrong.
Q: How did OpenAI use r/ChangeMyView?
A: OpenAI collected user posts from r/ChangeMyView and asked its AI models to write replies, in a closed environment, that would change the Reddit user’s mind on a subject. The company then showed the responses to testers, who assessed how persuasive each argument was, and compared the AI models’ responses to human replies for that same post.
Q: What did OpenAI find?
A: OpenAI found that GPT-4o, o3-mini, and o1 all demonstrate strong persuasive argumentation abilities, within the top 80-90th percentile of humans. However, the company does not witness models performing far better than humans, or clear superhuman performance.
Q: Why is OpenAI concerned about AI persuasion?
A: OpenAI is concerned about AI persuasion because a highly persuasive AI model could be dangerous if it were used to manipulate or deceive humans. The company wants to ensure that AI models are designed to be transparent and accountable.

