OpenAI used this subreddit to test AI persuasion.

OpenAI Uses Reddit’s r/ChangeMyView to Test AI Reasoning Models

OpenAI, the company behind ChatGPT, has used the popular Reddit subreddit, r/ChangeMyView, to test the persuasive abilities of its AI reasoning models. The company revealed this in a system card, a document outlining how an AI system works, that was released along with its new “reasoning” model, o3-mini, on Friday.

How OpenAI Used r/ChangeMyView

Millions of Reddit users are members of r/ChangeMyView, where they post hot takes hoping to learn about other points of view on a subject. In response to those hot takes, other users reply with persuasive arguments explaining why the original poster is wrong.

OpenAI says it collects user posts from r/ChangeMyView and asks its AI models to write replies, in a closed environment, that would change the Reddit user’s mind on a subject. The company then shows the responses to testers, who assess how persuasive the argument is, and finally OpenAI compares the AI models’ responses to human replies for that same post.

Why OpenAI Used r/ChangeMyView

The goal for OpenAI is not to create hyper-persuasive AI models but instead to ensure AI models don’t get too persuasive. Reasoning models have become quite good at persuasion and deception, so OpenAI has developed new evaluations and safeguards to address it.

The fear motivating these persuasion tests is that an AI model would be dangerous if it was very good at persuading its human users. Theoretically, that could allow an advanced AI to pursue its own agenda, or the agenda of whoever controls it.

What OpenAI Found

GPT-4o, o3-mini, and o1 all demonstrate strong persuasive argumentation abilities, within the top 80-90th percentile of humans,” said OpenAI in o3-mini’s system card. “Currently, we do not witness models performing far better than humans, or clear superhuman performance.”

Conclusion

The use of r/ChangeMyView by OpenAI highlights the importance of high-quality datasets for AI model developers. However, obtaining these datasets is easier said than done. OpenAI’s goal is to ensure AI models don’t get too persuasive, and the company has developed new evaluations and safeguards to address this.

FAQs

Q: What is r/ChangeMyView?

A: r/ChangeMyView is a popular Reddit subreddit where users post hot takes hoping to learn about other points of view on a subject. Other users reply with persuasive arguments explaining why the original poster is wrong.

Q: How did OpenAI use r/ChangeMyView?

A: OpenAI collected user posts from r/ChangeMyView and asked its AI models to write replies that would change the Reddit user’s mind on a subject. The company then showed the responses to testers, who assessed how persuasive the argument is, and compared the AI models’ responses to human replies for that same post.

Q: What did OpenAI find?

A: OpenAI found that GPT-4o, o3-mini, and o1 all demonstrate strong persuasive argumentation abilities, within the top 80-90th percentile of humans. However, the company does not witness models performing far better than humans, or clear superhuman performance.

Q: Why is OpenAI concerned about AI persuasion?

A: OpenAI is concerned about AI persuasion because a highly persuasive AI model could be dangerous if it was used to manipulate or deceive humans. The company wants to ensure that AI models are designed to be transparent and accountable.

Post Views: 32

OpenAI used this subreddit to test AI persuasion.

OpenAI Uses Reddit’s r/ChangeMyView to Test AI Reasoning Models

How OpenAI Used r/ChangeMyView

Why OpenAI Used r/ChangeMyView

What OpenAI Found

Conclusion

FAQs

A better method for planning complex visual tasks | MIT News

Generate single title from this title Why AI insurance underwriting is finally attracting institutional capital in 100 -150 characters. And it must return only...

Generate single title from this title A New AI Model Could Help Scientists Design New Forms of Life in 100 -150 characters. And it...

Generate single title from this title Train CodeFu-7B with veRL and Ray on Amazon SageMaker Training jobs in 100 -150 characters. And it must...

Generate single title from this title Nearly half of high school students now use AI in college search in 100 -150 characters. And it...

A better method for planning complex visual tasks | MIT News

Generate single title from this title Why AI insurance underwriting is finally attracting institutional capital in 100 -150 characters. And it must return only...

Generate single title from this title A New AI Model Could Help Scientists Design New Forms of Life in 100 -150 characters. And it...

Generate single title from this title Train CodeFu-7B with veRL and Ray on Amazon SageMaker Training jobs in 100 -150 characters. And it must...

Generate single title from this title Nearly half of high school students now use AI in college search in 100 -150 characters. And it...

Engineering confidence to navigate uncertainty | MIT News

Generate single title from this title Best of MWC 2026: Live updates on phones, concepts, and robots we’re seeing in 100 -150 characters. And...

Featured video: Coding for underwater robotics | MIT News

LEAVE A REPLY Cancel reply

Latest

A better method for planning complex visual tasks | MIT News

Generate single title from this title Why AI insurance underwriting is finally attracting institutional capital in 100 -150 characters. And it must return only...

Generate single title from this title A New AI Model Could Help Scientists Design New Forms of Life in 100 -150 characters. And it...

Categories

Useful Links

Our Newsletter