OpenAI has evidence that its models helped train China’s DeepSeek.

Chinese AI Company Accused of Stealing Technology from OpenAI

Chinese artificial intelligence company DeepSeek has developed AI models that are said to compete with OpenAI’s flagship offerings, including the popular ChatGPT, at a fraction of the cost. OpenAI and Microsoft are now investigating whether DeepSeek used OpenAI’s data to develop its own models.

Accusations of Data Theft

According to sources, Microsoft security researchers detected in late 2024 that large amounts of data were being exfiltrated through OpenAI developer accounts believed to be affiliated with DeepSeek. This has led to accusations that DeepSeek used OpenAI’s API to integrate OpenAI’s AI models into its own.

Distillation Technique Suspected

OpenAI says it has found evidence linking DeepSeek to the use of distillation, a technique that trains a smaller AI model on the outputs of a larger, more capable one. This lets developers build smaller models at a fraction of the cost of training larger ones. However, OpenAI has not provided details of the evidence it has found.
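To make the idea concrete, here is a minimal sketch of the classic soft-label form of distillation, in which a student model is penalized for diverging from a teacher model's output distribution. It illustrates the general technique only and is not a description of what DeepSeek is alleged to have done. The function names, logit values, and temperature are all hypothetical. A model reached only through an API typically returns generated text rather than full logits, so in that setting a student would more likely be fine-tuned on the teacher's text outputs.

```python
import math


def softmax(logits, temperature=1.0):
    """Convert raw scores into probabilities, softened by a temperature."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL divergence from the teacher's softened distribution to the student's.

    A lower value means the student imitates the teacher more closely.
    """
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


# Hypothetical logits over three candidate tokens (illustrative values only).
teacher = [4.0, 1.0, 0.5]
close_student = [3.8, 1.1, 0.4]
far_student = [0.5, 4.0, 1.0]

print(distillation_loss(teacher, close_student))
print(distillation_loss(teacher, far_student))
```

Training the student to minimize this loss over many inputs pushes its predictions toward the teacher's, which is why the approach can be far cheaper than training a large model from scratch.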

IP Theft Suspected

President Donald Trump’s artificial intelligence czar, David Sacks, has said that IP theft may have occurred. “There’s substantial evidence that what DeepSeek did here is they distilled knowledge out of OpenAI models and I don’t think OpenAI is very happy about this,” Sacks said in an interview with Fox News.

OpenAI’s Response

OpenAI has said it is aware that companies, including those in China, are trying to distill the models of leading US AI companies. The company says it has a careful process for determining which frontier capabilities to include in released models, and that it believes it is critical to work closely with the US government to protect its intellectual property.

Conclusion

The accusations of IP theft and data exfiltration have raised concerns about the integrity of AI model development. As the AI industry continues to grow, companies such as OpenAI will need to take measures to protect their intellectual property and prevent unauthorized use of their technology.

FAQs

Q: What is distillation in AI model development?
A: Distillation is a technique that trains a smaller AI model on the outputs of a larger, more capable one.

Q: What is OpenAI’s stance on the allegations?
A: OpenAI says it is investigating, together with Microsoft, whether DeepSeek used its data, and that it has found evidence linking DeepSeek to the use of distillation. It has not provided details of that evidence.

Q: What is the potential impact on the AI industry?
A: The allegations of IP theft and data exfiltration could have significant implications for the AI industry, as they may lead to a loss of trust in the integrity of AI model development.


