Chinese AI Company Accused of Stealing Technology from OpenAI
Chinese artificial intelligence company DeepSeek has been accused of developing AI models that compete with OpenAI’s flagship offerings, including the popular ChatGPT, at a fraction of the cost. However, OpenAI and Microsoft are investigating whether DeepSeek used OpenAI’s data to develop their own models.
Accusations of Data Theft
According to sources, Microsoft security researchers detected large amounts of data being exfiltrated through OpenAI developer accounts in late 2024, which are believed to be affiliated with DeepSeek. This has led to accusations that DeepSeek used OpenAI’s API to integrate OpenAI’s AI models into their own.
Distillation Technique Suspected
OpenAI has found evidence linking DeepSeek to the use of distillation, a technique used to train AI models by extracting data from larger, more capable ones. This is an efficient way to train smaller models at a fraction of the cost of training larger models. However, OpenAI has not provided details of the evidence it has found.
IP Theft Suspected
President Donald Trump’s artificial intelligence czar, David Sacks, has stated that it is possible that IP theft had occurred. “There’s substantial evidence that what DeepSeek did here is they distilled knowledge out of OpenAI models and I don’t think OpenAI is very happy about this,” Sacks said in an interview with Fox News.
OpenAI’s Response
OpenAI has stated that it is aware that companies, including those in China, are trying to distill the models of leading US AI companies. The company has a careful process for determining which frontier capabilities to include in released models and believes it is critical to work closely with the US government to protect its intellectual property.
Conclusion
The accusations of IP theft and data exfiltration have raised concerns about the integrity of the AI model development process. As the AI industry continues to grow, it is crucial to ensure that companies like OpenAI and others take measures to protect their intellectual property and prevent unauthorized use of their technology.
FAQs
Q: What is distillation in AI model development?
A: Distillation is a technique used to train AI models by extracting data from larger, more capable ones.
Q: What is OpenAI’s stance on the allegations?
A: OpenAI has stated that it is investigating the allegations and has found evidence linking DeepSeek to the use of distillation.
Q: What is the potential impact on the AI industry?
A: The allegations of IP theft and data exfiltration could have significant implications for the AI industry, as it may lead to a loss of trust in the integrity of AI model development.

