OpenAI Accidentally Erases Potential Evidence in Training Data Lawsuit

A Stunning Error: OpenAI Erases Critical Evidence in Lawsuit Brought by The New York Times and Other Newspapers

Mistake Highlights Concerns Over AI Training Data

In a stunning misstep, OpenAI engineers accidentally erased critical evidence gathered by The New York Times and other major newspapers in their lawsuit over AI training data, according to a court filing Wednesday.

What Happened?

The newspapers’ legal teams had spent over 150 hours searching through OpenAI’s AI training data to find instances where their news articles were included, the filing claims. The filing does not explain how the mistake occurred or precisely what the erased data included. It says OpenAI admitted to the error and tried to recover the data, but what was recovered was incomplete and unreliable, so it cannot be used to properly trace how the news organizations’ articles were used in building OpenAI’s AI models.

OpenAI’s Response

OpenAI’s lawyers characterized the data erasure as a "glitch," while The New York Times’ attorneys noted they had "no reason to believe" it was intentional.

The Lawsuit

The New York Times Company launched this landmark battle last December, claiming OpenAI and its partner Microsoft had built their AI tools by "copying and using millions" of the publication’s articles and now "directly compete" with its content as a result. The publication is asking for OpenAI to be held liable for "billions of dollars in statutory and actual damages" for allegedly copying its works.

The Cost of the Battle

The Times has already spent more than $1 million battling OpenAI in court, a significant sum few publishers can match. Meanwhile, OpenAI has struck deals with major outlets like Axel Springer, Conde Nast, and The Verge’s parent company Vox Media, suggesting many publishers would rather partner than fight.

OpenAI’s Refusal to Join the Update

OpenAI declined to join The New York Times in filing the update to the court. The declaration was instead filed by Jennifer Maisel, an attorney representing the news organizations, to formally notify the court of what happened.

Conclusion

The data erasure has sparked concerns over the reliability and transparency of AI training data. The lawsuit highlights the importance of ensuring the integrity of AI systems and the need for open communication between AI developers and news organizations.

Frequently Asked Questions

Q: What happened to the data?
A: OpenAI engineers accidentally erased critical evidence gathered by The New York Times and other major newspapers in their lawsuit over AI training data.

Q: How did this mistake occur?
A: The exact cause of the mistake is unclear, but OpenAI admitted to the error and tried to recover the data.

Q: What was recovered from the data?
A: The recovered data was incomplete and unreliable, so it cannot help properly trace how the news organizations’ articles were used in building OpenAI’s AI models.

Q: What is the significance of this lawsuit?
A: The lawsuit highlights concerns over the reliability and transparency of AI training data and the need for open communication between AI developers and news organizations.
