Date:

OpenAI Fired Up

OpenAI Responds to DeepSeek’s Upset

It’s been just over a week since DeepSeek upended the AI world. The introduction of its open-weight model—apparently trained on a fraction of the specialized computing chips that power industry leaders—set off shock waves inside OpenAI. Not only did employees claim to see hints that DeepSeek had “inappropriately distilled” OpenAI’s models to create its own, but the startup’s success had Wall Street questioning whether companies like OpenAI were wildly overspending on compute.

The Sputnik Moment

“DeepSeek R1 is AI’s Sputnik moment,” wrote Marc Andreessen, one of Silicon Valley’s most influential and provocative inventors, on X.

OpenAI’s Response

In response, OpenAI is preparing to launch a new model today, ahead of its originally planned schedule. The model, o3-mini, will debut in both API and chat. Sources say it has o1 level reasoning with 4o-level speed. In other words, it’s fast, cheap, smart, and designed to crush DeepSeek. (OpenAI spokesperson Niko Felix says work on o3-mini began long before DeepSeek’s debut and the goal was to launch by the end of January).

Efficiency and Competition

The moment has galvanized OpenAI staff. Inside the company, there’s a feeling that—particularly as DeepSeek dominates the conversation—OpenAI must become more efficient or risk falling behind its newest competitor.

Internal Power Struggle

Part of the issue stems from OpenAI’s origins as a nonprofit research organization before becoming a profit-seeking powerhouse. An ongoing power struggle between the research and product groups, employees claim, has resulted in a rift between the teams working on advanced reasoning and those working on chat. (OpenAI spokesperson Niko Felix says this is “incorrect” and notes that the leaders of these teams, chief product officer Kevin Weil and chief research officer Mark Chen, “meet every week and work closely to align on product and research priorities.”)

Chat Product

Some inside OpenAI want the company to build a unified chat product, one model that can tell whether a question requires advanced reasoning. So far, that hasn’t happened. Instead, a drop-down menu in ChatGPT prompts users to decide whether they want to use GPT-4o (“great for most questions”) or o1 (“uses advanced reasoning”).

Staff Feedback

Some staffers claim that while chat brings in the lion’s share of OpenAI’s revenue, o1 gets more attention—and computing resources—from leadership. “Leadership doesn’t care about chat,” says a former employee who worked on (you guessed it) chat. “Everyone wants to work on o1 because it’s sexy, but the code base wasn’t built for experimentation, so there’s no momentum.” The former employee asked to remain anonymous, citing a nondisclosure agreement.

Conclusion

OpenAI is preparing to launch a new model, o3-mini, in response to DeepSeek’s success. The company is under pressure to become more efficient and competitive in the AI market. The ongoing power struggle between research and product groups may hinder the company’s ability to adapt to the changing landscape.

FAQs

Q: What is DeepSeek’s new model?
A: DeepSeek’s new model, R1, is an open-weight model that has o1 level reasoning with 4o-level speed.

Q: What is OpenAI’s response to DeepSeek’s success?
A: OpenAI is launching a new model, o3-mini, ahead of its originally planned schedule. The model will debut in both API and chat.

Q: What is the ongoing power struggle within OpenAI?
A: There is an ongoing power struggle between the research and product groups within OpenAI, which may hinder the company’s ability to adapt to the changing landscape.

Q: Why is OpenAI launching o3-mini?
A: OpenAI is launching o3-mini to crush DeepSeek and become more efficient and competitive in the AI market.

Latest stories

Read More

LEAVE A REPLY

Please enter your comment!
Please enter your name here