Rival OpenAI’s o1-preview in 19 hours

Open-Source Approaches to Artificial Intelligence: A New Frontier

Open-source approaches continue to show promise in democratizing artificial intelligence (AI).

NovaSky’s Sky-T1-32B-Preview

On Friday, the NovaSky research team at UC Berkeley released a new reasoning model, Sky-T1-32B-Preview, that performs comparably to OpenAI’s o1-preview — only it’s open source and was built in just 19 hours for under $450 using eight Nvidia H100 GPUs.

The Best of Both Worlds: Open-Source AI Models

The team developed Sky-T1 by fine-tuning Alibaba’s Qwen2.5-32-Instruct and trained it on data generated with QwQ-32B-Preview, another open-source model comparable to o1-preview. Using synthetic training data can help lower costs.

Data Preparation: The Key to Success

"We curate the data mixture to cover diverse domains that require reasoning, and a reject sampling procedure to improve the data quality. We then rewrite QwQ traces with GPT-4o-mini into a well-formatted version, inspired by Still-2, to improve data quality and ease parsing," the team says of their data preparation process in the blog.

Outperforming OpenAI’s o1-Preview

The model performed at or above o1-preview’s level on math and coding benchmarks but did not surpass o1 on the graduate-level benchmark GPQA-Diamond, which includes more advanced physics-related questions. NovaSky open-sourced all parts of the model, including weights, data, infrastructure, and technical details.

A More Affordable Reasoning Model

The relatively short 19-hour training time means Sky-T1 cost just $450 to build, according to Lambda Cloud pricing, the team clarifies in the blog post. Considering GPT-4 used a suspected $78 million in compute, it is no small feat to present an example of a more affordable reasoning model that can be replicated by academic and open-source groups that lack OpenAI’s funding.

Conclusion

The continued development of open-source AI models like Sky-T1-32B-Preview holds great promise for democratizing AI and creating a more even playing field for smaller labs, nonprofits, and other entities to develop competitive models. As the field of AI continues to evolve, it will be exciting to see how these open-source models can be used to drive innovation and progress.

FAQs

Q: What is the significance of Sky-T1-32B-Preview?
A: Sky-T1-32B-Preview is an open-source reasoning model that performs comparably to OpenAI’s o1-preview, built in just 19 hours for under $450 using eight Nvidia H100 GPUs.

Q: How was Sky-T1-32B-Preview developed?
A: The team developed Sky-T1 by fine-tuning Alibaba’s Qwen2.5-32-Instruct and trained it on data generated with QwQ-32B-Preview, another open-source model comparable to o1-preview.

Q: Is Sky-T1-32B-Preview available for use?
A: Yes, NovaSky open-sourced all parts of the model, including weights, data, infrastructure, and technical details.

Q: What are the potential applications of Sky-T1-32B-Preview?
A: The potential applications of Sky-T1-32B-Preview are vast, including but not limited to, natural language processing, computer vision, and robotics.

Post Views: 29

Rival OpenAI’s o1-preview in 19 hours

Generate single title from this title Why AI insurance underwriting is finally attracting institutional capital in 100 -150 characters. And it must return only...

Generate single title from this title A New AI Model Could Help Scientists Design New Forms of Life in 100 -150 characters. And it...

Generate single title from this title Train CodeFu-7B with veRL and Ray on Amazon SageMaker Training jobs in 100 -150 characters. And it must...

Generate single title from this title Nearly half of high school students now use AI in college search in 100 -150 characters. And it...

Engineering confidence to navigate uncertainty | MIT News

Generate single title from this title Why AI insurance underwriting is finally attracting institutional capital in 100 -150 characters. And it must return only...

Generate single title from this title A New AI Model Could Help Scientists Design New Forms of Life in 100 -150 characters. And it...

Generate single title from this title Train CodeFu-7B with veRL and Ray on Amazon SageMaker Training jobs in 100 -150 characters. And it must...

Generate single title from this title Nearly half of high school students now use AI in college search in 100 -150 characters. And it...

Engineering confidence to navigate uncertainty | MIT News

Generate single title from this title Best of MWC 2026: Live updates on phones, concepts, and robots we’re seeing in 100 -150 characters. And...

Featured video: Coding for underwater robotics | MIT News

Generate single title from this title Upgrading agentic AI for finance workflows in 100 -150 characters. And it must return only title i dont...

LEAVE A REPLY Cancel reply

Latest

Generate single title from this title Why AI insurance underwriting is finally attracting institutional capital in 100 -150 characters. And it must return only...

Generate single title from this title A New AI Model Could Help Scientists Design New Forms of Life in 100 -150 characters. And it...

Generate single title from this title Train CodeFu-7B with veRL and Ray on Amazon SageMaker Training jobs in 100 -150 characters. And it must...

Categories

Useful Links

Our Newsletter