A New Wave of Excitement in AI
The recently released DeepSeek-R1 model family has brought a new wave of excitement to the AI community, allowing enthusiasts and developers to run state-of-the-art reasoning models with problem-solving, math, and code capabilities, all from the privacy of local PCs.
A New Class of Models That Reason
Reasoning models are a new class of large language models (LLMs) that spend more time on “thinking” and “reflecting” to work through complex problems, while describing the steps required to solve a task.
The Fundamental Principle
The underlying idea is that hard problems become more tractable with deep thought, reasoning, and time, much as they do for humans. By spending more time, and thus more compute, on a problem, the LLM can yield better results. This approach is known as test-time scaling: the model dynamically allocates compute resources during inference to reason through a problem.
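One simple way to see why extra inference compute can help is to sample several independent answers and keep the most common one, a strategy often called self-consistency. The sketch below illustrates that general idea only; it is not how DeepSeek-R1 reasons internally. The `noisy_solver` function is a hypothetical stand-in for a model call, and the 60% per-sample accuracy is an assumed figure chosen for illustration.

```python
import random
from collections import Counter

CORRECT_ANSWER = 42  # assumed ground truth for the toy problem


def noisy_solver(rng, p_correct=0.6):
    """Hypothetical stand-in for one sampled reasoning chain from a model."""
    if rng.random() < p_correct:
        return CORRECT_ANSWER
    # A wrong chain lands on one of several scattered wrong answers.
    return CORRECT_ANSWER + rng.choice([-3, -2, -1, 1, 2, 3])


def majority_vote(num_samples, rng):
    """Spend more compute: sample several answers, return the most common."""
    answers = [noisy_solver(rng) for _ in range(num_samples)]
    return Counter(answers).most_common(1)[0][0]


def accuracy(num_samples, trials=2000, seed=0):
    """Fraction of trials in which the voted answer is correct."""
    rng = random.Random(seed)
    hits = sum(majority_vote(num_samples, rng) == CORRECT_ANSWER
               for _ in range(trials))
    return hits / trials


if __name__ == "__main__":
    for n in (1, 5, 25):
        print(f"samples per question: {n:2d}  accuracy: {accuracy(n):.3f}")
```

In this toy setup, accuracy rises as the number of samples per question grows, at the cost of proportionally more compute per question.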
The DeepSeek Difference
The DeepSeek-R1 family of distilled models is based on a large 671-billion-parameter mixture-of-experts (MoE) model. MoE models consist of multiple smaller expert networks, only some of which are activated for a given input. DeepSeek models further divide the work and assign subtasks to smaller sets of experts.
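The routing idea behind MoE can be sketched in a few lines. The example below is a minimal illustration of top-k gating, assuming four toy "experts" that are plain functions and router scores that are hard-coded; in a real model the experts are neural network layers and a learned router produces the scores for each token. The expert count, the value of `k`, and all names here are hypothetical and do not reflect DeepSeek's actual configuration.

```python
import math


def softmax(scores):
    """Convert raw router scores into probabilities."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route_top_k(router_scores, k):
    """Pick the k highest-scoring experts; renormalise weights to sum to 1."""
    probs = softmax(router_scores)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return [(i, probs[i] / norm) for i in top]


def moe_forward(x, experts, router_scores, k=2):
    """Run only the selected experts and mix their outputs by gate weight."""
    chosen = route_top_k(router_scores, k)
    return sum(weight * experts[i](x) for i, weight in chosen)


# Toy experts (assumption: real experts are feed-forward network layers).
experts = [
    lambda x: x + 1,
    lambda x: 2 * x,
    lambda x: x * x,
    lambda x: -x,
]

if __name__ == "__main__":
    scores = [0.1, 2.0, 1.0, -1.0]  # hard-coded stand-in for a learned router
    print(route_top_k(scores, k=2))
    print(moe_forward(3.0, experts, scores, k=2))
```

Because only `k` of the experts run for each input, the compute per input is much lower than the total parameter count would suggest.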
Distillation Technique
DeepSeek employed a technique called distillation to build a family of six smaller student models, ranging from 1.5 billion to 70 billion parameters, from the large 671-billion-parameter DeepSeek-R1 model. The reasoning capabilities of the larger model were taught to the smaller Llama and Qwen student models, resulting in powerful, smaller reasoning models that run locally on RTX AI PCs with fast performance.
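To make the teacher-student idea concrete, here is a minimal sketch of one classic distillation objective: the student is penalised for diverging from the teacher's temperature-softened output distribution. The article does not say which training objective DeepSeek used, so this should be read as an illustration of distillation in general rather than of DeepSeek's recipe; another common approach is to fine-tune the student directly on text generated by the teacher. The logits and the temperature value are made up for the example.

```python
import math


def softmax(logits, temperature=1.0):
    """Softmax with temperature; higher temperature gives a softer distribution."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) on softened distributions, scaled by T^2."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return temperature ** 2 * kl


if __name__ == "__main__":
    teacher = [4.0, 1.0, 0.0]       # made-up teacher logits over 3 tokens
    near_student = [3.5, 1.2, 0.1]  # roughly agrees with the teacher
    far_student = [0.0, 1.0, 4.0]   # disagrees with the teacher
    print(distillation_loss(teacher, near_student))
    print(distillation_loss(teacher, far_student))
```

The loss is zero when the student reproduces the teacher's distribution exactly and grows as the two diverge, which is the signal that pulls the student toward the teacher's behavior during training.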
Peak Performance on RTX
Inference speed is critical for this new class of reasoning models. GeForce RTX 50 Series GPUs, built with dedicated fifth-generation Tensor Cores, are based on the same NVIDIA Blackwell GPU architecture that fuels world-leading AI innovation in the data center. RTX fully accelerates DeepSeek, offering maximum inference performance on PCs.
Throughput performance of the DeepSeek-R1 distilled family of models across GPUs on the PC.
Experience DeepSeek on RTX in Popular Tools
NVIDIA’s RTX AI platform offers the broadest selection of AI tools, software development kits, and models, opening access to the capabilities of DeepSeek-R1 on over 100 million NVIDIA RTX AI PCs worldwide, including those powered by GeForce RTX 50 Series GPUs.
Benefits of RTX AI PCs
High-performance RTX GPUs make AI capabilities always available — even without an internet connection — and offer low latency and increased privacy because users don’t have to upload sensitive materials or expose their queries to an online service.
Conclusion
The DeepSeek-R1 model family has opened up new possibilities for AI enthusiasts and developers to run state-of-the-art reasoning models on their local PCs. With the power of RTX AI PCs, users can experience the capabilities of DeepSeek-R1 in popular tools, enjoying fast performance, low latency, and increased privacy.
FAQs
Q: What is the DeepSeek-R1 model family?
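A: The DeepSeek-R1 model family is a set of reasoning models: a large 671-billion-parameter mixture-of-experts model and six smaller distilled models, based on Llama and Qwen, ranging from 1.5 billion to 70 billion parameters. Like other reasoning models, they spend more time on “thinking” and “reflecting” to work through complex problems, while describing the steps required to solve a task.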
Q: What is the advantage of using RTX AI PCs with DeepSeek-R1?
A: RTX AI PCs offer fast performance, low latency, and increased privacy when running DeepSeek-R1 models, making them ideal for AI enthusiasts and developers who want to work with complex problems without relying on online services.
Q: Can I use DeepSeek-R1 models with other AI tools and software?
A: Yes, NVIDIA’s RTX AI platform offers the broadest selection of AI tools, software development kits, and models, allowing users to experience the capabilities of DeepSeek-R1 in popular tools and software.
Q: How do I get started with DeepSeek-R1 and RTX AI PCs?
A: Users can get started with DeepSeek-R1 and RTX AI PCs by visiting the NVIDIA website and exploring the available software, tools, and models. They can also join online communities and forums to learn more about the capabilities and applications of DeepSeek-R1 and RTX AI PCs.