
DeepSeek R1

DeepSeek AI (China), January 20, 2025

Parameters

671B total / 37B activated per token (MoE)
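
To make the two figures concrete, here is a minimal sketch that reads them as total parameters and parameters activated per token in the mixture-of-experts design. The function names and the bytes-per-parameter values are illustrative assumptions, not part of any DeepSeek tooling, and the memory figure covers weights only (no KV cache or activations).

```python
# Parameter counts from the card, in billions.
TOTAL_PARAMS_B = 671
ACTIVE_PARAMS_B = 37


def active_fraction(total_b: float, active_b: float) -> float:
    """Share of the model's parameters used for any single token."""
    if total_b <= 0 or not 0 <= active_b <= total_b:
        raise ValueError("expected 0 <= active_b <= total_b and total_b > 0")
    return active_b / total_b


def weight_memory_gb(params_b: float, bytes_per_param: float) -> float:
    """Rough weight storage in GB (1 GB = 1e9 bytes); weights only."""
    # params_b * 1e9 parameters * bytes_per_param / 1e9 bytes per GB
    return params_b * bytes_per_param


if __name__ == "__main__":
    share = active_fraction(TOTAL_PARAMS_B, ACTIVE_PARAMS_B)
    print(f"Active share per token: {share:.1%}")
    # Assumed precisions: 1 byte (8-bit) and 2 bytes (16-bit) per parameter.
    for label, width in (("8-bit", 1), ("16-bit", 2)):
        print(f"{label} weights: ~{weight_memory_gb(TOTAL_PARAMS_B, width):.0f} GB")
```

The point of the split is that all 671B parameters must be stored, while per-token compute scales with the 37B that are activated.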

License

MIT

Key Features

Reasoning model trained with large-scale reinforcement learning (RL) after a cold-start fine-tuning stage; strong in math, code, and complex problem-solving; supports JSON output and function calling.

Paper / Source

https://arxiv.org/abs/2501.12948