671B total / 37B active (MoE)
MIT
Reasoning model trained with large-scale RL after a cold-start fine-tuning stage; excels at math, code, and complex problem-solving; supports JSON output and function calling.
https://arxiv.org/abs/2501.12948