675B total / 41B active (MoE)
Apache 2.0
First open-weight frontier model with unified multimodal (text + image) and multilingual capabilities; granular MoE architecture; 256K-token context window; excels at long-document understanding, agentic workflows, coding, and multilingual processing; trained on 3,000 H200 GPUs; ranked #2 among open-source non-reasoning models on LMArena.