111B (dense)
CC-BY-NC
Enterprise-optimized model excelling at tool use, RAG, and agentic tasks; 256K context; 150% higher throughput than Command R+; competitive with GPT-4o and DeepSeek V3; requires only 2 GPUs; strong multilingual support (23 languages).