Introducing M2.5, an open-source frontier model designed for real-world productivity. SOTA performance at coding (SWE-Bench Verified 80.2%), search (BrowseComp 76.3%), agentic tool-calling (BFCL 76.8%) & office work. Optimized for efficient execution, 37% faster at complex tasks. At $1 per hour with 100 tps, infinite scaling of long-horizon agents now economically possible.
Big news for open models: MiniMax-M2.5 is out with SOTA performance at coding (SWE-Bench Verified 80.2%). The first open model to beat Sonnet. Only @Claude by Anthropic's Opus and @OpenAI 's GPT-5.2 Codex score higher.
Paths between open and proprietary models are converging...
Pro tip: If you want to quickly experiment with it, @MiniMax-M2.5 is free for a week on @Kilo Code (until Thursday, Feb 19).
looks great! This is something that seems like it would pair well with ClawdBot agents...
80%+ on SWE-Bench Verified for an open model is wild — especially if it’s actually usable in real workflows and not just benchmark-flexing. Curious how it holds up on messy, legacy codebases vs clean benchmark repos?
Big news for open models: MiniMax-M2.5 is out with SOTA performance at coding (SWE-Bench Verified 80.2%). The first open model to beat Sonnet. Only @Claude by Anthropic's Opus and @OpenAI 's GPT-5.2 Codex score higher.
Paths between open and proprietary models are converging...
Pro tip: If you want to quickly experiment with it, @MiniMax-M2.5 is free for a week on @Kilo Code (until Thursday, Feb 19).
OSS ftw!