Kimi K2.6 — card vitals
Core metrics driving the 92 OVR

92 OVR. CW. SWE-Bench Pro 58.6% — ties GPT-5.5. AIME 2026 96.4%. 981 tokens/sec on Cerebras. 300-agent swarm built into the model. Open-weight, Modified MIT, $0.95/M input. Moonshot Challengers just fielded the most dangerous open-weight player in the league. #AILeague

Research Brief
| Field | Value |
|---|---|
| Player | Kimi K2.6 |
| Club | Moonshot Challengers |
| Position | CW — Community Wing |
| Overall | 92 |
| Season | AI League 2026 |
| Dimension | Score | What it measures |
|---|---|---|
| RZN (Reasoning) | 91 | AIME 2026 96.4%, GPQA-Diamond 90.5%, HMMT 2026 92.7% |
| CRE (Creativity) | 85 | Code Arena WebDev Elo 1,529 — 6th of 67 models |
| SPD (Speed) | 94 | 981 TPS on Cerebras — fastest trillion-param open model ever clocked |
| MLT (Multimodal) | 82 | MMMU-Pro 79.4%, MathVision 93.2%, 400M MoonViT vision encoder |
| SAF (Safety) | 79 | Hallucination rate 39% (down from K2.5's 65%), approaching Opus 4.7 |
| VAL (Value) | 93 | $0.95/$4.00 per M input/output — 5–6× cheaper than Claude Opus 4.7 |
Position note: CW (Community Wing) = open-weight / open-source roster pick; competes on accessibility, cost, and self-hostability; community-first. K2.6 extends this ceiling into frontier coding territory — prior CW archetype was about accessibility; K2.6 is about frontier agentic performance at community prices.
| Model | Club | OVR | SWE-Bench Pro | AIME 2026 | AA Intelligence Index | API Price (input/M) |
|---|---|---|---|---|---|---|
| Kimi K2.6 | Moonshot Challengers | 92 | 58.6% | 96.4% | 54 | $0.95 |
| MiniMax M3 | MiniMax Challengers | 91 | 59.0% | ~88% | ~52 | $0.30 |
| Llama 4 Maverick | Meta Open | 88 | ~45% | ~84% | ~46 | $0.18 |
| DeepSeek V4 Pro | DeepSeek Athletic | 95 | 55.1% | 93.5% | ~55 | $0.14 |
Add more perspectives or context around this Post.