08 Nov 2025
Moonshot AI, an Alibaba-backed startup, has released Kimi K2 Thinking, an open-source reasoning model that the newsletter reports matches or beats leading closed models on a range of agentic benchmarks. According to the scraped coverage, Kimi outperformed GPT-5 and Claude 4.5 Sonnet on several agentic tests and hit a new top score of 44.9% on the benchmark called Humanity’s Last Exam. The model also showed notable coding improvements over Moonshot’s version from four months prior, while still trailing the very top coding specialists.
K2 Thinking reportedly chains 200–300 autonomous tool calls to complete tasks and performs strongly on creative writing. Moonshot says the model cost under $5M to train and will be priced well below current frontier offerings — positioning it as a lower-cost, open-source alternative to major closed models. The newsletter frames the release as evidence for Jensen Huang’s recent comment that China is “nanoseconds” behind U.S. AI efforts, suggesting the open-source and Chinese lab landscape is closing in on frontier capabilities.
Source