Kimi K2.6 launches with Baseten as day-0 partner

Moonshot AI@Kimi_MoonshotApr 20, 2026

We are excited to have @baseten as a day 0 launch partner for Kimi K2.6! Their inference stack brings KV-aware routing, NVFP4 on Blackwell, multi-modal hierarchical caching, and prefill-decode disaggregation, so K2.6 runs the way it's meant to in production. Try it out at: https://t.co/yEiflsudlh

Views100.0k
Comments13
Reposts42
Likes924
Launched Apr 20, 2026View post

More from Moonshot AI

1

Introducing 𝑨𝒕𝒕𝒆𝒏𝒕𝒊𝒐𝒏 𝑹𝒆𝒔𝒊𝒅𝒖𝒂𝒍𝒔: Rethinking depth-wise aggregation. Residual connections have long relied on fixed, uniform accumulation. Inspired by the duality of time and depth, we introduce Attention Residuals, replacing standard depth-wise recurrence with https://t.co/gcWyzhZVc0

Mar 15, 2026
Views5.07M
Comments334
Reposts2.04k
View post
2

🌘 Kimi-K2.7-Code, our latest coding model, is now released and open-sourced! 🔷 Improved coding & agent performance over K2.6: +21.8% on Kimi Code Bench v2, +11.0% on Program Bench, and +31.5% on MLS Bench Lite. 🔷 Reasoning efficiency: Less overthinking, with 30% lower https://t.co/jFS7I40avs

Jun 12, 2026
Views2.55M
Comments646
Reposts1.75k
View post
3

🌘 Meet Kimi K2.7 Code HighSpeed! A high-speed mode of our latest open-source multimodal coding model, Kimi K2.7 Code. ⚡️ Up to 6× faster: Around 180 tok/s on coding tasks with median-length inputs, and up to 260 tok/s on shorter-context tasks. 🔷 Rolling out to Kimi Code Beta https://t.co/syOOgIdtI4

Jun 15, 2026
Views519.4k
Comments166
Reposts326
View post
4

Introducing Kimi Code, an open-source coding agent under the Apache 2.0 License. 🔹 Python-based, easy to extend. 🔹 Fully transparent — clear, safe, reliable. 🔹 Seamlessly integrates with VS Code, Cursor, JetBrains, Zed, and more. 🔹 Fully-featured & out-of-the-box ready. https://t.co/HtEvvH3amI

Jan 26, 2026
Views258.1k
Comments61
Reposts212
View post

See all Moonshot AI launches →