Two open-weights coding models landed in the same week. Moonshot AI released Kimi K2.7-Code on Hugging Face as the coding-focused agentic build of the Kimi K2.6 base, and Xiaomi shipped MiMo Code as a separate open-weights drop. For TypeScript and JavaScript developers who self-host their dev tools, this widens the practical menu of models that can sit behind Continue, Aider, OpenCode, or any other tool with a local-model backend.
Kimi K2.7-Code
Kimi K2.7-Code is a 1T-parameter Mixture-of-Experts model with 32B parameters activated per token, 61 layers (one dense), 7168-dim attention hidden state, 2048-dim MoE hidden state per expert, 64 attention heads, 384 experts with 8 selected per token, and one shared expert. The vocabulary is 160K and the context length is 256K. Attention is Multi-head Latent Attention (MLA), activation is SwiGLU, and the model ships with a 400M-parameter MoonViT vision encoder, although the coding variant is text-only at inference time.
The headline metric from Moonshot is a roughly 30% drop in thinking-token usage versus K2.6 on the same long-horizon coding tasks, paired with stronger end-to-end completion on multi-step software engineering work. The 256K context is the practical story for tasks that span a large TypeScript codebase: the model can hold a meaningful slice of a real repo plus the issue, the test plan, and the prior conversation.
The architecture implies a few things for self-hosting. At 1T parameters, K2.7-Code is not a single-GPU model. Multi-GPU is the floor, and a hosted inference partner is the realistic production story. The agentic scaffolding (tool use, planning, error recovery) is the bottleneck, not the weights themselves. Expect the gap between the raw model and a Claude Fable 5-class agent to be much smaller on raw coding benchmarks than on long-horizon agentic evaluations.
Xiaomi MiMo Code
Xiaomi's MiMo Code is the second open-weights coding model release of the week, and the more accessible of the two on consumer hardware. Xiaomi has not yet published a model card with the same level of architectural detail as Moonshot; the public material positions MiMo Code as a coding-focused drop, with open weights and the same general agentic workload framing.
The relevance is symbolic and practical at the same time. Two open-weights coding models in a single week from two separate Chinese vendors is a strong signal that the open-weights frontier is no longer a single-vendor story. For self-hosted dev tools, the menu is now wide enough to compare architectures, not just parameter counts.
What this means for self-hosted dev tools
The practical question for a TypeScript team running a local coding agent is whether the open-weights alternative is good enough for your workload. The answer depends on what the agent has to do.
For code completion, single-file edits, and tightly-scoped refactors, the open-weights ecosystem has had workable options for over a year. For agentic tasks that require multi-file reasoning, long-context retention, and reliable tool use across many turns, the gap to the closed-weight frontier (Claude Fable 5, GPT-class, Gemini-class) is still real, but it is narrower than it was six months ago. The Claude Code rise in 2026 and the March 2026 dev-tool rankings were written when the open-weights menu was a single line item. After this week, it is at least a short list, and the closed-weight vendors will need to keep shipping capability lifts to stay ahead.
For tooling: Continue, Aider, OpenCode, and Cline with a local backend can all load a Hugging Face checkpoint today. Kimi K2.7-Code at 1T parameters wants multi-GPU; MiMo Code is the candidate for laptop inference. The architectural numbers suggest K2.7-Code has the headroom for serious work, but the practical test is whether your agentic scaffold can make use of it. The same evaluation will apply to the WebStorm 2026.1 service-powered TS engine and any other editor that surfaces a local model picker.



