This post was originally published on here
Follow ZDNET: Add us as a preferred source on Google.
ZDNET’s key takeaways
- Moonshot AI pushes autonomous coding to new limits.
- AI designs and builds full-stack apps from prompts.
- Persistent agents run for days, handling real operations.
Yesterday, Moonshot AI announced Kimi K2.6, the latest version of its open-source AI model. This release has enhanced coding capabilities, long multi-step operation execution, and agent swarm capabilities (which doesn’t sound terrifying at all).
Also: The best free AI for coding – only 3 make the cut now
The company is doubling down on what it calls a “seamless AI coworker experience,” based on a reinterpretation of the OpenClaw AI assistant approach to automated AI processing for complex, real-world workflows.
Improvements in long-horizon coding performance
At the core of the Kimi K2.6 release is a substantial improvement in long-horizon coding performance. Long-horizon coding is another way of saying that the AI can do a very long series of steps without human oversight.
Think of the difference between short-horizon and long-horizon as analogous to the difference between having an employee you have to check on every 15 minutes, and an employee to whom you can just give an assignment and know that what you need will be on your desk tomorrow morning without fuss or hassle.
Also: 7 AI coding techniques I use to ship real, reliable products – fast
Moonshot uses a SysY compiler project as an example of a long-horizon assignment. SysY is a minimalist C-like language used for teaching compiler design to students. Kimi K2.6 designed and built a full SysY compiler from scratch in 10 hours, passing 140 functional tests without human input. It says this work is the equivalent of having four engineers working for two months.
Without a doubt, this is a considerable accomplishment. But Moonshot is not alone in using AI to build compilers. Anthropic reported in February that it built a full C compiler (not just a cut-down training wheels version) using its Opus 4.6 model.
The Anthropic project did fairly well, but it did run into a snag when the agents hit the complex task of compiling the Linux kernel, causing them to get stuck on the same bugs, overwrite each other’s work, and break existing functionality as new features were added.
I’m guessing that the choice of SysY on the part of the Kimi developers was to keep the overall complexity down, and that this new model would probably hit a similar set of snags to those Anthropic encountered.
Moonshot says that the K2.6 model demonstrates strong generalization (meaning it’s able to handle new and unexpected situations across languages including Rust, Go, and Python). It also reports that the new model demonstrates reliability across front-end, DevOps, and performance optimization tasks.
Expanding from coding into design and creation
Coding output isn’t Kimi K2.6’s only big trick. The model is capable of doing user interface design work and then producing coding output from that design. This enables non-coders to build full web applications from prompts, including the look and feel. It provides an assist to developers who may not have design expertise.
Also: I tried to save $1,200 by vibe coding for free – and quickly regretted it
Going back to the long-horizon claim discussed earlier, Moonshot demonstrated the full-scale project capability by building a series of websites. The company reported that Kimi K2.6, “Identified 30 restaurants in Los Angeles without official websites, then automatically generated high-converting landing pages for each. These pages include booking functionality, with all information seamlessly synchronized to their database.”
Agent swarms, proactive agents, and persistent execution
According to Moonshot AI founder Zhilin Yang, “By orchestrating 100 or even 1,000 sub-agents in parallel, we can accomplish complex tasks within a timeframe that is tolerable for the real world.” It calls this “agent swarms.”
I don’t know. I’ve probably seen Terminator too many times, but while I can see the practical benefit, the very idea of swarms of AI agents is freaky as heck.
The company reports, “It seamlessly coordinates heterogeneous agents to combine complementary skills and broad search capabilities layered with deep research, plus large-scale document analysis fused with long-form writing, and multi-format content generation executed in parallel.”
It says that, “This compositional intelligence enables the swarm to deliver end-to-end outputs spanning documents, websites, slides, and spreadsheets within a single autonomous run.”
The Kimi K2.6 model now supports autonomous agents operating continuously across applications and workflows. This release also improves API interpretation, long-running stability, and safety awareness.
The company demonstrated a K2.6-backed agent that, “Operated autonomously for 5 days, managing monitoring, incident response, and system operations, demonstrating persistent context, multi-threaded task handling, and full-cycle execution from alert to resolution.”
Also: AI agents are fast, loose, and out of control, MIT study finds
Another capability added to Kimi K2.6 is what the company calls “Claw Groups,” enabling multiple OpenClaw-style agents running across devices to collaborate with a shared context. There is a central coordinator that dynamically assigns tasks and resolves failures.
Moonshot AI says this all becomes a form of collective intelligence. It says, “We are moving beyond simply asking AI a question or assigning AI a task, and entering a phase where human and AI collaborate as genuine partners–combining strengths to solve problems collectively.”
As long as the agents don’t go and invent time travel, we’re probably safe. For now.
Would you feel comfortable letting an AI agent run continuously for days, managing systems on your behalf? Let us know in the comments below.
You can follow my day-to-day project updates on social media. Be sure to subscribe to my weekly update newsletter, and follow me on Twitter/X at @DavidGewirtz, on Facebook at Facebook.com/DavidGewirtz, on Instagram at Instagram.com/DavidGewirtz, on Bluesky at @DavidGewirtz.com, and on YouTube at YouTube.com/DavidGewirtzTV.
Artificial Intelligence







