2026-06-22 08:52 - CodeWithLLM

Chinese AI services continue to gradually catch up with their US counterparts.

TRAE Solo is now Work
https://solo.trae.cn/
https://docs.trae.ai/solo/what-is-trae-solo?_lang=en
ByteDance has renamed its "Trae Solo" tool to Trae Work, highlighting a shift in positioning: from a simple developer assistant to a fully autonomous "AI employee" for various tasks (data scraping, content creation, web research, etc.). Code remains as a separate tab, and a GitHub connector is available. The interface resembles the Codex app, featuring Skills and an MCP with a catalog. The tool is accessible via web, desktop, and mobile. By default, "Privacy Mode" is disabled for new accounts, so users need to enable it manually.

The Capable GLM-5.2
https://docs.z.ai/guides/llm/glm-5.2
https://artificialanalysis.ai/articles/glm-5-2-is-the-new-leading-open-weights-model-on-the-artificial-analysis-intelligence-index
Zhipu AI has released GLM-5.2 — a 753B parameter Mixture-of-Experts (MoE) model under the MIT license, which significantly improves upon GLM-5.1. The context window has been expanded to 1M tokens (compared to 200k in its predecessor).

https://www.youtube.com/watch?v=nODxez6nZEU

The model ranked first among open-source models in the Artificial Analysis Intelligence Index (v4.1) with a score of 51, demonstrating coding skills on par with the proprietary Claude Opus 4.8. While it tends to get confused more easily and consumes more tokens overall, it still delivers results.

Discussion
https://news.ycombinator.com/item?id=48567759
On Hacker News, the model is praised for its price-to-performance ratio in long-running development cycles. However, users note that the "Max" reasoning mode is extremely slow and highly token-intensive. Due to its large size (753B), running it locally on standard MacBook Pros is not possible, but users can rent GPU cloud instances or access it via https://openrouter.ai/z-ai/glm-5-2#providers.

Current top coding models on OpenRouter by usage volume (token count):

MiMo-V2.5 (by xiaomi) — the clear leader with 4.59T (trillion) tokens, representing 22.5% of the total market share.
MiniMax M3 (by minimax) — holds second place with 2.45T tokens (12.0%).
Hy3 preview (by tencent) — third place with 1.43T tokens (7.0%).
Claude Opus 4.7 (by anthropic) — fourth place, accounting for 1.17T tokens (5.7%).
DeepSeek V4 Pro (by deepseek) — closes the top five with 1.14T tokens (5.6%).
DeepSeek V4 Flash (by deepseek) — sixth place with 972B (billion) tokens (4.8%).
GLM 5.1 (by z-ai) — seventh place with 952B tokens (4.7%).
GLM 5.2 (by z-ai) — eighth place with 820B tokens (4.0%).

GLM-5.2 on OpenCode
https://dev.to/danielbergholz/testing-glm-52-on-opencode-im-impressed-1780
The article's author, Daniel Bergholz, tested the model in real-world development conditions by integrating GLM-5.2 via OpenRouter into the free coding agent OpenCode.

In a practical test on an actual Next.js project, the model was tasked with implementing an article search feature with a 300ms debounce without cluttering the browser history. GLM-5.2 proved to be a somewhat slow but highly deliberate model: during the planning phase, it analyzed the project structure without additional prompting, recognized the difference between server and client components, and logically justified using client-side rendering for this task. It wrote clean, working code on the first attempt ("one-shot") and demonstrated a rare "restraint" for AI assistants by not trying to overcomplicate the existing project structure.

The entire session, which included repository analysis, planning, coding, review, and the final fix, cost the author only $0.265 (less than 27 cents).

#trae #glm #top #opencode

2026

2025

2024