CodeWithLLM-Updates
-

Microsoft presented a series of changes at its May Build 2026, shifting from simple AI assistance toward autonomous agents.

Proprietary MAI Models
https://microsoft.ai/news/building-a-hillclimbing-machine-launching-seven-new-mai-models/
They introduced a new family of MAI (Microsoft AI) models, totaling 7, including MAI-Code-1-Flash and MAI-Thinking-1. Microsoft is effectively reducing its dependence on OpenAI. The company claims frontier-level results for autonomous tasks.

The MAI-Code-1-Flash model has 5B active out of 137B parameters (medium-sized) and a 2-million-token context window (extremely large), operating with record-low latency. Presentation slides look perfect: they claim it outperforms previous generations of the flagship GPT-4o. It is being positioned as the core engine for pipelines integrated into GitHub Copilot, VS Code, and other MS products.

Discussion
https://news.ycombinator.com/item?id=48374466
In a active Hacker News discussion, people note that open models like Qwen 3.6 (35B) or DeepSeek V4 Flash yield better results and run significantly faster, while GitHub Copilot's new strict token billing might make using MAI-Code-1-Flash economically unviable.

GitHub Copilot — Now a Separate App
https://github.blog/2026-06-02-github-copilot-app-the-agent-native-desktop-experience/
Copilot used to be a plugin in VS Code, then became part of it. Now, following Codex, Cursor, Zed, and others, Microsoft decided to create a standalone GitHub Copilot App — an "agent-native" desktop chat application.

This is a single control plane for agents working in parallel across isolated git trees, creating PRs, and debugging code. MS are moving away from the editor concept toward delegating entire workflows. It includes support for MCP servers.

https://www.youtube.com/watch?v=5Q5mLNYJ6Hw

Instead of getting lost on the GitHub website, the app features a convenient "On your radar" (or Inbox) tab where all your Pull Requests (PRs) and Issues from selected repositories are gathered. You can open any PR, view code changes (diff), leave comments, or approve it. Furthermore, you can tag @copilot directly in comments for fixes or explanations.

Users can create "quick chats" for general questions (even non-code related, like D&D games) or sessions tied to a specific repository. The app allows switching between different LLMs. For instance, the author uses Claude Opus 4.7 for code generation.

Agent Isolation: MXC (Microsoft Execution Containers)
https://www.microsoft.com/en-us/security/blog/2026/06/02/microsoft-build-2026-securing-code-agents-and-models/
Since agents now execute real code and interact with systems and infrastructure, Microsoft is implementing MXC at the Windows 11 kernel level. This is a new level of isolation and sandboxing specifically for AI applications. Windows is essentially turning into an "Agent Runtime" platform.

The demonstration showed OpenClaw attempting to delete all files from the desktop and failing due to these restrictions.

Chinese AI giant MiniMax has announced a new generation of M models: M3.

MiniMax M3
https://www.minimax.io/blog/minimax-m3
https://www.minimax.io/models/text/m3
MiniMax M3 is built with an emphasis on deep reasoning, coding, and autonomous pipelines. It accepts text+image+video as input and produces text as output. The model is specifically optimized for agentic workflows and complex, long-term tasks rather than simple chat interactions.

MiniMax Sparse Attention (MSA) is a new sparse attention mechanism that radically reduces computational costs for long contexts (approximately 1/20th of the previous generation). It supports up to 1M tokens, with a guaranteed minimum of 512K in the cheaper API version. Token Plans are available (starting at $20/month), with a highlighted $50/month option.

Benchmarks look impressive: SWE-Bench Pro ~59% and Terminal-Bench 2.1 ~66%. This is on par with GPT-5.5 and Gemini 3.1 Pro, trailing only Claude Opus 4.8.

There is no active discussion on Hacker News yet.

MiniMax Code Updates
https://code.minimax.io/
With the M3 update, MiniMax Code has also received a significant upgrade, fully utilizing the model's capabilities: long context, agentic skills, and native multimodality. The program can not only generate code but also create documents, PDFs, slides, tables, and icons. Thanks to multimodality, MiniMax Code supports "computer use" (controlling the computer).

The concept is built around delegation - you don't write code alongside the AI; you manage it. It uses a Producer + Verifier adversarial loop where agents constantly generate, reflect, check, and fix errors in real-time. There is a "Smart Authorize" option to avoid manual monitoring of every agent action.

https://www.youtube.com/watch?v=mBHFGeU18MI

There is native support for MCP servers to connect external databases and documentation. It features a skills marketplace and integration with bots in Telegram, WeChat, and Lark for managing agents from a phone. Autonomous operation for several days without human intervention is possible, as well as scheduled task execution.