CodeWithLLM-Updates
-

A few interesting updates for May. Amidst the news about xAI, Anthropic also surprised many by announcing a partnership with SpaceX on May 6 to expand their computing power.

Anthropic Discounts and Transition to New Pricing
https://www.anthropic.com/news/higher-limits-spacex
Anthropic announced a temporary "spring discount" on their API models. They also stopped blocking OpenClaw-style usage. However, this is likely an attempt to smooth things over before big changes: the company is increasingly hinting at a revision of the classic "fixed subscription — unlimited chat" model. Instead of "per token" payment, Compute-based pricing is being introduced. The cost of a request will depend on how many computing resources the model spent on "reasoning."

Claude Code Updates
https://code.claude.com/docs/en/whats-new#week-18
On Windows, Claude Code finally no longer requires Git Bash; if it's missing, the tool now natively uses PowerShell.

Cloud functions. Public access (research preview) was opened for a new command /ultrareview, which spins up several autonomous AI agents in the cloud to check the repository for vulnerabilities and bugs in parallel. Before this, they also launched the /ultraplan command — a large planning task is pushed to Anthropic servers, where an isolated virtual machine is spun up for it (4 CPU cores, 16 GB RAM, with Node.js, Python, Rust, Docker, etc., pre-installed), eventually providing a link to a web interface with the results.

Managing OpenAI Codex from Mobile
https://openai.com/news/codex-mobile-app/
Responding to a similar feature in Claude Code, OpenAI released an update for Codex that allows managing AI agents from a smartphone. Now developers don't have to be near a laptop: they can approve pull requests, run testing pipelines, resolve merge conflicts, or give prompts to fix small bugs on the go. The interface is fully optimized for voice and quick commands — essentially, it's a pocket remote for the agent on your computer.

Gemma Models in Gemini CLI
https://geminicli.com/docs/changelogs/
The Gemini CLI terminal client update (v0.40.0) added experimental integration for local Gemma models. Version v0.41.0 added support for Gemma 4 models (experimental). While intelligent Model Routing and full offline agent execution are not yet available, the team is already preparing full local task execution.

Memory handling was also improved. Tiered Memory allows the agent to store context directly in Markdown files across four levels: from global developer styles (in ~/.gemini/GEMINI.md) to specific project directory rules. A new Auto Memory feature background-analyzes old sessions, finds successful solutions, and suggests saving them as reusable skills in SKILL.md. Auto Memory Inbox (from v0.42) is a system that automatically collects, classifies, and surfaces important pieces of information for an AI assistant’s long-term memory.

Voice mode has also been improved.