Next up this week was a presentation from Anthropic. The event was called "Code w/ Claude", meaning they directly focused on programming.
Google, citing Cursor statistics, said that in recent months many people had switched to Gemini 2.5 Pro in it. It's interesting whether the update will bring people back to Sonnet (the model is already available in Cursor settings).
https://www.anthropic.com/news/claude-4
They presented version 4 of Opus and Sonnet. The models have all modern features - reasoning, web search, code execution, tool usage and MCP, local file editing (which Opus can use as memory). The query cache was expanded from seconds to an hour. The default context window (200k) is smaller, but for an additional fee, it reaches Gemini's 1 million tokens.
They say Opus can work for 7 hours as a background autonomous agent.
In light of the release of specialized models for background agent behavior, it's important to note that these new models are essentially that. Only they are not versions of other models, like OpenAI Codex from o3. Anthropic seems to have shifted focus, as they clearly lost the battle with ChatGPT-Gemini-Grok for the everyday consumer chat app market. So they simply released only such agentic models and focused on programming.
They also updated their Claude Code tool. It now supports background tasks via GitHub Actions and integrations with VS Code and JetBrains, reflecting changes directly in the IDE.
This means they also created a background agent that can be assigned tasks from repositories and then the results can be checked: "Mention Claude Code in pull requests to respond to reviewer comments, fix CI errors, or change code. To install, run /install-github-app
from Claude Code."
Interestingly, a GitHub representative spoke, and it seems their background agent on the site also runs on an Anthropic model, not OpenAI Codex as I initially thought at their announcement. In the free plan of GitHub Copilot, it's still only Claude 3.5 Sonnet, but the Pro plan added Claude 4.0 Sonnet (Preview). To use Opus, you need to be on a Pro+ subscription.
#claudecode #newllmmodel