CodeWithLLM-Updates
-

Claude Code Source Code
https://twitter.com/Fried_rice/status/2038894956459290963
https://news.ycombinator.com/item?id=47584540
On March 31, someone accidentally published a production build with a sourcemap file (~60 MB) to npm — and the entire Claude Code source code became publicly available. Some thought it was a brilliant April Fools' prank. A mention of a rollout window specifically for April 1–7 was even found in the code. Whether it was a joke or a real mistake is still being debated.

What exactly leaked (based on thread discussions):

  • Full Claude Code agent architecture (tool use, computer use, bash, file operations, etc.).
  • Permission system and "Bypass Permissions Mode" — a detailed description of how guardrails work.
  • Full Claude Code system prompt (including security rules and "cyber risk instructions").
  • Telemetry logic — what exactly is sent to Datadog (model, session ID, subscription type, whether the user is an Anthropic employee, etc.).
  • Internal infrastructure: WebSocket sessions, JWT for IDE integration, feature flags via GrowthBook, session-ingress, etc.
  • Hidden/unreleased features (many posts with "hidden features" breakdowns).
  • "Undercover Mode" subsystem — designed to prevent Claude from disclosing Anthropic's internal information and publishing production builds with sourcemap files.

Analysis by Alex Kim
https://alex000kim.com/posts/2026-03-31-claude-code-source-leak/
https://news.ycombinator.com/item?id=47586778
Anthropic specifically injects fake tools to poison attempts to copy Claude's behavior. There is server-side text summarization with a cryptographic signature. A special mode (undercover.ts) forces the model to hide mentions of internal names (Capybara, Tengu, Slack channels, "Claude Code," etc.). Rigid security for bash commands (23 checks against injections, zero-width characters, etc.). A prompt caching system with "sticky latches" and 14 invalidation vectors.

The autonomous agent KAIROS is mentioned with a /dream skill, daily logs, GitHub webhooks, and updates every 5 minutes. It looks like the next big step after the current Claude Code.

The most meme-worthy moment — userPromptKeywords.ts contains a large regex that catches phrases like: wtf, ffs, omfg, shit, dumbass, fuck you, this sucks, damn it, showing that the user is angry, and the model likely reacts differently (the author assumes this is for experience improvement or escalation).

The leak is dangerous not so much for the code itself, but for revealing the roadmap and internal protection mechanisms.

Visualization
https://ccunpacked.dev/ and https://ccleaks.com/
https://news.ycombinator.com/item?id=47597085
Especially useful for developers who want to understand how Anthropic builds agentic systems (tool calling, multi-agent, planning loop, bash security, etc.).

https://www.youtube.com/watch?v=LA3l81oEzJQ

Key findings — hidden features:

  • KAIROS: A constantly active background agent that works 24/7, monitors repositories, and fixes bugs on its own.
  • ULTRAPLAN: Deep planning for up to 30 minutes in the cloud for complex tasks.
  • BUDDY: A terminal-based Tamagotchi companion with 18 species and statistics.
  • DREAM: An automatic self-cleaning and memory consolidation system.

Analysis by Joe Fabisevich
https://build.ms/2026/4/1/the-claude-code-leak/
https://news.ycombinator.com/item?id=47609294
An indie developer, author of Plinky, writes not about the leak itself, but about what it says about modern development. Anthropic immediately started sending DMCA notices to GitHub (even for their own forks of skills and examples). And then clean-room implementations in Python and Rust appeared.

The discussion jokes about "Claude leaking itself": the classic hype about the model deciding to "open" itself.

Analysis by Han HELOIR YAN, Ph.D.
https://medium.com/@han.heloir/everyone-analyzed-claude-codes-features-nobody-analyzed-its-architecture-1173470ab622
The article is more technical and calm - it focuses not on meme features (like Buddy, Undercover Mode or frustration regex), but on the architecture of Claude Code as a full-fledged production-grade AI agent.

Anthropic's moat is not in the model itself (LLM), but in the harness (the wrapper, the system around the model). It is thanks to this harness that Claude Code feels significantly more powerful than competitors, even if the model is not always the best.

While Claude Code was the undisputed favorite last year, with many tutorials and side projects, I can't quite understand what is happening with the project in 2026. Judging by the decreasing number of YouTube videos, other people can't either.

In February–March, Anthropic announced and rolled out several features that made Claude Code much more autonomous (agentic). There is an active transition from "a single agent in the terminal" to a managed task system and coordination of background agents (Ctrl+B) with an ecosystem of hot-reloaded MCP integrations, skills, hooks, and plugins. Through /teleport, you can initialize /remote-control sessions that can be managed from a mobile app. /loop was introduced for periodic prompt/command execution, along with in-session cron scheduling tools, etc.

Of the truly useful additions, only Auto Mode is worth noting.

Auto Mode
https://claude.com/blog/auto-mode
Presented as a "middle ground" between two extremes in Claude Code. Previously, you either had to manually approve every file change and bash command (very secure but annoying) or use the --dangerously-skip-permissions flag. The new Auto Mode allows Claude to decide for itself which actions are safe and execute them automatically without approval.

Before each tool call, a separate classifier (based on Sonnet 4.6) quickly checks the action for danger. Safe actions proceed automatically; risky ones are blocked. If the model persistently insists on blocked actions, a user prompt eventually appears anyway.

Claude Mythos Announcement Discussion
https://news.ycombinator.com/item?id=47679258
Anthropic describes the personality, goals, and limitations of the new model in a system card. It is not being released publicly—allegedly due to a sharp jump in capabilities and security risks. They claim Mythos has found thousands of zero-day vulnerabilities in OSs, browsers, virtual machines, etc. (including very old bugs). Many write that this could significantly change cybersecurity—for better or worse.

https://red.anthropic.com/2026/mythos-preview/
They also announced Project Glasswing, providing Mythos access to a limited circle of companies to fix critical software using the model.


Recently, many people paying for subscriptions have found Claude Code becoming practically unusable due to recent changes in Anthropic's policies and restrictions without clear rules. Even just mentioning OpenClaw in the system prompt causes the request to be rejected with an error. The system has also become worse at handling non-coding tasks.

Most likely, due to the launch of the new model, they had to maximize the squeeze on all compute that was previously distributed just to attract people to the infrastructure.

For about three years, most programming applications were clones of VS Code with a side chat. A new wave seems to have been started by Codex — they released their desktop app on Electron without VSC, as did OpenCode.

Cursor 3
https://cursor.com/blog/cursor-3
The company has completely abandoned the VS Code fork model and built a new interface code-named Glass. The main innovation is the ground-up Agents Window, which allows running an unlimited number of agents simultaneously in parallel: locally, in worktree, via SSH, in the cloud, or even across multiple repositories at once. The new part is reportedly written in Rust+TS.

https://cursor.com/blog/agent-web
Later, they integrated mobile devices via PWA. Cursor Agents on web and mobile is an official way to run cloud agents directly from a phone or mobile browser. You can start a chat on your phone and continue on your desktop (or vice versa).

https://www.youtube.com/watch?v=HTKGyLar8AU

The phrase "Cursor 3 just killed the IDE" is repeated as the main hook.

Discussion
https://news.ycombinator.com/item?id=47618084
Many praise the boldness and technical progress of an agentic future, but even more express disappointment or even outrage that Cursor is radically moving away from the familiar "IDE + plugins + AI assistant" model. Critics argue the company is chasing investor hype that "AI will replace developers" rather than addressing programmers' real needs.

People who want to write code rather than manage a team of agents will have to look for something else, like VS Code or Zed.

App from The Factory
https://factory.ai/news/factory-desktop
Another company made a similar interface clone for "agent management." Interestingly, after installing on Windows 11, it tells me "Not connected to Local Machine. Please download and start the Desktop app, or upgrade to a paid plan to unlock more features," asking me to download their app. While their design is very cool, I couldn't even test their buggy Electron app.

If Anthropic is following the path of integrating Claude Code into its Work desktop app (finally adding parallel sessions: https://claude.com/blog/claude-code-desktop-redesign), OpenAI is coming from a different angle: this week they updated the Codex coding app and added computer control features. Different paths — same result.

Codex as a Superapp
https://openai.com/index/codex-for-almost-everything/
On macOS, Codex can now see the screen, move its own cursor, click, type text, open any application, and work in the background. Across all platforms, there is a built-in browser, image generation, memory (remembers preferences and previous actions — not yet in EU/UK), and over 90 plugins and integrations.

https://www.youtube.com/watch?v=sdNoaztocs0

While Codex has introduced a very "Cursor-like" pleasant feature — where you can simply click on any element (button, block, text, image) in a generated website to immediately add it to the prompt as a reference — the general trend of both companies (Anthropic and OpenAI) expanding their product audiences is slightly concerning for programmers.

Discussion
https://news.ycombinator.com/item?id=47796469
Many see this as a revolution for ordinary people (non-programmers): agents will be able to create personal UIs, automate business processes, replace entire programs, and radically increase productivity. At the same time, programmers are wary — security and privacy are still neglected: full agent access turns a computer into a "hostile device" where even a .txt file becomes an attack vector.

ChatGPT Pro for $100/mo
https://help.openai.com/en/articles/9793128-about-chatgpt-pro-tiers
At the beginning of April, the Codex token promotion ended; now, a free account can only run about two simple tasks before hitting the weekly limit. The $20 Plus plan also doesn't offer much headroom now, as the weekly limit is only suitable for light work (1-2 hours a day). That’s why, as of April 9th, an intermediate option between Plus and the $200 Pro was added.

The new $100 Pro tier has 5× higher limits than Plus and provides access to GPT-5.4 Pro and GPT-5.3 Instant. There is also a promotion until May 31, 2026, offering double tokens.

This is a direct response to Anthropic, which has Claude Max for $100.

Opus 4.7
https://www.anthropic.com/news/claude-opus-4-7
Claude Opus has updated from 4.6 to 4.7 — same features, but even better on benchmarks. They added "adaptive thinking": the model decides for itself how much to "think" before responding and hides the internal reasoning (it no longer shows the full chain of thought by default).

Discussion
https://news.ycombinator.com/item?id=47793411
The model has become stronger, especially in coding and long contexts. However, it is becoming less debuggable. It is now impossible to properly disable adaptive thinking, which makes Claude Code even worse to use; one has to jump through hoops with commands like /effort xhigh, CLAUDE_CODE_DISABLE_1M_CONTEXT=1, "display": "summarized", etc., just to understand what the model is generating.

Anthropic makes cool models, but the programming tools around them are getting worse.

Models updated, all promising agency:

  • DeepSeek V3.2 -> V4. Two versions: V4-Pro and V4-Flash. Open-source. Context: 1M in, 384K out. China. Cheaper scenarios for long documents, agents, and automation. Code quality is lower than other announced models.
  • GPT-5.4 -> GPT-5.5. Presented as an agent that can be trusted with work where the model must plan several steps ahead. Code generation is even better according to tests, while token consumption remains the same. The best model on the market right now, according to OpenAI.
  • Kimi K2.5 -> K2.6. Open-source. China. Moonshot AI positions the model as an agent for long-term programming tasks.
  • GLM-5 -> 5.1. Open-source. China. Claims significant improvements in code generation and cybersecurity.
  • Qwen 3.5 -> 3.6. Qwen3.6-Plus released as a closed model, followed by the flagship Qwen3.6-Max-Preview.
  • MiniMax M2.5 -> M2.7. Open-weights. China. Also for long tasks; said to have good emotional intelligence and stability on OpenClaw skills.
  • Important open-source / open-weight releases of small Qwen3.6 models for coding: Qwen3.6-35B-A3B — MoE model (35B total / 3B active), and Qwen3.6-27Bdense 27B. These are particularly interesting for running on local hardware.

The difference between GPT-5.5, Kimi K2.6, GLM-5.1, Qwen3.6 Plus, MiniMax M2.7, and DeepSeek-V4-Pro-Max on the SWE-Bench Pro test lies in the 55–59% range, meaning it is already a dense group of strong coding/agent models.

End of free Qwen Code
https://www.reddit.com/r/Qwen_AI/comments/1skeeu5/goodbye_qwen_you_tried_but_you_failed/
The Qwen OAuth free tier for Qwen Code was disabled on April 15, 2026. The old "log in via browser and use for free" scenario no longer works or returns errors such as 401 invalid access token, token expired, Internal error, or free tier quota exceeded.

Claude Code removal test from $20 plan
https://www.reddit.com/r/ClaudeAI/comments/1ss3asp/does_claudes_20_plan_no_longer_include_claude_code/
On April 21, 2026, users noticed that Claude Code disappeared from the $20 Pro plan on Anthropic's pricing page, remaining only in the more expensive Max plans. Anthropic explained that this was an A/B test / pricing experiment affecting approximately 2% of new users.

It seems cheap AI coding is gradually coming to an end.

Zed Agent Interface
https://zed.dev/blog/parallel-agents
Following Cursor's lead, Zed is adapting its interface to manage multiple agent chats simultaneously. The main innovation is the Threads Sidebar, which helps group threads by project, flexibly configure agent access to repositories, and track their progress. AI panels have been moved to the left, while files and Git are now on the right.

https://www.youtube.com/watch?v=OLit5C1XE0k

Discussion
https://news.ycombinator.com/item?id=47866750
Many programmers are dissatisfied with the interface changes, noting that running several agents simultaneously creates a massive "cognitive load" and complicates code review, as AI still generates too much "garbage" code. Users also mention the unfinished Git interface and the lack of proper code review tools—issues that should be addressed first.

The biggest pain point remains the isolation of databases, configurations, ports, and test data. Developers are actively discussing how to automate this: some write custom shell scripts, some use Devcontainers, while others praise third-party tools like Conductor or Ouijit for managing the lifecycle of such environments.

Claude Design
https://www.anthropic.com/news/claude-design-anthropic-labs
Anthropic introduced a specialized AI tool based on the new Claude Opus 4.7 model and a design system (DESIGN.md) tailored for the product design process. It creates fully functional interactive prototypes, presentations, landing pages, and UI components, outputting ready-to-use HTML, CSS, and JavaScript code in real-time.

A one-click export allows users to transfer the finished design directly into the Claude Code environment.