Claude Code 2.1.199 Adds Stacked Slash-Skill Invocations, Fixes Subagent Error Reporting, the Background-Agent Daemon Crash Loop on Linux, and the Streaming-Response Discard Bug

lschvnJuly 3, 2026

TL;DR

•[Claude Code v2.1.199](https://github.com/anthropics/claude-code/releases/tag/v2.1.199) (released 2026-07-02, 23:35:18 UTC) ships 24 changelog entries: one new feature (stacked slash-skill invocations load up to 5 leading skills in one prompt) and 23 bug fixes. The fix list is dominated by background-agent and subagent reliability: subagents cut off by a rate limit or server error used to silently fail and now return their partial work; subagents that hit an API error used to report success and now report the error; the Linux background-agent daemon used to kill itself and every running agent every ~50 seconds after an unclean shutdown left a corrupted worker record; and a regression in v2.1.196 made background agents fail to cold-start over SSH on macOS with `Could not switch to audit session`.
•Three of the fixes target silent data loss: streaming responses that the API cut off mid-stream (overloaded, server error) used to be discarded with the partial output, and are now kept with an `incomplete-response` notice; SSL/TLS errors behind corporate proxies (TLS-inspecting proxies, missing `NODE_EXTRA_CA_CERTS`, expired certs) used to burn retries before showing actionable guidance and now fail immediately with the fix hint; and resetting a corrupted config file from the startup recovery dialog used to destroy it unrecoverably, and now backs it up first. The release also fixes a `/diff` panel that did not refresh on out-of-band commits, plan mode not auto-allowing read-only `browser_batch` calls, and a `SendMessage` misrouting bug when a re-spawned agent reused a previous agent's name.
•The release raises the default retry count for non-capacity transient errors to 300 and lifts the cap of 15 on `CLAUDE_CODE_MAX_RETRIES`, so transient 429s unrelated to the user's usage limit now retry automatically with backoff for subscribers instead of failing the turn. This is the third consecutive Claude Code release this week (after [v2.1.197 Sonnet 5 default](/articles/2026-07-01--claude-sonnet-5-default-model-claude-code-1m-context) and [v2.1.198 Chrome GA + background-agent auto-PR](/articles/2026-07-02--claude-code-2-1-198-chrome-ga-background-agents-auto-pr)); the day-after-day cadence suggests Anthropic is iterating the v2.1.19x line at the same speed the model itself is moving.

Claude Code v2.1.199 shipped on 2026-07-02 at 23:35:18 UTC, one day after v2.1.198 (the Chrome GA + background-agent auto-PR + /dataviz + Gateway on AWS release) and two days after v2.1.197 made Sonnet 5 the default model. The release notes are unusually long for a one-day cycle: 24 changelog entries, one new feature and 23 bug fixes, with the fix list concentrated in the same area v2.1.198 expanded (background agents, subagents, the auto-PR workflow, plan mode) plus a long tail of smaller UX fixes. The pattern matches the previous two releases: Anthropic is iterating the v2.1.19x line at the same pace as the underlying model, and the product surface for Sonnet 5 is being hardened release-on-release.

The one new feature: stacked slash-skill invocations

The single new feature in v2.1.199 is a small but useful expansion of the slash-skill surface. In v2.1.198 and earlier, a prompt like /skill-a /skill-b do XYZ loaded only the first skill, skill-a; the second /skill-b was treated as part of the prompt text, not as another skill invocation. v2.1.199 changes the rule: the parser walks the leading tokens of the prompt and loads up to 5 slash-skills before falling through to the rest of the message. A command like /plan /review /dataviz do XYZ now loads plan, review, and dataviz as the active skill set, instead of loading only plan and silently dropping the rest.

The 5-skill cap is a guardrail, not a feature ceiling: skills carry their own context and tool surface, and a 6th skill would be the kind of prompt shape that already warrants a custom subagent instead of an inline skill chain. The cap is generous for the common cases (2 to 3 skills to combine a planning step, a code-review step, and a visualization step) and conservative for the edge cases (an over-eager agent that would otherwise scope-creep into a half-dozen overlapping skill contexts). The change is additive: existing single-skill invocations are unchanged, and a user that wants only one skill in scope can still write /skill-a do XYZ and get the same behavior as before.

Subagent error reporting and partial-work recovery

The deepest cluster of fixes in v2.1.199 is in the subagent / background-agent surface, and three of them are about making failures visible to the parent agent. In v2.1.198 and earlier, three independent failure modes hid partial work from the orchestrator:

Subagents cut off by a rate limit or server error silently failed. A subagent that ran out of context, hit a 429, or got a 5xx from the API would return nothing to the parent. The parent had no way to tell the difference between an unfinished task and a task that had never been started, and the partial work the subagent had already done was lost.
Subagents reported API errors as successful results. A subagent that hit an API error mid-task (a usage-limit error, a 529 overloaded, a model error) would return a result envelope to the parent, with the error surfaced as text inside the result. The parent would treat the envelope as success and use the error text as the answer.
SendMessage silently misrouted when a re-spawned agent reused a previous agent's name. When an agent died and a new agent took its name in the agent panel, a follow-up SendMessage would sometimes hit the dead agent's record and route to a stale conversation. The parent had no error, just a missing reply.

v2.1.199 fixes all three. Subagents cut off by a rate limit or server error now return their partial work to the parent, with the error attached. Subagents that hit an API error now report the error to the parent, with the partial work kept as a separate field on the result envelope. SendMessage detects the name mismatch and asks the caller to retarget, surfacing the routing problem as a tool error rather than a silent miss. The pattern is the same as the v2.1.198 fix for agent teams (a teammate that died on an API error now reports failed to the lead), and the two releases together turn the v2.1.196-era "subagents look like they succeeded but actually did not" failure mode into a visible, recoverable one.

Background-agent daemon crash loop on Linux

The most operationally painful fix in the release is the background-agent daemon crash loop on Linux. Before v2.1.199, an unclean shutdown of the daemon (a kill -9, a host OOM, a power loss, a crash inside a child process) could leave a corrupted worker record on disk in the daemon's state directory. The daemon's recovery code on the next start would try to read the record, fail on the corruption, log an error, and exit; the supervisor would respawn the daemon, which would fail on the same record; and the cycle would repeat every ~50 seconds. Every running background agent in the user's workspace would die with the daemon.

The crash loop is the kind of issue that surfaces in long-running CI and dashboarded worktree agents rather than in interactive single-session use, which is why it lands two releases after v2.1.198 made the auto-PR flow the default for background agents. v2.1.199 fixes the recovery code so the corrupted record is detected and replaced with a fresh one instead of crashing the daemon; the supervisor respawn is now a recovery action, not a repeating failure. The release also fixes a related v2.1.196 regression: background agents that tried to cold-start over SSH on macOS were failing with Could not switch to audit session, which made the SSH-on-macOS launch path unusable for teams that run Claude Code on a remote Mac. The fix is a one-line change in the SSH launch wrapper that explicitly switches the audit session before the agent tries to bind to it.

Streaming-response preservation on mid-stream API errors

Two related fixes target the moment a model call is in flight and the API cuts it off. The first is the streaming-response discard bug: when the Anthropic API emitted a mid-stream overloaded (529) or server error (5xx) after the model had already produced partial output, Claude Code 2.1.198 and earlier discarded the partial response along with the error, leaving the user with nothing. v2.1.199 keeps the partial output and appends an incomplete-response notice. The change matters most for long generations: a Sonnet 5 1M-context run, a multi-file refactor with thousands of tool calls, a /dataviz chart with a long explanation, all of them can lose several seconds of work to a single late-stream 529 under v2.1.198. The fix is paired with the new retry policy (see below): a partial turn is no longer wasted on the first transient error, and the partial is kept when retries are exhausted.

The second is the SSL/TLS error handling behind corporate proxies. In v2.1.198 and earlier, an SSL handshake failure caused by a TLS-inspecting proxy, a missing NODE_EXTRA_CA_CERTS, or an expired certificate would burn the full retry budget before showing any guidance. v2.1.199 changes the path: SSL errors fail immediately with a fix hint that names the most likely cause (NODE_EXTRA_CA_CERTS for a missing CA bundle, certificate check for an expired cert, proxy trust store for a corporate MITM). The change is a quality-of-error improvement, not a behavior change: the same error is now diagnosed in one turn instead of five, and the retry budget is preserved for transient errors that actually deserve retries.

Plan mode, hook stderr, and the `claude agents` view

The plan-mode fixes in v2.1.199 are smaller but cover two distinct paper cuts. First, plan mode was not prompting for state-changing browser tool calls; a session in plan mode that tried to call a state-changing browser action would silently auto-allow the call instead of asking the user. v2.1.199 keeps the existing auto-allow for read-only browser_batch calls (the safe, read-only path through the Chrome extension) and adds the missing prompt for state-changing browser tool calls. The split matches the v2.1.198 fix that auto-allowed read-only tool calls when a session starts in plan mode, and it closes the asymmetric treatment of the read-only and state-changing browser tools.

Second, the SessionStart, Setup, and SubagentStart hooks were silently hiding stderr when they exited with code 2. A hook that failed with a missing dependency, a parse error, or a runtime exception would log to stderr, exit 2, and leave the user with a successful-looking hook result. v2.1.199 shows the stderr in the transcript when a hook exits non-zero, so a misbehaving hook is visible to the user instead of being a silent fail. The change matters for the same reason the v2.1.197 MCP trust fix mattered: a hook is a user-controlled script, and a silent failure of a user-controlled script is a debugging time-sink.

The release also fixes a claude agents view bug: idle subagents used to vanish from the agent panel while other subagents were still working, leaving the user with a partial view of their running agents. v2.1.199 collapses surplus idle agents into an expandable summary row, so the user always sees the active agents and can expand the row to see the idle ones. The same view gets a fix where typing /model or /fast while viewing a subagent was silently opening the lead's model picker; v2.1.199 shows a notice explaining the command applies to the lead, and the same fix prevents the model picker from being opened in the wrong context. The claude agents session rows also get a cosmetic fix: pull-request links now show as bare #N without the redundant "PR" label.

Retry policy, watchdog, and transient 429s

The most user-visible behavior change in v2.1.199 is the retry policy. The release raises the default retry count for non-capacity transient errors (429s unrelated to the user's usage limit, 500-class server errors, 529 overloaded) from a small fixed value to 300, with backoff. The release also lifts the previous cap of 15 on the CLAUDE_CODE_MAX_RETRIES environment variable, so users and CI scripts can set a higher cap without it being silently clamped. The change is targeted: capacity-style errors (the user's own rate limit, the Pro/Max usage cap) are not affected, and subscribers get automatic retry while a session on a usage cap still surfaces the cap as before. The CLAUDE_CODE_RETRY_WATCHDOG env var, which bounds the total time spent retrying across a turn, is unchanged, so a runaway retry loop still gets caught by the watchdog.

The fix is paired with the v2.1.198 network-drop retry (transient errors like ECONNRESET now retry with backoff instead of aborting the turn), and the two together turn the "transient network blip aborts a long generation" failure mode into "transient blip is retried, the partial is kept if retries are exhausted, the cap is still respected." For CI workflows that run Claude Code against a fleet of repos, the change is a strict improvement: a long generation that hits a 529 at minute 8 of a 10-minute run no longer needs to be re-run from scratch.

Smaller fixes worth flagging

The v2.1.199 release notes close a long tail of small fixes. Background job progress indicators no longer stall for minutes while the job runs long commands. Background sessions on memory-starved machines now indicate low memory and suggest freeing resources, instead of showing a generic error. Remote sessions no longer flap between Working and Idle in the agent view when a background agent completes. claude stop is no longer silently undone by a racing background-agent respawn; the respawn now honors the stop. Opening or resuming a session with no new messages no longer grows the transcript file. Backgrounding a session with ← or /background no longer drops its /color from the agent view row. Resetting a corrupted config file from the startup recovery dialog no longer destroys the file unrecoverably; it now backs it up first. claude --dangerously-skip-permissions daemon <subcommand> is no longer treated as a chat prompt; the subcommand is now run. Claude in Chrome no longer repeatedly opens the reconnect page when sessions run from different builds or config directories.

Why this matters for Claude Code users

v2.1.199 is the third consecutive Claude Code release this week, and the third consecutive release to harden the same surface: the background-agent and subagent workflow that v2.1.198 made the default for code work in a worktree, on top of the Sonnet 5 default that v2.1.197 introduced two days earlier. The pattern is consistent: the model is settled, the auto-PR flow is settled, and the work is on the failure modes that the auto-PR flow surfaces in production. The background-agent daemon crash loop on Linux, the streaming-response discard bug, the silent subagent failures, the SSL error handling behind corporate proxies, and the plan-mode asymmetry are exactly the kind of issues that only show up when the workflow is running unattended in a CI pipeline or a long-lived dashboarded worktree, not in the interactive single-session use that dominated Claude Code's first year.

For Claude Code users, the practical takeaway is a quieter claude update window on a Friday morning. The release is additive; existing single-session workflows are unchanged. The single new feature (stacked slash-skill invocations) is a strict superset of the previous behavior. The retry policy is a default that can be reverted with the existing CLAUDE_CODE_MAX_RETRIES env var. And the failure-mode fixes (subagent errors, streaming-response preservation, plan-mode asymmetry) are the kind of change a working Claude Code user notices only when they don't fire, which is the point.

Frequently Asked Questions

Rolldown 1.1.4 Disables `experimental.lazyBarrel` by Default Again, One Month After 1.1.0 Made It Default-On

Rolldown [v1.1.4](https://github.com/rolldown/rolldown/releases/tag/v1.1.4), published 2026-07-01T14:02:02Z, ships one feature change and 19 bug fixes. The feature change is a partial reversal of the [v1.1.0 default-flip](https://github.com/rolldown/rolldown/releases/tag/v1.1.0) that landed on 2026-06-03: `experimental.lazyBarrel` is now disabled by default again, after four weeks of correctness reports against the default-on behaviour. The release also hardens the dev-mode path by forcing `lazyBarrel` off whenever `experimental.devMode` is set (PR [#10060](https://github.com/rolldown/rolldown/pull/10060)), on top of the existing force-off for `treeshake`. The default-flip revert (PR [#10071](https://github.com/rolldown/rolldown/pull/10071)) and the dev-mode fix both author as one line each: "disable `experimental.lazyBarrel` by default" and "fix(dev): disable lazy barrel in dev mode", and the root-cause tracking is the new [issue #10085](https://github.com/rolldown/rolldown/issues/10085) "Tracking strictExecutionOrder correctness and architecture issues", opened the day after the release. The release follows [Rolldown v1.1.3](https://github.com/rolldown/rolldown/releases/tag/v1.1.3) (2026-06-24) and is the first release since 1.1.0 to touch the lazyBarrel config surface.

More coverage with overlapping topics and tags.

aiJul 2, 2026

Claude Code 2.1.198 Makes Claude in Chrome Generally Available, Lets Background Agents Commit, Push, and Open Draft PRs, Adds `/dataviz` Skill, and Brings Claude Platform on AWS to the Gateway

Anthropic released Claude Code [v2.1.198](https://github.com/anthropics/claude-code/releases/tag/v2.1.198) on 2026-07-01T20:45:36Z, the second consecutive day of Claude Code shipping after v2.1.197 (the Sonnet 5 default release) the day before. The headline change is that Claude in Chrome is now generally available inside Claude Code: the browser-side assistant that was a research preview moves to a stable channel and is no longer gated behind a feature flag or a waitlist. The release also lands the first end-to-end automation for background agents in `claude agents`, which now commit, push, and open a draft pull request when they finish code work in a worktree, instead of stopping to ask for permission. Other notable additions: a `/dataviz` skill for chart and dashboard design with a runnable color-palette validator, a Gateway upstream for Claude Platform on AWS (`anthropicAws`) with model-not-found responses advancing the failover chain, a built-in Explore agent that now inherits the main session's model (capped at opus) instead of running on Haiku, and subagents that inherit the parent session's extended thinking configuration. The release also removes the `/agents` wizard and replaces it with the conversational flow, fixes a long list of reliability bugs around background agents, network drops, and platform auth, and improves syntax highlighting by upgrading to highlight.js 11.

aiJul 1, 2026

Claude Sonnet 5 Goes Default in Claude Code 2.1.197 with a 1M-Token Context Window, $2/$10 Per Mtok Introductory Pricing, and Cyber Safeguards On by Default

Anthropic announced Claude Sonnet 5 on 2026-06-30 ([Introducing Claude Sonnet 5](https://www.anthropic.com/news/claude-sonnet-5)), and Claude Code [v2.1.197](https://github.com/anthropics/claude-code/releases/tag/v2.1.197) (released the same day) made it the default model. The launch is the first Sonnet-line model that closes the agentic gap with Opus 4.8, ships a native 1M-token context window, and lands on introductory pricing of $2 per million input tokens and $10 per million output tokens through August 31, 2026 (then $3/$15 per Mtok). Sonnet 5 is the new default for Free and Pro plans in Claude and is the default model in Claude Code, available immediately on update. The release ships with cyber safeguards enabled by default (the same safeguards as Opus 4.7 and 4.8, less strict than the Fable 5 set), a refreshed tokenizer that inflates token counts 1.0 to 1.35x, and rate-limit increases across Chat, Cowork, Claude Code, and the Claude Platform to absorb the higher token usage of the new effort-level controls. The Anthropic system card and the Claude Code release notes both ship the same day, so the Claude Code audience gets the new model the same day it is announced.

aiJun 29, 2026

Cline 4.0.1 Rolls Back the SDK Migration After 4.0.0 Regressions; 4.0.2 Brings the SDK Code Back with Reasoning Effort and ClinePass Fixes

Cline shipped v4.0.1 on June 28, 2026 and v4.0.2 on June 29, 2026 (github.com/cline/cline), a two-step recovery cycle for the v4.0.0 SDK migration that landed on June 26. v4.0.1 ships the pre-SDK 3.89.x VS Code extension under a 4.0.1 version number, built from a dedicated `legacy-extension` branch via a new `ext-vscode-publish-legacy.yml` workflow, to resolve regressions reported in 4.0.0 (broken diff previews in the editor, run_commands errors during file edits, broken file-editing flow with GLM 5.2 and MiniMax M3 through Ollama). v4.0.2 restores the SDK-backed code path on top of the same legacy branch, adding reasoning effort support (including `xhigh`) for DeepSeek thinking models (#11938), a centralized reasoning effort control layer for ClinePass (#11954), canonical Z.ai model ids (#11951), webview env replacement fix (#11955), ClinePass and Z.ai metadata polish (#11958), and a default focus chain settings fix (#11960). CLI v3.0.32 ships the same day with SDK v0.0.54 context-compaction improvements and ClinePass onboarding polish. The release sequence shows a project recovering a major migration in 72 hours by tagging the legacy branch forward rather than reverting the SDK work.

Comments

No comments yet. Be the first to share your thoughts.

Frequently Asked Questions

Related articles

Claude Code 2.1.198 Makes Claude in Chrome Generally Available, Lets Background Agents Commit, Push, and Open Draft PRs, Adds `/dataviz` Skill, and Brings Claude Platform on AWS to the Gateway

Claude Sonnet 5 Goes Default in Claude Code 2.1.197 with a 1M-Token Context Window, $2/$10 Per Mtok Introductory Pricing, and Cyber Safeguards On by Default

Cline 4.0.1 Rolls Back the SDK Migration After 4.0.0 Regressions; 4.0.2 Brings the SDK Code Back with Reasoning Effort and ClinePass Fixes

Comments