Claude Sonnet 4 and Opus 4 Retire June 15 — What Developers Need to Do Now

June 13 update: Two days remaining. If you have not migrated yet, today is the day. The migration is a single line in most codebases — update the model string and ship. Instructions below.

The short version: Anthropic retires claude-sonnet-4-20250514 and claude-opus-4-20250514 on June 15, 2026, at 9AM PT. Any production application still calling those model IDs will receive API errors after that point. The migration is usually a single line — update the model string, run parallel tests, ship. The successors are strictly better on every benchmark that matters.

What Is Being Deprecated

Both models trace back to the original Claude 4 launch on May 14, 2025. They were the first Anthropic models to unify extended thinking and multi-tool use in a single generation, and for roughly a year they were the default Claude API choice for production workloads.

Model ID	Launched	Retires
`claude-sonnet-4-20250514`	May 14, 2025	June 15, 2026
`claude-opus-4-20250514`	May 14, 2025	June 15, 2026

After retirement, requests to either model ID return an error. Context window, pricing, and output format questions become moot — the calls simply fail.

What to Migrate To

Anthropic’s recommended successors:

Retiring model	Recommended replacement
`claude-sonnet-4-20250514`	`claude-sonnet-4-6-20260217`
`claude-opus-4-20250514`	`claude-opus-4-7-20260416`

If your workloads are primarily agentic or involve complex multi-step reasoning, consider moving directly to claude-opus-4-7-20260416 regardless of whether you were previously on Sonnet 4. It is more capable than both retiring models on every published benchmark, and in several pricing configurations it costs less per useful output token than Opus 4 did on long-horizon tasks.

What You Actually Get from the Upgrade

Sonnet 4.6 vs. Sonnet 4

Context window: 1,048,576 tokens (up from 200K). This is the most operationally significant change. Any pipeline that was chunking, summarizing, or trimming input to fit within 200K can now pass the full document or codebase in a single call.

Tool use: Improved reliability on parallel function calls and multi-turn tool use chains. Sonnet 4 occasionally dropped tool calls mid-sequence under high concurrency; Sonnet 4.6 handles this more predictably.

Coding benchmarks: Higher scores across SWE-bench Verified and HumanEval variants, reflecting targeted improvements on the software engineering tasks that define most production Sonnet use cases.

Cost: Per-token pricing is comparable at the Sonnet tier. The 1M context window comes at no long-context surcharge.

Opus 4.7 vs. Opus 4

SWE-bench Verified: 87.6%, up from the ~75% range at Opus 4 launch. SWE-bench Pro (a harder held-out variant): 64.3%.

Hallucination rate: 36% on Artificial Analysis’s AA-Omniscience benchmark. At Opus 4 launch, the same benchmark did not exist in its current form; contemporaneous evaluations showed Opus 4 competitive but not dominant. Opus 4.7’s 36% is 50 percentage points lower than GPT-5.5 (86% on the same test) — the largest factual accuracy gap between frontier labs in the first half of 2026.

Reasoning architecture: Extended Thinking is replaced by Adaptive Thinking. Rather than requiring you to explicitly set a thinking budget, Opus 4.7 infers the right compute depth from the task. The practical difference: better results on ambiguous or underspecified prompts without needing manual budget tuning.

Context window: 1 million tokens with 128K maximum output.

Image resolution: 3.75 megapixel maximum per image (up from 1.15MP at Opus 4 launch), improving computer-use and document-analysis pipelines.

New tokenizer note: Opus 4.7 uses a revised tokenizer that may generate up to 35% more tokens on certain prompt patterns compared to Opus 4. Run token-count tests on your high-volume prompts before switching pricing projections.

How to Migrate

The mechanical part is a one-line change in almost every integration:

# Before
client.messages.create(
    model="claude-opus-4-20250514",  # retiring June 15
    ...
)

# After
client.messages.create(
    model="claude-opus-4-7-20260416",
    ...
)

The same applies to any SDK, HTTP client, or infrastructure layer that holds the model string.

Testing Before the Deadline

The failure mode to watch for is prompt-response distribution shift. Both successor models are more capable, which in practice means they may respond more verbosely, apply more reasoning steps, or interpret ambiguous instructions differently from their predecessors. This is generally positive, but it can surface issues in downstream parsing, schema-constrained outputs, or evaluation harnesses calibrated on Sonnet 4 or Opus 4 behavior.

Recommended test protocol:

Run parallel calls: For three to five days, send the same production requests to both the retiring model and the successor. Compare output quality, token counts, and downstream application behavior.
Watch tool-use sequences: If your pipeline chains tool calls, confirm the successor handles your specific tool schemas without regression.
Recheck context-window assumptions: If you were chunking to 200K on Sonnet 4, test whether the full document flow actually improves output quality — it usually does, but some chunking strategies introduced implicit filtering that becomes a dependency.
Rebaseline token budgets for Opus 4.7: The tokenizer change means the same prompts may cost more. Verify against your actual workload before committing the migration to production.

Claude Agent SDK Workloads

If you are using the Claude Agent SDK and have agents explicitly parameterized to claude-opus-4-20250514 or claude-sonnet-4-20250514, those configurations need to update before June 15 as well. Agent SDK billing and routing decisions do not automatically alias deprecated model strings to successors.

Timeline Summary

Date	What happens
June 13 (today)	2 days remaining. Migrate now if you have not already.
June 14	Last day before retirement. All production traffic must be on successors.
June 15, 9AM PT	`claude-sonnet-4-20250514` and `claude-opus-4-20250514` return errors.