#028

Claude Code Now Has 1M Tokens of Context — and You Shouldn't Fill Them

Opus 4.6 includes 1M context tokens on Max, Team, and Enterprise plans. 5x more than before, at no extra cost. But more context doesn't mean better results.

TL;DR Opus 4.6 includes 1 million context tokens by default on Max, Team, and Enterprise plans — 5x more than before, at no extra cost. But more context doesn't mean better results. Monitor with /context, compact proactively, and don't let the session degrade.

Since version 2.1.75, Claude Code uses Opus 4.6 with 1M tokens as the default model. The context window jumped from 200K to 1M — 5x more space before Claude needs to compact. This means longer sessions, fewer interruptions, and the ability to work with entire monorepos without fragmenting context.

But there's a catch: model performance degrades with distant tokens. This is known as context drift — the further information sits from the current point in the conversation, the less accurately the model retrieves it. Just because you can fill 1M tokens doesn't mean you should.

Who has access

Plan Opus 4.6 1M Sonnet 4.6 1M
Max, Team, Enterprise Included Requires extra usage
Pro Requires extra usage Requires extra usage
API (pay-as-you-go) Full access Full access

No surcharge — the per-token price is the same across the entire window. On plans where it's included, it doesn't consume additional credits.

Pro doesn't have it included yet, but Anthropic's pattern with premium features is clear: Enterprise/Max first, then it trickles down.

How to use it without wasting context

1. Monitor with /context

Use the /context command to see how much context you're consuming in real time. Don't wait for Claude to auto-compact — get ahead of it.

2. Compact before Claude does

Auto-compaction triggers at ~95% capacity. By that point, you've already lost precision on older tokens. Compact proactively:

/compact focus on the authentication module

The optional instructions tell Claude what to prioritize when summarizing.

3. Don't load context you don't need

Having 1M doesn't mean you should dump the entire codebase. Load selectively — specific directories, relevant files. Claude reads on demand; you don't need to pre-load everything.

4. Select the 1M model explicitly

If it's not active, select it manually:

/model opus[1m]
/model sonnet[1m]

Or when launching Claude Code:

claude --model opus[1m]

5. Control when compaction triggers

If you want compaction before 95%, adjust the threshold:

export CLAUDE_AUTOCOMPACT_PCT_OVERRIDE=70

Reference

Aspect Detail
Previous context 200K tokens
Current context 1M tokens (5x)
1M model Opus 4.6 (included on Max/Team/Enterprise)
Model aliases opus[1m], sonnet[1m]
Auto-compaction ~95% by default, adjustable with CLAUDE_AUTOCOMPACT_PCT_OVERRIDE
Disable 1M CLAUDE_CODE_DISABLE_1M_CONTEXT=1
Per-token price No surcharge — same price across the entire window

Official docs: Model configuration — Extended context

Get only what matters

If I have nothing worth saying, you won't hear from me. When I do, you'll be the first to know. 7,000+ professionals already trust this.

Are you a professional Web developer?
No

Unsubscribe at any time.