Anthropic has announced that Claude Sonnet 4 now supports up to 1 million tokens of context in the Anthropic API, a five-fold increase over the previous limit. This expanded "long context" capability allows developers to feed larger datasets into Claude in a single request, enabling use cases such as large-scale code analysis and synthesis of massive document collections. Pricing doubles for prompts over 200,000 tokens, but prompt caching and batch processing can cut costs by up to 50 percent. Early adopters have reported positive results using the feature.
Title: Anthropic Expands Claude Sonnet 4 Capabilities with 1 Million Token Context
Anthropic has announced a significant enhancement to its Claude Sonnet 4 AI model, increasing its context window to 1 million tokens. This substantial increase, a five-fold improvement over the previous limit, allows developers to process larger datasets in a single request. The expanded "long context" capability is particularly beneficial for use cases such as large-scale code analysis and synthesis of extensive document collections.
The new feature enables developers to load entire codebases, including source files, tests, and documentation, allowing Claude to understand project architecture, identify cross-file dependencies, and suggest improvements that consider the complete system design. Additionally, developers can analyze hundreds of documents simultaneously, making it easier to maintain context across multiple tools and workflows.
The pricing structure for the increased context window has been adjusted to account for the higher computational requirements. For prompts over 200,000 tokens, the cost per token has doubled, with input tokens priced at $6 per million and output tokens at $22.50 per million. However, Anthropic has introduced prompt caching and batch processing to reduce costs and latency by up to 50 percent.
Early adopters have reported positive results using the new feature. Bolt.new, a browser-based development platform, has integrated Claude Sonnet 4 into its workflows, enabling developers to work on larger projects with higher accuracy. Similarly, iGent AI, a London-based software development company, has leveraged the 1 million token context to advance its autonomous coding capabilities, allowing for multi-day sessions on real-world codebases.
The expanded context window is available in public beta on the Anthropic API for customers with Tier 4 and custom rate limits, with broader availability rolling out over the coming weeks. It is also available on Amazon Bedrock and will soon be available on Google Cloud's Vertex AI.
References:
[1] https://www.anthropic.com/news/1m-context
[2] https://www.ainvest.com/news/anthropic-releases-claude-opus-4-1-enhanced-coding-capabilities-2508/
[3] https://www.reddit.com/r/Anthropic/comments/1mocu53/claude_sonnet_4_now_supports_1m_tokens_of_context/
[4] https://finance.yahoo.com/news/anthropic-claude-ai-model-now-161546709.html
Comments
No comments yet