AI Agents, News & Updates

Anthropic Releases Claude Sonnet 4.6 with Major Coding and Computer Use Improvements

The upgraded Sonnet model approaches Opus-level performance at a lower price point, with developers preferring it over previous flagship models in early testing.

CA
Author
CWA Team
February 18, 2026
Anthropic Releases Claude Sonnet 4.6 with Major Coding and Computer Use Improvements

Image by Anthropic

Anthropic released Claude Sonnet 4.6 on Tuesday, calling it the company's "most capable Sonnet model yet" with significant upgrades across coding, computer use, and long-context reasoning capabilities.

The model is now the default for free and Pro plan users on claude.ai and Claude Cowork, maintaining the same pricing as its predecessor at $3 per million input tokens and $15 per million output tokens.

Developer Reception

Internal testing showed developers with early access preferred Sonnet 4.6 over Sonnet 4.5 roughly 70% of the time. More notably, users preferred the new model over Claude Opus 4.5—Anthropic's frontier model from November—59% of the time.

Users reported that Sonnet 4.6 "more effectively read the context before modifying code and consolidated shared logic rather than duplicating it," according to Anthropic. Testing also showed the model was "significantly less prone to overengineering and 'laziness,' and meaningfully better at instruction following."

Computer Use Advances

The model shows marked improvement on OSWorld, the standard benchmark for AI computer use that tests tasks across real software like Chrome, LibreOffice, and VS Code. Anthropic noted that early users are "seeing human-level capability in tasks like navigating a complex spreadsheet or filling out a multi-step web form."

Pace, an AI-powered insurance company, reported that "Claude Sonnet 4.6 hit 94% on our insurance benchmark, making it the highest-performing model we've tested for computer use."

Benchmark Performance

The model achieved 79.6% on SWE-bench Verified, 89.9% on GPQA Diamond, and 58.3% on ARC-AGI-2. On Anthropic's benchmarks for agentic financial analysis and office tasks, Sonnet 4.6 outperformed competitors including Google's Gemini 3 Pro and OpenAI's GPT 5.2.

Replit noted that "the performance-to-cost ratio of Claude Sonnet 4.6 is extraordinary—it's hard to overstate how fast Claude models have been evolving in recent months."

Technical Features

Sonnet 4.6 includes a 1 million token context window in beta—sufficient to hold entire codebases or dozens of research papers in a single request. The model supports adaptive thinking, extended thinking, and context compaction, which automatically summarizes older context as conversations approach limits.

Anthropic's safety evaluations concluded that Sonnet 4.6 shows "a broadly warm, honest, prosocial, and at times funny character, very strong safety behaviors, and no signs of major concerns around high-stakes forms of misalignment."

The company acknowledged the model "still lags behind the most skilled humans at using computers" but emphasized that the rate of progress suggests "substantially more capable models are within reach."

Share:

Other Latest News

Claude Outage Exposes Developer Dependency Risks as Anthropic Grapples With Surge in Demand
AI Agents, News & Updates

Claude Outage Exposes Developer Dependency Risks as Anthropic Grapples With Surge in Demand

A widespread Monday morning outage affecting Claude.ai and Claude Code left developers unable to access key tools for over two hours, raising questions about reliability as Anthropic navigates unprecedented user growth.

Mar 2, 2026
OpenAI Strikes Pentagon Deal With Safety Guardrails as Anthropic Gets Blacklisted Over Same Concerns
Industry Analysis

OpenAI Strikes Pentagon Deal With Safety Guardrails as Anthropic Gets Blacklisted Over Same Concerns

OpenAI secured a classified network deployment agreement with the Department of Defense that includes prohibitions on mass surveillance and autonomous weapons — the same safety red lines that contributed to Anthropic's blacklisting hours earlier.

Feb 28, 2026
Block Cuts Over 4,000 Jobs as Jack Dorsey Bets on AI to Replace Developer Teams

Block Cuts Over 4,000 Jobs as Jack Dorsey Bets on AI to Replace Developer Teams

Jack Dorsey's payments company Block is slashing its workforce nearly in half, from over 10,000 to under 6,000, in one of the most aggressive AI-driven restructurings yet — with major implications for software developers across the industry.

Feb 27, 2026
Figma's OpenAI Codex Integration Blurs the Line Between Designer and Developer

Figma's OpenAI Codex Integration Blurs the Line Between Designer and Developer

A week after partnering with Anthropic's Claude Code, Figma has integrated OpenAI's Codex—signaling a rapid push to make design-to-code workflows seamless for a new generation of design engineers.

Feb 26, 2026
Anthropic's Cowork Brings Autonomous AI Task Execution to Non-Technical Users

Anthropic's Cowork Brings Autonomous AI Task Execution to Non-Technical Users

Anthropic launches Cowork, a research preview feature that lets Claude access local files and complete knowledge work tasks autonomously — a potentially significant shift for solo entrepreneurs and small teams who lack dedicated support staff.

Feb 26, 2026
Cursor Gives AI Agents Their Own Computers, Signaling a Shift in How Developers Work
AI Agents

Cursor Gives AI Agents Their Own Computers, Signaling a Shift in How Developers Work

Cursor's updated cloud agents can now operate in isolated virtual machines, test their own code, and produce video demos — with the company reporting that over 30% of its internal pull requests are now created by autonomous agents.

Feb 25, 2026
← Scroll for more →