Anthropic Unveils Claude Opus 4.1: Stronger Coding and Agentic Reasoning Ahead of GPT‑5 Clash
On August 5, 2025, Anthropic released Claude Opus 4.1, a significant upgrade to its flagship AI model series, focusing on enhanced performance in coding, agentic tasks, and reasoning. The release comes just three months after the launch of Claude Opus 4 and positions Anthropic as a strong competitor amid rising industry pressure, including OpenAI’s recent release of open-source inference models and anticipated GPT-5 launch. Claude Opus 4.1 is now available to paid Claude users, in Claude Code, and via API, Amazon Bedrock, and Google Cloud’s Vertex AI. Pricing remains unchanged: $15 per million input tokens and $75 per million output tokens, maintaining its status as one of the most expensive AI models on the market. The most notable improvements are in real-world programming capabilities. On the SWE-bench Verified benchmark, Opus 4.1 achieved 74.5%, up from 72.5% on Opus 4, surpassing OpenAI’s o3 (69.1%) and Google’s Gemini 2.5 Pro (67.2%). On Terminal-Bench, it scored 43.3%—a substantial leap from Opus 4’s 39.2%, far outpacing competitors. GitHub reported significant gains in multi-file code refactoring, while Rakuten Group praised its precision in identifying exact fixes within large codebases without introducing new bugs. Windsurf, acquired by Cognition, noted a one-standard-deviation improvement over Opus 4—comparable to the leap from Sonnet 3.7 to Sonnet 4—making it a top performer for junior developer tasks. Security remains a priority. Opus 4.1 operates under Anthropic’s strictest safety standard, ASL-3. It rejected policy-violating requests 98.76% of the time, up from 97.27% on Opus 4. However, in broader reasoning benchmarks, gains were less pronounced. On GPQA Diamond, it scored 80.9%—matching Opus 4 but trailing Gemini 2.5 Pro (86.4%) and OpenAI o3 (83.3%). Performance on AIME and MMMU tests was competitive but not dominant, suggesting the update is strategically focused on coding and agentic workflows. This targeted upgrade reflects a shift in Anthropic’s product strategy. Rather than waiting for major generational leaps, the company is now prioritizing rapid, incremental improvements to maintain momentum. This approach is driven by strong commercial success: Anthropic’s annual recurring revenue (ARR) has surged from $1 billion to nearly $5 billion in seven months, with AI programming being the core growth engine. Claude Code, Anthropic’s developer-focused product, has become a key revenue driver, generating nearly $400 million annually and doubling in recent weeks. These results have bolstered confidence in a massive new funding round led by Iconiq Capital, aiming to raise $5 billion and push Anthropic’s valuation to $170 billion—nearly triple its $61.5 billion valuation in March. Anthropic has signaled that even larger model improvements are coming in the weeks ahead, framing the Opus 4.1 release as a stepping stone toward more transformative updates. The move underscores a strategic effort to stay ahead in the fast-evolving AI race, particularly as OpenAI prepares to launch GPT-5. In summary, Claude Opus 4.1 is not a revolution but a powerful evolution—tightening Anthropic’s lead in AI-powered software development, strengthening its commercial foundation, and setting the stage for a high-stakes showdown with OpenAI and other industry giants.