AI news

Significant Price Drop, Unlimited Chat, Coding Skills Surpassing Human Experts — Claude Opus 4.5 Regains the Top Model Crown

Honghao Wang

25 Nov 2025 — 4 min read

Claude Opus 4.5: Anthropic’s Most Powerful AI Model Yet

In the early hours of November 25, Anthropic unveiled Claude Opus 4.5, its most advanced AI model to date.

The company claims this release delivers state‑of‑the‑art performance in software engineering tasks, raising the stakes against competitors like OpenAI and Google.

---

Performance Milestones

Claude Opus 4.5 demonstrated top-tier results in Anthropic’s internal engineering evaluations — outperforming:

OpenAI GPT‑5.1‑Codex‑Max
Anthropic Sonnet 4.5
Google Gemini 3 Pro

Figure: Claude Opus 4.5’s SWE Bench benchmark results.

Key metric:

SWE‑bench Verified accuracy: 80.9%
GPT‑5.1‑Codex‑Max: 77.9%
Sonnet 4.5: 77.2%
Gemini 3 Pro: 76.2%

---

Pricing Update

Anthropic has significantly reduced pricing for Opus 4.5:

Input tokens: $5 per million
Output tokens: $25 per million

This is approximately two‑thirds lower than Claude Opus 4.1 (Input $15/million, Output $75/million).

The adjustment makes high-performance AI more accessible while increasing cost pressure on rivals.

---

Superior Judgment in Real-World Tasks

Testers report stronger judgment and intuitive reasoning across diverse use cases.

> “It’s like the model has suddenly clicked.” – Albert, Head of Developer Relations

Examples:

Delegating complete tasks—by connecting Slack and internal documentation, Opus 4.5 produces coherent, highly relevant summaries.
Enhanced cross‑software operations — e.g., creating a PowerPoint presentation from Excel data.

---

Outperforming Human Engineers

Opus 4.5 achieved record scores in Anthropic’s internal timed programming test — a two-hour evaluation originally designed for hiring engineers.

Highlights:

Parallel test-time computation:
Multiple solution attempts generated, best answer selected.
Result: higher than all human participants.
No time limit tests:
Performance matched the top human score in Claude Code’s environment.

> Note: These tests focus solely on technical and judgment skills, not teamwork or communication.

---

Efficiency Gains: 76% Token Reduction

Claude Opus 4.5 delivers superior results with fewer tokens:

Medium investment level:
Matches Sonnet 4.5’s top score while using 76% fewer output tokens.
High investment level:
+4.3 percentage point improvement over Sonnet 4.5.
Cuts token usage by 48%.

New Feature:

Investment Parameter — allows developers to control computational effort per task for an optimal balance between speed, cost, and accuracy.

---

Enterprise-Focused Features

Deep Office Integration

Excel-specific functions now available for Max, Team, and Enterprise users:
Pivot tables
Visual charts
File uploads
Chrome browser extension: available to all Max users.

Breaking Context-Length Limits

“Infinite Chat” intelligently summarizes previous conversation history, enabling near-limitless context.

Developer Capabilities

Programmatic tool invocation: Claude can write and execute code calling external functions.
Claude Code:
Upgraded plan mode
Desktop client (research preview)
Parallel AI agent sessions support

---

AI Self-Evolution & Market Pace

Rapid iteration: Opus 4.5 released just weeks after Haiku 4.5 and Sonnet 4.5.
Competing releases:
OpenAI’s autonomous Codex Max (24‑hour runs)
Google’s Gemini 3, refined over months.

Anthropic leverages Claude itself for R&D acceleration.

Price cuts aim to expand adoption among startups, despite potential margin pressures.

---

Industry Outlook

Profitability challenges: Heavy investment in compute and talent means long paths to profit.
No single dominant player yet in the trillion-dollar AI market.
For enterprises & developers: Persistent cost reduction + performance increase.
Real-world implication: AI now surpasses human capabilities in certain professional tasks.

---

Monetization Opportunity: AiToEarn

As AI technology accelerates, multi-platform publishing and monetization tools become critical.

AiToEarn is an open-source AI content monetization platform offering:

Simultaneous publishing to:
Douyin, Kwai, WeChat, Bilibili, Rednote, Facebook, Instagram, LinkedIn, Threads, YouTube, Pinterest, and X (Twitter).
Integrated tools for AI content generation, publishing, analytics, and model ranking.
Useful links:
AiToEarn博客
AiToEarn核心应用
AI模型排名

Albert notes:

> “Claude Opus 4.5 continually refines its methods, enhancing execution without parameter updates — a capability now extending beyond programming into document creation, spreadsheet work, and presentations.”

---

In summary: Claude Opus 4.5 signals an era where AI matches or outperforms humans in specific technical tasks, while reducing token costs and adding enterprise-grade integrations. For developers and creators, the combination of high efficiency and broad market reach through platforms like AiToEarn unlocks significant new opportunities.