Significant Price Drop, Unlimited Chat, Coding Skills Surpassing Human Experts — Claude Opus 4.5 Regains the Top Model Crown

Significant Price Drop, Unlimited Chat, Coding Skills Surpassing Human Experts — Claude Opus 4.5 Regains the Top Model Crown

Claude Opus 4.5: Anthropic’s Most Powerful AI Model Yet

image
image

In the early hours of November 25, Anthropic unveiled Claude Opus 4.5, its most advanced AI model to date.

The company claims this release delivers state‑of‑the‑art performance in software engineering tasks, raising the stakes against competitors like OpenAI and Google.

---

Performance Milestones

Claude Opus 4.5 demonstrated top-tier results in Anthropic’s internal engineering evaluations — outperforming:

  • OpenAI GPT‑5.1‑Codex‑Max
  • Anthropic Sonnet 4.5
  • Google Gemini 3 Pro
image

Figure: Claude Opus 4.5’s SWE Bench benchmark results.

Key metric:

  • SWE‑bench Verified accuracy: 80.9%
  • GPT‑5.1‑Codex‑Max: 77.9%
  • Sonnet 4.5: 77.2%
  • Gemini 3 Pro: 76.2%

---

Pricing Update

Anthropic has significantly reduced pricing for Opus 4.5:

  • Input tokens: $5 per million
  • Output tokens: $25 per million

This is approximately two‑thirds lower than Claude Opus 4.1 (Input $15/million, Output $75/million).

The adjustment makes high-performance AI more accessible while increasing cost pressure on rivals.

---

Superior Judgment in Real-World Tasks

Testers report stronger judgment and intuitive reasoning across diverse use cases.

> “It’s like the model has suddenly clicked.” – Albert, Head of Developer Relations

Examples:

  • Delegating complete tasks—by connecting Slack and internal documentation, Opus 4.5 produces coherent, highly relevant summaries.
  • Enhanced cross‑software operations — e.g., creating a PowerPoint presentation from Excel data.
image

---

Outperforming Human Engineers

Opus 4.5 achieved record scores in Anthropic’s internal timed programming test — a two-hour evaluation originally designed for hiring engineers.

Highlights:

  • Parallel test-time computation:
  • Multiple solution attempts generated, best answer selected.
  • Result: higher than all human participants.
  • No time limit tests:
  • Performance matched the top human score in Claude Code’s environment.

> Note: These tests focus solely on technical and judgment skills, not teamwork or communication.

---

Efficiency Gains: 76% Token Reduction

Claude Opus 4.5 delivers superior results with fewer tokens:

  • Medium investment level:
  • Matches Sonnet 4.5’s top score while using 76% fewer output tokens.
  • High investment level:
  • +4.3 percentage point improvement over Sonnet 4.5.
  • Cuts token usage by 48%.

New Feature:

  • Investment Parameter — allows developers to control computational effort per task for an optimal balance between speed, cost, and accuracy.

---

Enterprise-Focused Features

Deep Office Integration

  • Excel-specific functions now available for Max, Team, and Enterprise users:
  • Pivot tables
  • Visual charts
  • File uploads
  • Chrome browser extension: available to all Max users.

Breaking Context-Length Limits

  • “Infinite Chat” intelligently summarizes previous conversation history, enabling near-limitless context.

Developer Capabilities

  • Programmatic tool invocation: Claude can write and execute code calling external functions.
  • Claude Code:
  • Upgraded plan mode
  • Desktop client (research preview)
  • Parallel AI agent sessions support

---

AI Self-Evolution & Market Pace

  • Rapid iteration: Opus 4.5 released just weeks after Haiku 4.5 and Sonnet 4.5.
  • Competing releases:
  • OpenAI’s autonomous Codex Max (24‑hour runs)
  • Google’s Gemini 3, refined over months.

Anthropic leverages Claude itself for R&D acceleration.

Price cuts aim to expand adoption among startups, despite potential margin pressures.

---

Industry Outlook

  • Profitability challenges: Heavy investment in compute and talent means long paths to profit.
  • No single dominant player yet in the trillion-dollar AI market.
  • For enterprises & developers: Persistent cost reduction + performance increase.
  • Real-world implication: AI now surpasses human capabilities in certain professional tasks.

---

Monetization Opportunity: AiToEarn

As AI technology accelerates, multi-platform publishing and monetization tools become critical.

AiToEarn is an open-source AI content monetization platform offering:

  • Simultaneous publishing to:
  • Douyin, Kwai, WeChat, Bilibili, Rednote, Facebook, Instagram, LinkedIn, Threads, YouTube, Pinterest, and X (Twitter).
  • Integrated tools for AI content generation, publishing, analytics, and model ranking.
  • Useful links:
  • AiToEarn博客
  • AiToEarn核心应用
  • AI模型排名
image

Albert notes:

> “Claude Opus 4.5 continually refines its methods, enhancing execution without parameter updates — a capability now extending beyond programming into document creation, spreadsheet work, and presentations.”

---

In summary: Claude Opus 4.5 signals an era where AI matches or outperforms humans in specific technical tasks, while reducing token costs and adding enterprise-grade integrations. For developers and creators, the combination of high efficiency and broad market reach through platforms like AiToEarn unlocks significant new opportunities.

Read more

Translate the following blog post title into English, concise and natural. Return plain text only without quotes. 哈佛大学 R 编程课程介绍

Harvard CS50: Introduction to Programming with R Harvard University offers exceptional beginner-friendly computer science courses. We’re excited to announce the release of Harvard CS50’s Introduction to Programming in R, a powerful language widely used for statistical computing, data science, and graphics. This course was developed by Carter Zenke.