9:34:22 PM
theverge.com17 days ago

Anthropic Unveils Claude 3.7 Sonnet: A Smarter, 'Hybrid Reasoning' AI Model

Anthropic's latest AI model, Claude 3.7 Sonnet, introduces 'hybrid reasoning,' outperforming its predecessors in complex problem-solving, coding, and more. Alongside, they're testing Claude Code, an 'agentic' coding tool, signaling a move towards AI models that can handle a broader range of tasks efficiently.

Anthropic has just released Claude 3.7 Sonnet, marking its debut into "hybrid reasoning models," capable of tackling more intricate problems with enhanced proficiency in mathematics and coding. This model is not only a step up in AI capabilities but also comes with the unveiling of Claude Code, an "agentic" coding tool currently in limited research preview.

Claude Code: An Active AI Collaborator

While Anthropic already has a presence in AI-assisted coding through tools like Cursor, Claude Code aims to redefine AI's role in software development. It's designed as:

  • An active collaborator
  • Capable of searching and reading code
  • Editing files
  • Writing and running tests
  • Committing and pushing code to GitHub
  • Utilizing command line tools

Accessibility and Cost

Starting this week, Claude 3.7 Sonnet is accessible through the Claude app and for developers via Anthropic's API, Amazon Bedrock, and Google Cloud's Vertix AI. Remarkably, it maintains the same pricing as its predecessor, Claude 3.5 Sonnet:

  • $3 per million input tokens
  • $15 per million output tokens.

Hybrid Reasoning: A Simplified AI Experience

"We fundamentally believe that reasoning is a feature of the AI rather than a completely separate thing,"

says Dianne Penn, Anthropic product research lead.

This approach aims to streamline the user experience, embedding reasoning capabilities directly into the AI model rather than offering it as a separate component.

Performance and Capabilities

Claude 3.7 Sonnet demonstrates significant improvements in:

  • Agentic coding
  • Finance
  • Legal tasks

Its knowledge cut-off date extends to October 2024, offering more up-to-date information compared to previous versions. Moreover, developers can now influence the model's thought process through a scratchpad feature and even control response times.

"Sometimes the developer just needs to say it shouldn’t take more than 200 milliseconds to answer this question. And that’s a product decision.”

notes Anthropic’s VP of product, Michael Gerstenhaber.

Real-World Applications

Internally, Anthropic's team has leveraged the new model to:

  • Design front-end websites
  • Develop interactive games
  • Engage in extensive coding tasks, including building and editing test cases

The model's ability to advance through a Pokémon video game, surpassing its predecessor's limitations, further illustrates its enhanced capabilities.

The Future of AI: One Model to Do It All

As the AI landscape evolves rapidly, as highlighted by Elon Musk's Grok-3, Anthropic's Claude 3.7 Sonnet suggests a move towards comprehensive AI models. Rather than segregating functionalities, the industry may be heading towards a future where a single AI model can handle a wide array of tasks, streamlining efficiency and user experience.