AI company Anthropic releases Claude 3.7 Sonnet, its first “hybrid reasoning model” that can solve complex problems. The new model outperforms previous Claude models in areas like math and coding.
Along with Claude 3.7, Anthropic is also releasing a “limited research preview” of its “agentic” coding tool called Claude Code. Claude Code has been pitched as “an active collaborator that can search and read code, edit files, write and run tests, commit and push code to GitHub, and use command line tools,” setting it apart from other AI coding tools like Cursor, which are also coincidentally powered by Anthropic.
READ: China disrupts AI market with DeepSeek: A better, cheaper version of ChatGPT? (January 27, 2025)
Claude 3.7 Sonnet is available starting Monday in the Claude app and for developers through Anthropic’s API, Amazon Bedrock, and Google Cloud’s Vertex AI. The latest version costs the same as its predecessor, Claude 3.5 Sonnet, at $3 per million input tokens and $15 per million output tokens.
In a statement, Anthropic Product Research lead Diane Penn said “We fundamentally believe that reasoning is a feature of the AI rather than a completely separate thing,” noting that Claude shouldn’t take long to answer the question “What time is it?” versus responding to a more complex prompt like, “plan a two-week trip to Italy while considering the weather in late March.”
Penn also said that Claude 3.7 Sonnet performs noticeably better on “agentic coding,” finance, and legal tasks. While Claude still lacks real-time web search like other models, version 3.7’s knowledge cut-off date of October 2024 is more up to date.
READ: Winford Wealth to uplift financial advisors with AI-driven solutions (January 9, 2025)
Anthropic’s “scratchpad” also allows developers to steer how the model thinks, and even decide how much time it should take to respond.
Inside Anthropic, employees have used the new model to build front-end website designs, interactive games, and even spend up to 45 minutes on coding work by “building test sets and editing test cases back and forth iteratively,” according to Penn.

