Anthropic Unveils Its Smartest 'Hybrid Reasoning' AI Model Yet
Anthropic has just unveiled Claude 3.7 Sonnet, marking the debut of its first "hybrid reasoning model." This groundbreaking model is designed to tackle more intricate challenges and surpasses earlier iterations when it comes to tasks like mathematics and coding.
To complement this advancement, Anthropic is also launching a "limited research preview" of Claude Code, an agentic coding tool. While Anthropic already powers AI coding solutions like Cursor, Claude Code is being marketed as an interactive partner capable of searching and reading code, modifying files, writing and running tests, pushing code to GitHub, and utilizing command-line tools.
Claude 3.7 Sonnet will be accessible starting Monday within the Claude app and via Anthropic’s API, Amazon Bedrock, and Google Cloud’s Vertex AI. Pricing remains consistent with its predecessor, 3.5 Sonnet, at $3 per million input tokens and $15 per million output tokens.
Unlike competitors such as OpenAI, which offer distinct reasoning models, Anthropic emphasizes integrating reasoning capabilities directly into the model itself. As Dianne Penn, Anthropic’s product research lead, explained to The Verge, “We fundamentally believe that reasoning is a feature of the AI rather than something entirely separate.” For instance, Claude shouldn’t struggle much with straightforward queries like “What time is it?” but excels at handling complex prompts like planning a two-week trip to Italy while factoring in weather conditions.
Anthropic
Anthropic
Penn noted that Claude 3.7 Sonnet shows marked improvement in agentic coding, finance, and legal matters. Although Claude doesn’t yet support real-time web searches—a capability present in other models—it boasts a knowledge cutoff date of October 2024, making it more current. Developers can influence how the model operates through its scratchpad feature and specify exact response times. “Sometimes,” said Anthropic’s VP of product, Michael Gerstenhaber, “the developer simply needs to indicate that it shouldn’t take longer than 200 milliseconds to answer this question,” highlighting a strategic product decision.
Internally, Anthropic staff have utilized the new model to design front-end website interfaces, create interactive games, and engage in up to 45 minutes of coding activities, such as building test sets and refining test cases iteratively, according to Penn.

Claude Code. Anthropic
Penn mentioned that Anthropic evaluates its models’ capabilities by having them navigate an old-school Pokémon video game, mapping the model’s API to a controller interface. While Claude 3.5 Sonnet struggled to leave Pallet Town initially, Claude 3.7 successfully defeated several gym leaders.
Elon Musk’s recent unveiling of Grok-3 last week underscored the rapid pace of the AI model competition. For now, Anthropic stands ahead thanks to Claude 3.7 Sonnet’s impressive performance. Its release hints at a future where a single model handles every task, rather than requiring specialized tools for different functions.
Related article
WordPress.com now allows AI agents to write and publish posts, plus more
WordPress.com, the popular web hosting and publishing platform, is now embracing AI agents—a move that could reshape the look and feel of the web. The company announced Friday that it will allow AI agents to draft, edit, and publish content on custom
Kakao Mobility outlines Level 4 autonomous driving roadmap for physical AI
Kakao Mobility is planning to develop Level 4 autonomous driving technologies internally as part of its physical AI strategy.
At the 2026 World IT Show conference in Seoul's COEX, Kim Jin-kyu — vice president and head of Kakao Mobility's Physical AI
Barry Diller: Trust in Sam Altman irrelevant as AGI nears
Barry Diller, the billionaire media titan, does not believe OpenAI CEO Sam Altman is untrustworthy, despite recent reports suggesting otherwise. Speaking at the Wall Street Journal's "Future of Everything" conference this week, Diller defended Altman
Related Special Topic Recommendations
Comments (4)
0/500
Hybrid reasoning sounds like a game-changer for coding tasks, but I'm curious about the real-world cost. The article mentions new pricing tiers—will this make AI development more accessible or just widen the gap between big labs and indie researchers? 🤔
¡Otra IA 'más inteligente'? 😅 La verdad es que estos lanzamientos ya se sienten como una rutina mensual. Me interesa eso del "razonamiento híbrido", pero me pregunto: ¿realmente resolverá problemas del mundo real de forma más confiable, o solo será mejor en benchmarks artificiales? Veremos cómo se compara en usabilidad con GPT-o.
ハイブリッド推論モデルって何?数学やコーディングが得意なのはすごいけど、AIが複雑な問題を解けるようになると、人間の仕事が奪われるんじゃないかと少し心配😅 でも技術の進歩は止められないから、うまく付き合っていくしかないですね。
Anthropic has just unveiled Claude 3.7 Sonnet, marking the debut of its first "hybrid reasoning model." This groundbreaking model is designed to tackle more intricate challenges and surpasses earlier iterations when it comes to tasks like mathematics and coding.
To complement this advancement, Anthropic is also launching a "limited research preview" of Claude Code, an agentic coding tool. While Anthropic already powers AI coding solutions like Cursor, Claude Code is being marketed as an interactive partner capable of searching and reading code, modifying files, writing and running tests, pushing code to GitHub, and utilizing command-line tools.
Claude 3.7 Sonnet will be accessible starting Monday within the Claude app and via Anthropic’s API, Amazon Bedrock, and Google Cloud’s Vertex AI. Pricing remains consistent with its predecessor, 3.5 Sonnet, at $3 per million input tokens and $15 per million output tokens.
Unlike competitors such as OpenAI, which offer distinct reasoning models, Anthropic emphasizes integrating reasoning capabilities directly into the model itself. As Dianne Penn, Anthropic’s product research lead, explained to The Verge, “We fundamentally believe that reasoning is a feature of the AI rather than something entirely separate.” For instance, Claude shouldn’t struggle much with straightforward queries like “What time is it?” but excels at handling complex prompts like planning a two-week trip to Italy while factoring in weather conditions.
Anthropic
Anthropic
Penn noted that Claude 3.7 Sonnet shows marked improvement in agentic coding, finance, and legal matters. Although Claude doesn’t yet support real-time web searches—a capability present in other models—it boasts a knowledge cutoff date of October 2024, making it more current. Developers can influence how the model operates through its scratchpad feature and specify exact response times. “Sometimes,” said Anthropic’s VP of product, Michael Gerstenhaber, “the developer simply needs to indicate that it shouldn’t take longer than 200 milliseconds to answer this question,” highlighting a strategic product decision.
Internally, Anthropic staff have utilized the new model to design front-end website interfaces, create interactive games, and engage in up to 45 minutes of coding activities, such as building test sets and refining test cases iteratively, according to Penn.

Claude Code. Anthropic
Penn mentioned that Anthropic evaluates its models’ capabilities by having them navigate an old-school Pokémon video game, mapping the model’s API to a controller interface. While Claude 3.5 Sonnet struggled to leave Pallet Town initially, Claude 3.7 successfully defeated several gym leaders.
Elon Musk’s recent unveiling of Grok-3 last week underscored the rapid pace of the AI model competition. For now, Anthropic stands ahead thanks to Claude 3.7 Sonnet’s impressive performance. Its release hints at a future where a single model handles every task, rather than requiring specialized tools for different functions.
WordPress.com now allows AI agents to write and publish posts, plus more
WordPress.com, the popular web hosting and publishing platform, is now embracing AI agents—a move that could reshape the look and feel of the web. The company announced Friday that it will allow AI agents to draft, edit, and publish content on custom
Barry Diller: Trust in Sam Altman irrelevant as AGI nears
Barry Diller, the billionaire media titan, does not believe OpenAI CEO Sam Altman is untrustworthy, despite recent reports suggesting otherwise. Speaking at the Wall Street Journal's "Future of Everything" conference this week, Diller defended Altman
Hybrid reasoning sounds like a game-changer for coding tasks, but I'm curious about the real-world cost. The article mentions new pricing tiers—will this make AI development more accessible or just widen the gap between big labs and indie researchers? 🤔
¡Otra IA 'más inteligente'? 😅 La verdad es que estos lanzamientos ya se sienten como una rutina mensual. Me interesa eso del "razonamiento híbrido", pero me pregunto: ¿realmente resolverá problemas del mundo real de forma más confiable, o solo será mejor en benchmarks artificiales? Veremos cómo se compara en usabilidad con GPT-o.
ハイブリッド推論モデルって何?数学やコーディングが得意なのはすごいけど、AIが複雑な問題を解けるようになると、人間の仕事が奪われるんじゃないかと少し心配😅 でも技術の進歩は止められないから、うまく付き合っていくしかないですね。





Home






