option
Home
News
Alibaba's Open-Source Qwen AI Model Breaks Records in Reasoning

Alibaba's Open-Source Qwen AI Model Breaks Records in Reasoning

December 8, 2025
124

The Qwen team at Alibaba has unveiled a new version of their open-source reasoning AI model, showcasing remarkable benchmark results.

Introducing Qwen3-235B-A22B-Thinking-2507. For the last three months, the Qwen team has been intensively scaling up what they refer to as the model’s "thinking capability," striving to enhance both the quality and depth of its reasoning processes.

The outcome is a model that truly shines in the most demanding areas: logical reasoning, complex mathematics, scientific challenges, and advanced coding. In fields that typically demand human expertise, this latest Qwen model is now setting a new bar for open-source AI.

On reasoning benchmarks, Qwen's newest open-source AI model scores 92.3 on AIME25 and 74.1 on LiveCodeBench v6 for coding. It also performs strongly in broader capability evaluations, achieving a 79.7 on Arena-Hard v2, a metric that assesses alignment with human preferences.

Fundamentally, this is a large-scale reasoning AI model from the Qwen team, featuring a total of 235 billion parameters. However, it employs a Mixture-of-Experts (MoE) architecture, meaning only a subset of these parameters—approximately 22 billion—are active at any given time. Imagine it as a vast team of 128 specialists on standby, with only the top eight experts for a particular task actually working on it.

One of its standout attributes is its exceptional memory capacity. Qwen's open-source reasoning AI model natively supports a context length of 262,144 tokens, providing a significant advantage for tasks requiring the comprehension of extensive information.

For developers and enthusiasts, the Qwen team has streamlined the getting-started process. The model is accessible on Hugging Face and can be deployed using tools like sglang or vllm to set up a personal API endpoint. The team also highlights their Qwen-Agent framework as the optimal method for leveraging the model's tool-calling functionalities.

To achieve peak performance with this open-source AI reasoning model, the Qwen team offers several recommendations. They advise an output length of around 32,768 tokens for standard tasks, but for highly complex problems, increasing this to 81,920 tokens allows the AI sufficient space to "think." They also suggest using explicit instructions in your prompts, such as requesting a "step-by-step reasoning" approach for mathematical problems, to obtain the most precise and well-organized responses.

The launch of this new Qwen model delivers a powerful, open-source reasoning AI capable of competing with leading proprietary models, particularly in tackling intricate, intellectually demanding challenges. It will be fascinating to observe what the developer community creates with this technology.

See also: AI Action Plan: US leadership must be ‘unchallenged’

Interested in deepening your knowledge of AI and big data from industry experts? Attend the AI & Big Data Expo in Amsterdam, California, and London. This comprehensive event runs alongside other major conferences, including the Intelligent Automation Conference, BlockX, Digital Transformation Week, and the Cyber Security & Cloud Expo.

Discover more upcoming enterprise technology events and webinars powered by TechForge here.

Related article
WordPress.com now allows AI agents to write and publish posts, plus more WordPress.com now allows AI agents to write and publish posts, plus more WordPress.com, the popular web hosting and publishing platform, is now embracing AI agents—a move that could reshape the look and feel of the web. The company announced Friday that it will allow AI agents to draft, edit, and publish content on custom
Kakao Mobility outlines Level 4 autonomous driving roadmap for physical AI Kakao Mobility outlines Level 4 autonomous driving roadmap for physical AI Kakao Mobility is planning to develop Level 4 autonomous driving technologies internally as part of its physical AI strategy. At the 2026 World IT Show conference in Seoul's COEX, Kim Jin-kyu — vice president and head of Kakao Mobility's Physical AI
Barry Diller: Trust in Sam Altman irrelevant as AGI nears Barry Diller: Trust in Sam Altman irrelevant as AGI nears Barry Diller, the billionaire media titan, does not believe OpenAI CEO Sam Altman is untrustworthy, despite recent reports suggesting otherwise. Speaking at the Wall Street Journal's "Future of Everything" conference this week, Diller defended Altman
Related Special Topic Recommendations
Business Best AI Expense Trackers: Scan Receipts & Categorize Corporate Spend Automatically
Best AI Expense Trackers: Scan Receipts & Categorize Corporate Spend Automatically

2026 Latest Best AI Expense Trackers: Top-rated tools to scan receipts & categorize corporate spend automatically. Discover powerful, game-changing solutions for effortless expense management, accurate financial tracking, and streamlined compliance. Our curated, weekly-updated comparison of free vs paid options helps you find the perfect fit. Unlock your AI edge with XIX.AI's expert picks.

10 tools
xix.ai
Business Best AI Recruiting Tools: Screen Resumes & Automate Candidate Interview Scheduling
Best AI Recruiting Tools: Screen Resumes & Automate Candidate Interview Scheduling

Discover the 2026 latest top-rated AI recruiting tools on XIX.AI. Our curated list features powerful, game-changing solutions for screening resumes and automating candidate interview scheduling. Compare free vs paid options with real-world tests and weekly updated rankings. Find your perfect hiring assistant and streamline your recruitment today!

10 tools
xix.ai
Productivity AI Personal Wellness & Focus Coaches: Manage Burnout & Boost Mental Energy Levels
AI Personal Wellness & Focus Coaches: Manage Burnout & Boost Mental Energy Levels

Discover the 2026 best AI personal wellness and focus coaches on XIX.AI. Our curated rankings feature top-rated, game-changing tools to manage burnout and boost mental energy. Compare free vs paid options with real-world insights. Unlock your path to peak productivity and well-being today.

10 tools
xix.ai
chatbot Top-Rated AI Romantic Chatbots: Build Long-Term Relationships with Consistent Personalities
Top-Rated AI Romantic Chatbots: Build Long-Term Relationships with Consistent Personalities

Discover the 2026 latest top-rated AI romantic chatbots for building genuine, long-term connections. Our curated list features powerful, consistent personalities, free vs paid comparisons, and real-world tests. Find your perfect companion and start building today at XIX.AI.

10 tools
xix.ai
Education and Learning Best AI Data Science Mentors: Master SQL, Pandas & Machine Learning Workflows
Best AI Data Science Mentors: Master SQL, Pandas & Machine Learning Workflows

Discover the 2026 best AI data science mentors to master SQL, Pandas & ML workflows. Explore our top-rated, curated selection at XIX.AI for powerful, game-changing guidance. Compare free vs paid options with real-world insights. Unlock your data science mastery today.

10 tools
xix.ai
chatbot Best AI Flirting & Conversation Trainers: Improve Social Charisma and Confidence in Real-Time
Best AI Flirting & Conversation Trainers: Improve Social Charisma and Confidence in Real-Time

Discover the 2026 best AI flirting and conversation trainers on XIX.AI. Our curated, top-rated selection helps you build social charisma and confidence in real-time. Explore must-try, game-changing tools with free vs paid comparisons and weekly updated rankings. Unlock your social edge today.

10 tools
xix.ai
Comments (1)
0/500
HaroldMoore
HaroldMoore March 24, 2026 at 6:00:39 PM EDT

AlibabaのオープンソースAIがまたすごい成果を出しましたね!Qwenの推論能力、本当に進化が早い。最近は色んな企業が自社モデルを公開して競争が激しいけど、オープンソースでここまでできると、商用モデルもプレッシャー感じるんじゃないかな?個人的には、こういう技術がもっと手軽に使えるようになったら、普段の仕事の効率も上がりそうで楽しみです✨

OR