option
Home
List of Al models
Qwen1.5-14B-Chat

Qwen1.5-14B-Chat

Add comparison
Add comparison
Model parameter quantity
14B
Model parameter quantity
Affiliated organization
Alibaba
Affiliated organization
Open Source
License Type
Release time
February 4, 2024
Release time

Model Introduction
Qwen1.5 is the beta version of Qwen2, maintaining its architecture as a decoder-only transformer model with SwiGLU activation, RoPE, and multi-head attention mechanisms. It offers nine model sizes and has enhanced multilingual and chat model capabilities, supporting a context length of 32,768 tokens. All models have enabled system prompts for roleplaying, and the code supports native implementation in transformers.
Swipe left and right to view more
Language comprehension ability Language comprehension ability
Language comprehension ability
Often makes semantic misjudgments, leading to obvious logical disconnects in responses.
5.7
Knowledge coverage scope Knowledge coverage scope
Knowledge coverage scope
Has significant knowledge blind spots, often showing factual errors and repeating outdated information.
5.8
Reasoning ability Reasoning ability
Reasoning ability
Unable to maintain coherent reasoning chains, often causing inverted causality or miscalculations.
3.8
Related model
Qwen3-Next-80B-A3B-Thinking The latest released Qwen3-Next series in Qwen models, improving scaling efficiency through innovative model architecture.
Qwen3-235B-A22B-Thinking-2507 Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models.
Qwen3-Max-2026-01-23 The flagship reasoning model newly released by Qwen, introduces two innovations: adaptive tool calling and test-time scaling.
Qwen3-Next-80B-A3B-Thinking The latest released Qwen3-Next series in Qwen models, improving scaling efficiency through innovative model architecture.
Qwen3-235B-A22B-Thinking-2507 Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models.
Relevant documents
Reliance unveils $110B AI investment plan as India accelerates tech drive Mukesh Ambani, the billionaire chairman of India's Reliance conglomerate, announced on Thursday a ₹10 trillion (roughly $110 billion) plan to build AI computing infrastructure across India over the next seven years.Speaking at the India AI Impact Sum
Zhiyuan WITA Ends 'Naked' Robot Interaction with First Compliance Filing The embodied intelligence sector has reached a significant milestone. According to the latest announcement from the Shanghai Cyberspace Administration, the WITA large model developed by Zhiyuan has successfully completed the filing process, becoming
Anthropic Study Links Polished AI Content to Reduced Human Thinking When you see AI instantly produce a well-structured, logically clear piece of code or document, are you tempted to trust it without a second thought? According to AIbase, the leading AI company Anthropic recently published a research report titled "A
UK Government Departments Clash Over Energy Needs for AI Data Centers The UK government is grappling with a major challenge: advancing clean energy while aiming to become a global leader in artificial intelligence. Yet serious inconsistencies appear between the departments responsible for these goals. The Department fo
Cyberspace Administration of China mandates tagging of AI-generated and fictional short videos The Cyberspace Administration of China has rolled out a comprehensive plan to standardize short video content labeling, mandating that platforms offer six required tags—including "AI-generated content"—ushering in a new era of mandatory transparency
Model comparison
Start the comparison
OR