Xiaomi MiMo-V2.5 Series API Gets Permanent Price Cut, Up to 99% Off

June 11, 2026

GeorgeSmith

Amid the escalating AI model price war, Xiaomi announced on May 27 that it is permanently cutting API prices for its MiMo-V2.5 series of large models. At the same time, it is overhauling the billing system, and it credits technical advances for the further reduction in developers' calling costs.

I. Significant API Price Cuts — Up to 99% Off

The price change took effect globally at 00:00 Beijing Time on May 27. It applies to the two core versions, MiMo-V2.5 and MiMo-V2.5Pro. Pricing no longer varies with context window length, which makes the pricing structure simpler and more transparent.

| Model Version | Input Cache Hit Price | Maximum Discount | Output Price | Maximum Discount |
|---|---|---|---|---|
| MiMo-V2.5Pro | 0.025 yuan per million tokens | Up to 99% off | 6 yuan per million tokens | Up to 86% off |
| MiMo-V2.5 | 0.02 yuan per million tokens | Up to 98% off | 2 yuan per million tokens | Up to 93% off |
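To make the listed prices concrete, the following is a minimal sketch of a cost estimate for a single workload. It uses only the two prices reported above (input cache hit and output). The price for input tokens that miss the cache is not given in the article, so it is left out, and the function name `estimate_cost_yuan` is a hypothetical helper, not part of any Xiaomi SDK.

```python
from decimal import Decimal

# Prices in yuan per million tokens, as reported in the article.
# Cache-miss input pricing is not listed there, so it is omitted here.
PRICES_YUAN_PER_MILLION = {
    "MiMo-V2.5Pro": {"cached_input": Decimal("0.025"), "output": Decimal("6")},
    "MiMo-V2.5": {"cached_input": Decimal("0.02"), "output": Decimal("2")},
}

MILLION = Decimal(1_000_000)


def estimate_cost_yuan(model: str, cached_input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate cost in yuan for cache-hit input tokens plus output tokens."""
    price = PRICES_YUAN_PER_MILLION[model]
    total = price["cached_input"] * cached_input_tokens + price["output"] * output_tokens
    return total / MILLION


if __name__ == "__main__":
    # 2M cached input tokens and 0.5M output tokens on the Pro model.
    print(estimate_cost_yuan("MiMo-V2.5Pro", 2_000_000, 500_000))
```

At these rates output tokens dominate the bill: in the example, the 0.5 million output tokens cost 3 yuan, while the 2 million cached input tokens cost 0.05 yuan.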

II. Billing System Upgrade — More Value at No Extra Cost

Beyond the direct API price cuts, Xiaomi has heavily optimized its Token Plan billing system:

Expanded Quota: At the original plan prices, the actual token usage quota has increased to 5 to 8 times the previous amount.

Simplified Rules: The introduction of Credits replaces the previous complex billing methods, making token consumption and cost calculation more intuitive for developers.

III. Technical Foundation — How Can It Keep Lowering Prices?

Xiaomi's official statement attributes these deep price cuts to technical breakthroughs in its underlying inference system architecture:

SWA Inference Optimization: SGLang HiCache now fully supports sliding window attention (SWA), which has cut data transfer among GPU memory, CPU memory, and SSD to one-seventh of its previous volume.

Improved Cache Efficiency: The number of cacheable tokens has increased nearly fivefold compared to the earlier optimized version, boosting cache hit rates and dramatically lowering per-inference cost.

Cluster Throughput Optimization: With the introduction of expert parallelism for the mixture-of-experts (MoE) model and an input length bucketing strategy, the cluster's input throughput has improved markedly, maintaining high service quality while steadily reducing cost per token.
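The article does not describe how Xiaomi implements input length bucketing, so the following is only a generic sketch of the idea: requests are grouped by prompt length so that each batch contains inputs of similar size and less compute is wasted on padding or stragglers. The bucket boundaries and the function names `bucket_for` and `group_by_bucket` are assumptions made for illustration.

```python
from bisect import bisect_left
from collections import defaultdict

# Hypothetical bucket upper bounds in tokens; the article gives no real values.
BUCKET_BOUNDS = [512, 2048, 8192, 32768]


def bucket_for(length: int):
    """Return the smallest bucket bound that fits the prompt, or None if it exceeds all bounds."""
    index = bisect_left(BUCKET_BOUNDS, length)
    return BUCKET_BOUNDS[index] if index < len(BUCKET_BOUNDS) else None


def group_by_bucket(request_lengths):
    """Group prompt lengths by bucket so similar-sized inputs can be batched together."""
    buckets = defaultdict(list)
    for length in request_lengths:
        buckets[bucket_for(length)].append(length)
    return dict(buckets)


if __name__ == "__main__":
    print(group_by_bucket([100, 512, 513, 9000, 40000]))
```

A scheduler built on this idea would form batches within each bucket. Requests that land in the `None` group exceed every bound and would need separate handling.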

Xiaomi's move is seen as a proactive response to the current intense competition in large model commercialization. As price barriers continue to drop, the MiMo series' cost-effectiveness will become even more pronounced, accelerating the deep integration of AI capabilities across vertical industries and developer workflows.
