Xiaomi Unveils Three MiMo-V2 Large Models; Lei Jun Announces $16B Boost for AI

Xiaomi’s ambitious large-model plans were fully revealed in the spring of 2026.
On March 19, Xiaomi officially launched three self-developed large models: MiMo-V2-Pro, MiMo-V2-Omni, and MiMo-V2-TTS. This is more than a routine tech upgrade — it marks a milestone in Xiaomi’s deep commitment to the “Agent era.”
To underline its AI focus, Xiaomi’s founder stated on social media that the company’s R&D and capital investment in AI this year will exceed 16 billion yuan. He also noted that the trillion-parameter MiMo-V2-Pro ranks eighth globally in Artificial Analysis’ comprehensive intelligence benchmark and fifth in brand ranking worldwide.
Each of the three models plays a distinct role, forming a complete Agent stack:
Flagship base model MiMo-V2-Pro: Designed for high-intensity Agent tasks. With over 1 trillion (1T) total parameters, it uses a hybrid attention mechanism, balancing efficiency and capacity with 42B activated parameters. It supports an ultra-long context of 1 million Tokens, excelling in complex logical reasoning and tool calling.
Omni-modal base model MiMo-V2-Omni: Natively integrates text, vision, and audio. It spans the entire pipeline from sensory understanding to action execution, making it essential for Agents to perceive the physical world.
Speech large model MiMo-V2-TTS: Gives Agents “warm” expression abilities with fine-grained emotional control, creating more human-like machine interactions.
On the commercialization front, Xiaomi has adopted an extremely aggressive pricing strategy. Input within a 256K context costs only $1 per million Tokens, significantly undercutting competitors at the same tier. Both the Pro and Omni versions have officially opened API services.
Notably, the driving force behind this elite AI team is known as the “AI prodigy.” The mysterious model “Hunter Alpha,” which stirred the developer community, is actually the internal test version of MiMo-V2-Pro.
Related article
Conntour secures $7M from General Catalyst and YC for AI-powered security video search
The surveillance technology industry is currently under scrutiny, though not for the most favorable reasons. Controversies have flared as U.S. Immigration and Customs Enforcement reportedly accessed Flock’s camera network for surveillance, and home c
Apple's first AI hardware revealed: camera-equipped AirPods enter DVT stage
Apple's ambitions in AI hardware are becoming clearer. Well-known tech journalist Mark Gurman reports that the long-anticipated AirPods with built-in cameras have entered the critical final development stage: Design Verification Testing (DVT). This m
iOS27 to Launch Standalone Siri App With Chatbot Interface
With less than a month to go before Apple's 2026 Worldwide Developers Conference (WWDC), renowned tech journalist Mark Gurman has shared new insights into iOS 27. In the upcoming system, codenamed "Rave," Siri is making a comeback as a standalone app
Related Special Topic Recommendations
Comments (0)
0/500

Xiaomi’s ambitious large-model plans were fully revealed in the spring of 2026.
On March 19, Xiaomi officially launched three self-developed large models: MiMo-V2-Pro, MiMo-V2-Omni, and MiMo-V2-TTS. This is more than a routine tech upgrade — it marks a milestone in Xiaomi’s deep commitment to the “Agent era.”
To underline its AI focus, Xiaomi’s founder stated on social media that the company’s R&D and capital investment in AI this year will exceed 16 billion yuan. He also noted that the trillion-parameter MiMo-V2-Pro ranks eighth globally in Artificial Analysis’ comprehensive intelligence benchmark and fifth in brand ranking worldwide.
Each of the three models plays a distinct role, forming a complete Agent stack:
Flagship base model MiMo-V2-Pro: Designed for high-intensity Agent tasks. With over 1 trillion (1T) total parameters, it uses a hybrid attention mechanism, balancing efficiency and capacity with 42B activated parameters. It supports an ultra-long context of 1 million Tokens, excelling in complex logical reasoning and tool calling.
Omni-modal base model MiMo-V2-Omni: Natively integrates text, vision, and audio. It spans the entire pipeline from sensory understanding to action execution, making it essential for Agents to perceive the physical world.
Speech large model MiMo-V2-TTS: Gives Agents “warm” expression abilities with fine-grained emotional control, creating more human-like machine interactions.
On the commercialization front, Xiaomi has adopted an extremely aggressive pricing strategy. Input within a 256K context costs only $1 per million Tokens, significantly undercutting competitors at the same tier. Both the Pro and Omni versions have officially opened API services.
Notably, the driving force behind this elite AI team is known as the “AI prodigy.” The mysterious model “Hunter Alpha,” which stirred the developer community, is actually the internal test version of MiMo-V2-Pro.
Conntour secures $7M from General Catalyst and YC for AI-powered security video search
The surveillance technology industry is currently under scrutiny, though not for the most favorable reasons. Controversies have flared as U.S. Immigration and Customs Enforcement reportedly accessed Flock’s camera network for surveillance, and home c
Apple's first AI hardware revealed: camera-equipped AirPods enter DVT stage
Apple's ambitions in AI hardware are becoming clearer. Well-known tech journalist Mark Gurman reports that the long-anticipated AirPods with built-in cameras have entered the critical final development stage: Design Verification Testing (DVT). This m
iOS27 to Launch Standalone Siri App With Chatbot Interface
With less than a month to go before Apple's 2026 Worldwide Developers Conference (WWDC), renowned tech journalist Mark Gurman has shared new insights into iOS 27. In the upcoming system, codenamed "Rave," Siri is making a comeback as a standalone app





Home






