Ant Group open-sourced its multimodal AI model Ming-Flash-Omni 2.0 - xix.ai

MarkSanchez

February 11, 2026

Ant Group has open-sourced its multimodal AI model, Ming-Flash-Omni 2.0. The model reportedly surpasses models such as Gemini 2.5 Pro on some benchmarks for vision-language understanding, image editing, and audio generation. A key feature is unified audio generation: from a natural-language prompt, it produces speech, sound effects, and music on a single track. The model is built on the mixture-of-experts (MoE) Ling 2.0 architecture and is designed as a reusable base that developers can use to simplify multimodal app development.
