Alibaba Unveils Wan2.1-VACE: Open-Source AI Video Solution
Alibaba has introduced Wan2.1-VACE, an open-source AI model poised to transform video creation and editing processes.
VACE is a key component of Alibaba’s Wan2.1 video AI model family, with the company claiming it’s the “first open-source model in the industry to deliver a comprehensive solution for diverse video generation and editing tasks.”
If Alibaba can streamline the video production process, consolidating multiple tools into a single platform, it could redefine industry standards.
What can VACE do? It generates videos from various inputs, such as text prompts, static images, or short video clips.
Beyond video creation, its editing capabilities include using reference images or frames to guide the AI, advanced video “repainting” features, modifying specific video sections, and extending video duration. Alibaba states these tools “empower users to combine tasks flexibly, boosting creative potential.”

Picture creating a video featuring specific characters based on photos you provide. VACE can reportedly make it happen. Have a static image you want animated? This open-source AI model can add lifelike motion to it.
For precision editing, VACE offers “video repainting” tools, enabling pose transfers between subjects, detailed motion control, depth adjustments, and color modifications.
A standout feature is its ability to “add, modify, or remove specific video areas without impacting the surroundings.” This is a game-changer for precise edits, ensuring backgrounds remain untouched. It can also expand the video canvas, filling new areas with contextually relevant content for a richer, more immersive result.
With VACE, you can transform a static photo into a video, dictate object movements by defining paths, swap characters or objects using references, animate those references, or precisely control their poses.
Alibaba highlights VACE’s ability to convert a tall, narrow image into a widescreen video, intelligently expanding it by incorporating additional elements from reference images or prompts.
VACE’s capabilities are powered by sophisticated technology designed to tackle the complexities of video editing. The Video Condition Unit (VCU) “enables unified processing of multimodal inputs like text, images, videos, and masks.”
Additionally, the “Context Adapter structure” integrates “formalized representations of temporal and spatial dimensions,” giving the AI a deep understanding of time and space within videos.
Alibaba envisions VACE excelling in applications like social media content creation, dynamic advertising, professional post-production for film and TV, and customized educational or training videos.
Alibaba Shares Wan2.1-VACE as Open-Source to Empower Creators
Developing advanced AI models typically demands significant resources, including vast computational power and data. Alibaba’s decision to open-source Wan2.1-VACE is a significant move.
“Open access reduces barriers, allowing more businesses to harness AI for creating tailored, high-quality visual content efficiently and affordably,” Alibaba notes.
This move aims to empower smaller businesses and individual creators by providing access to cutting-edge AI tools without high costs, fostering broader innovation.
Alibaba offers two versions: a robust 14-billion parameter model for high-performance systems and a lighter 1.3-billion parameter model for less demanding setups. Both are available for free on Hugging Face, GitHub, and Alibaba Cloud’s ModelScope community.
See also: US Tightens AI Diffusion Rules, Strengthens Chip Export Restrictions
Discover more about AI and big data from industry experts at the AI & Big Data Expo in Amsterdam, California, and London. This event is co-located with the Intelligent Automation Conference, BlockX, Digital Transformation Week, and Cyber Security & Cloud Expo.
Explore upcoming enterprise technology events and webinars hosted by TechForge here.
Related article
IBM Power11 Boosts Enterprise AI with Uninterrupted Performance
IBM’s Power11 enterprise servers tackle a key issue in enterprise computing: deploying AI workloads while maintaining the robust reliability required for mission-critical applications. Launched on Jul
AI-Powered Retail Experiment Fails Spectacularly at Anthropic
Imagine handing over a small shop to an artificial intelligence, entrusting it with everything from pricing to customer interactions. What could go wrong?A recent Anthropic study, released on Friday,
Unleash Your Artistic Potential with Advanced Generative Media Tools
We’re thrilled to unveil our latest generative media models, delivering groundbreaking advancements. These models produce stunning images, videos, and music, enabling artists to transform their creati
Comments (0)
0/200
Alibaba has introduced Wan2.1-VACE, an open-source AI model poised to transform video creation and editing processes.
VACE is a key component of Alibaba’s Wan2.1 video AI model family, with the company claiming it’s the “first open-source model in the industry to deliver a comprehensive solution for diverse video generation and editing tasks.”
If Alibaba can streamline the video production process, consolidating multiple tools into a single platform, it could redefine industry standards.
What can VACE do? It generates videos from various inputs, such as text prompts, static images, or short video clips.
Beyond video creation, its editing capabilities include using reference images or frames to guide the AI, advanced video “repainting” features, modifying specific video sections, and extending video duration. Alibaba states these tools “empower users to combine tasks flexibly, boosting creative potential.”

Picture creating a video featuring specific characters based on photos you provide. VACE can reportedly make it happen. Have a static image you want animated? This open-source AI model can add lifelike motion to it.
For precision editing, VACE offers “video repainting” tools, enabling pose transfers between subjects, detailed motion control, depth adjustments, and color modifications.
A standout feature is its ability to “add, modify, or remove specific video areas without impacting the surroundings.” This is a game-changer for precise edits, ensuring backgrounds remain untouched. It can also expand the video canvas, filling new areas with contextually relevant content for a richer, more immersive result.
With VACE, you can transform a static photo into a video, dictate object movements by defining paths, swap characters or objects using references, animate those references, or precisely control their poses.
Alibaba highlights VACE’s ability to convert a tall, narrow image into a widescreen video, intelligently expanding it by incorporating additional elements from reference images or prompts.
VACE’s capabilities are powered by sophisticated technology designed to tackle the complexities of video editing. The Video Condition Unit (VCU) “enables unified processing of multimodal inputs like text, images, videos, and masks.”
Additionally, the “Context Adapter structure” integrates “formalized representations of temporal and spatial dimensions,” giving the AI a deep understanding of time and space within videos.
Alibaba envisions VACE excelling in applications like social media content creation, dynamic advertising, professional post-production for film and TV, and customized educational or training videos.
Alibaba Shares Wan2.1-VACE as Open-Source to Empower Creators
Developing advanced AI models typically demands significant resources, including vast computational power and data. Alibaba’s decision to open-source Wan2.1-VACE is a significant move.
“Open access reduces barriers, allowing more businesses to harness AI for creating tailored, high-quality visual content efficiently and affordably,” Alibaba notes.
This move aims to empower smaller businesses and individual creators by providing access to cutting-edge AI tools without high costs, fostering broader innovation.
Alibaba offers two versions: a robust 14-billion parameter model for high-performance systems and a lighter 1.3-billion parameter model for less demanding setups. Both are available for free on Hugging Face, GitHub, and Alibaba Cloud’s ModelScope community.
See also: US Tightens AI Diffusion Rules, Strengthens Chip Export Restrictions
Discover more about AI and big data from industry experts at the AI & Big Data Expo in Amsterdam, California, and London. This event is co-located with the Intelligent Automation Conference, BlockX, Digital Transformation Week, and Cyber Security & Cloud Expo.
Explore upcoming enterprise technology events and webinars hosted by TechForge here.












