Whisk: AI-Powered Image Remix and Visualization Tool

Today, we're excited to introduce Whisk, our latest venture into the realm of generative AI, exclusively launching in the US. Unlike traditional methods where you'd need to craft lengthy text prompts to generate images, Whisk simplifies the process by allowing you to use images as prompts. All you need to do is drag and drop your images into the tool and start creating.
With Whisk, you can input three different images: one for the subject, another for the scene, and a third for the style. This allows you to mix and match these elements to produce something entirely unique, whether it's a digital plushie, an enamel pin, or a sticker. It's all about remixing to craft something that's distinctly yours.
Whisk - fantastical fish - generated image example
Whisk - whimsical walrus - generated image example
Whisk - glazed doughnut with sprinkles - generated enamel pin example
Whisk - fantastical cat with horns - generated image example
Behind the scenes, Whisk leverages the power of the Gemini model to automatically generate detailed captions of your input images. These captions are then fed into Google’s cutting-edge image generation model, Imagen 3. This method focuses on capturing the essence of your subject rather than creating an exact replica, enabling you to remix your subjects, scenes, and styles in innovative ways.
Since Whisk focuses on extracting just a few key characteristics from your images, the resulting creations might not always match your initial expectations. For instance, the generated subject could vary in height, weight, hairstyle, or skin tone. We recognize that these details can be vital for your project, and sometimes Whisk might not hit the mark. That's why we allow you to view and tweak the underlying prompts at any time.
During our initial tests with artists and creatives, Whisk has been described as a novel type of creative tool, distinct from traditional image editors. We designed it to facilitate rapid visual exploration, not for precise, pixel-perfect edits. It's all about sparking new ideas and letting you sift through numerous options until you find the ones that resonate with you.
If you're in the US, you can dive into Whisk today at labs.google/whisk and share your thoughts with us.
Google Labs is our playground for experimenting with the latest in generative AI, including models like Gemini, Imagen, and Veo. Our aim is to gather feedback on these new products and features as we collaboratively shape the future of technology. Keep up with Whisk and other exciting experiments by subscribing to our newsletter and following Google Labs on X, Reddit, and Discord.
Related article
Kakao Mobility outlines Level 4 autonomous driving roadmap for physical AI
Kakao Mobility is planning to develop Level 4 autonomous driving technologies internally as part of its physical AI strategy.
At the 2026 World IT Show conference in Seoul's COEX, Kim Jin-kyu — vice president and head of Kakao Mobility's Physical AI
Barry Diller: Trust in Sam Altman irrelevant as AGI nears
Barry Diller, the billionaire media titan, does not believe OpenAI CEO Sam Altman is untrustworthy, despite recent reports suggesting otherwise. Speaking at the Wall Street Journal's "Future of Everything" conference this week, Diller defended Altman
YouTube expands AI deepfake detection to politicians, government officials, and journalists
On Tuesday, YouTube announced it is expanding its deepfake detection technology to a select group of government officials, political candidates, and journalists. The tool identifies AI-generated likenesses and lets pilot participants request the remo
Related Special Topic Recommendations
Comments (26)
0/500
So you're telling me I can finally stop writing those novel-length prompts? 😂 This is a game-changer for visual thinkers like me. The US-only launch is a bummer though—hope it goes global soon! The image-as-prompt approach could really shake up how we prototype ideas.
Интересный подход — использовать изображения вместо текстовых подсказок! Это кажется гораздо более интуитивным способом для творчества. Но вот вопрос: как сервис справляется с авторскими правами на исходные картинки? Тема всегда скользкая в AI-индустрии. Хотелось бы попробовать, но пока только для США... Ждём расширения доступности 🌍
Parece interesante la idea de usar imágenes como prompts en lugar de texto, pero me preocupa cómo esto podría afectar los derechos de autor 😬. ¿Podrían las imágenes generadas terminar siendo usadas sin permiso de los creadores originales? La herramienta suena divertida, pero las implicaciones legales podrían ser complicadas.
Finde die Idee echt erfrischend! 🎨 Endlich muss man nicht mehr ellenlange Textbeschreibungen verfassen, um coole Bilder zu generieren. Aber frage mich, wie gut das mit komplexen Konzepten klappt – kann ein Bild wirklich so präzise sein wie ein detaillierter Prompt? Hoffe, das Tool kommt bald auch nach Europa!
Enfin un outil qui comprend qu'on est tous fatigués des prompts interminables ! 😅 Whisk a l'air super intuitif, mais est-ce que ça marche aussi bien avec des photos mal cadrées ? Vivement la version européenne !

Today, we're excited to introduce Whisk, our latest venture into the realm of generative AI, exclusively launching in the US. Unlike traditional methods where you'd need to craft lengthy text prompts to generate images, Whisk simplifies the process by allowing you to use images as prompts. All you need to do is drag and drop your images into the tool and start creating.
With Whisk, you can input three different images: one for the subject, another for the scene, and a third for the style. This allows you to mix and match these elements to produce something entirely unique, whether it's a digital plushie, an enamel pin, or a sticker. It's all about remixing to craft something that's distinctly yours.
Whisk - whimsical walrus - generated image example
Whisk - glazed doughnut with sprinkles - generated enamel pin example
Whisk - fantastical cat with horns - generated image example
Behind the scenes, Whisk leverages the power of the Gemini model to automatically generate detailed captions of your input images. These captions are then fed into Google’s cutting-edge image generation model, Imagen 3. This method focuses on capturing the essence of your subject rather than creating an exact replica, enabling you to remix your subjects, scenes, and styles in innovative ways.
During our initial tests with artists and creatives, Whisk has been described as a novel type of creative tool, distinct from traditional image editors. We designed it to facilitate rapid visual exploration, not for precise, pixel-perfect edits. It's all about sparking new ideas and letting you sift through numerous options until you find the ones that resonate with you.
If you're in the US, you can dive into Whisk today at labs.google/whisk and share your thoughts with us.
Google Labs is our playground for experimenting with the latest in generative AI, including models like Gemini, Imagen, and Veo. Our aim is to gather feedback on these new products and features as we collaboratively shape the future of technology. Keep up with Whisk and other exciting experiments by subscribing to our newsletter and following Google Labs on X, Reddit, and Discord.
Barry Diller: Trust in Sam Altman irrelevant as AGI nears
Barry Diller, the billionaire media titan, does not believe OpenAI CEO Sam Altman is untrustworthy, despite recent reports suggesting otherwise. Speaking at the Wall Street Journal's "Future of Everything" conference this week, Diller defended Altman
YouTube expands AI deepfake detection to politicians, government officials, and journalists
On Tuesday, YouTube announced it is expanding its deepfake detection technology to a select group of government officials, political candidates, and journalists. The tool identifies AI-generated likenesses and lets pilot participants request the remo
So you're telling me I can finally stop writing those novel-length prompts? 😂 This is a game-changer for visual thinkers like me. The US-only launch is a bummer though—hope it goes global soon! The image-as-prompt approach could really shake up how we prototype ideas.
Интересный подход — использовать изображения вместо текстовых подсказок! Это кажется гораздо более интуитивным способом для творчества. Но вот вопрос: как сервис справляется с авторскими правами на исходные картинки? Тема всегда скользкая в AI-индустрии. Хотелось бы попробовать, но пока только для США... Ждём расширения доступности 🌍
Parece interesante la idea de usar imágenes como prompts en lugar de texto, pero me preocupa cómo esto podría afectar los derechos de autor 😬. ¿Podrían las imágenes generadas terminar siendo usadas sin permiso de los creadores originales? La herramienta suena divertida, pero las implicaciones legales podrían ser complicadas.
Finde die Idee echt erfrischend! 🎨 Endlich muss man nicht mehr ellenlange Textbeschreibungen verfassen, um coole Bilder zu generieren. Aber frage mich, wie gut das mit komplexen Konzepten klappt – kann ein Bild wirklich so präzise sein wie ein detaillierter Prompt? Hoffe, das Tool kommt bald auch nach Europa!
Enfin un outil qui comprend qu'on est tous fatigués des prompts interminables ! 😅 Whisk a l'air super intuitif, mais est-ce que ça marche aussi bien avec des photos mal cadrées ? Vivement la version européenne !





Home






