Google Whisk
Image, by Google
An experimental image remixing tool from Google Labs that lets you use images as prompts instead of writing text descriptions.
Key features
Free tier available, AI Pro at $19.99/mo
Best for creative professionals exploring visual ideas rapidly by combining reference images instead of writing text prompts
Free tier with 50 daily AI credits lets you experiment without any commitment or credit card.
What it does
Image to Image Remixing
Three input slots (subject, scene, style) accept images instead of text prompts. Drag in photos, screenshots, or AI generated images and Whisk blends them into something new.
Whisk Animate
Turn any static image into an 8 second animated video clip powered by Veo 3. Bring subjects to life with motion, camera movement, and ambient effects.
Editable AI Prompts
Gemini generates text captions from your input images automatically. You can view, edit, and refine these generated prompts before Imagen 3 produces the final output, giving you precise control.
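Whisk has no public API, so this flow cannot be scripted against the real service. As a rough mental model only, the sketch below mimics the caption, edit, and combine steps in plain Python. The `caption_image` stub, the `Remix` class, and the prompt format are all hypothetical; they do not reflect how Gemini or Imagen 3 are actually invoked.

```python
from dataclasses import dataclass, field

SLOTS = ("subject", "scene", "style")  # the three input slots Whisk names


def caption_image(path: str) -> str:
    """Hypothetical stand-in for Gemini captioning: returns a placeholder caption."""
    return f"caption of {path}"


@dataclass
class Remix:
    """Holds one caption per filled slot; captions stay editable until generation."""
    captions: dict[str, str] = field(default_factory=dict)

    def add_image(self, slot: str, path: str) -> None:
        # Only the three known slots are accepted.
        if slot not in SLOTS:
            raise ValueError(f"unknown slot: {slot}")
        self.captions[slot] = caption_image(path)

    def edit_caption(self, slot: str, text: str) -> None:
        # A caption can only be edited once its slot has an image.
        if slot not in self.captions:
            raise KeyError(f"slot not filled: {slot}")
        self.captions[slot] = text

    def build_prompt(self) -> str:
        """Join the captions in slot order into one text prompt (assumed format)."""
        return "; ".join(f"{s}: {self.captions[s]}" for s in SLOTS if s in self.captions)


remix = Remix()
remix.add_image("subject", "cat.png")
remix.add_image("style", "watercolor.jpg")
remix.edit_caption("subject", "a ginger cat wearing a red scarf")
print(remix.build_prompt())
```

The point of the model is that the edited caption, not the original image, is what feeds the final prompt, which is why refining the generated text changes the output.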
Quick Picks and Preset Styles
Browse a curated library of preset styles (watercolor, pixel art, claymation, sticker, and more) to instantly apply a visual aesthetic without providing your own style image.
Iterative Refinement
Not satisfied with the first result? Regenerate, swap input images, adjust the generated prompts, or remix further. Each generation costs AI credits, so the number of iterations is limited by your credit allocation.
Multi Language Support
Available in 37 languages, making image remixing accessible to a global audience without requiring English proficiency.
SynthID Watermarking
All generated images are invisibly watermarked with Google's SynthID technology, which embeds an imperceptible signal in the image itself to identify AI generated content.
Content Safety Protections
Built in safety filters prevent the generation of harmful, misleading, or policy violating content. Input images and generated outputs are screened automatically.
Pricing
Free
Try Whisk with 50 daily AI credits shared with Flow. No subscription required. Daily generation caps apply.
- 50 daily AI credits (shared with Flow)
- Image to image remixing (3 input slots)
- Editable AI prompts
- Quick picks and preset styles
- SynthID watermarking
- Daily generation caps
AI Plus
200 monthly AI credits shared with Flow and other Google AI tools. Suitable for casual creative exploration.
- 200 credits per month (shared with Flow)
- Everything in Free
- Higher monthly generation capacity
AI Pro
1,000 monthly AI credits shared with Flow. Includes Whisk Animate powered by Veo 3 Fast at 20 credits per generation.
- 1,000 credits per month (shared with Flow)
- Whisk Animate with Veo 3 Fast (20 credits per generation)
- Everything in AI Plus
- Best value for regular creative use
AI Ultra
25,000 monthly AI credits for maximum generation capacity. Shared with Flow and other Google AI tools.
- 25,000 credits per month (shared with Flow)
- Whisk Animate with Veo 3
- Everything in AI Pro
- Maximum generation capacity
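To make the tiers easier to compare, the credit arithmetic can be written out. The sketch below uses only figures listed in this pricing section: the monthly credits per paid tier and the 20 credit cost of a Veo 3 Fast animation on AI Pro. The names `MONTHLY_CREDITS`, `ANIMATE_COST`, `max_animations`, and `credits_left` are illustrative. The cost of an image generation and of Animate on AI Ultra is not stated here, so neither is modeled.

```python
# Monthly AI credits per paid tier, as listed in the pricing section.
MONTHLY_CREDITS = {"AI Plus": 200, "AI Pro": 1_000, "AI Ultra": 25_000}

# Credits per Whisk Animate generation with Veo 3 Fast (figure given for AI Pro).
ANIMATE_COST = 20


def max_animations(credits: int, cost: int = ANIMATE_COST) -> int:
    """Upper bound on animations if every credit goes to Whisk Animate."""
    return credits // cost


def credits_left(credits: int, animations: int, cost: int = ANIMATE_COST) -> int:
    """Credits remaining in the shared pool after a number of animations."""
    spent = animations * cost
    if spent > credits:
        raise ValueError("not enough credits for that many animations")
    return credits - spent


pro = MONTHLY_CREDITS["AI Pro"]
print(max_animations(pro))    # 50
print(credits_left(pro, 30))  # 400
```

Because the pool is shared with Flow, any credits spent there lower the starting figure passed to these functions.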
Pros & Cons
Pros
- Free tier with 50 daily AI credits lets you experiment without any commitment or credit card.
- Unique image as prompt approach eliminates the need to write text prompts; just drag in reference images and remix.
- No prompt engineering skills required. Gemini generates captions from your images automatically, making it accessible to anyone.
- Whisk Animate adds video generation from static images, powered by Veo 3, turning Whisk into a two in one creative tool.
- Iterative refinement lets you swap inputs, edit generated prompts, and regenerate until you get the result you want.
- Quick picks and preset styles provide instant visual aesthetics without needing to supply a style reference image.
Cons
- Not pixel perfect. Whisk captures the essence and feel of your input images, not an exact replica. Details like specific logos or precise compositions may not transfer faithfully.
- Not available in the EU, UK, India, and several other regions. Geographic restrictions limit access for a large portion of potential users.
- Browser only with no native app. There is no mobile or desktop application; you must use Whisk through a web browser.
- No API available. Developers cannot integrate Whisk's image remixing capabilities into their own applications or workflows programmatically.
- Experimental status means Google could change, restructure, or discontinue Whisk at any time without notice.
- AI credits are shared with Flow and other Google AI tools, so heavy video generation usage can eat into your Whisk budget.
How to get started
Visit Whisk in your browser
Navigate to labs.google/whisk in a supported country. Whisk runs entirely in the browser with no installation required. Sign in with your Google account.
Provide your input images
Drag images into the three input slots: Subject (what), Scene (where), and Style (how it looks). You can fill one, two, or all three slots. Leave a slot empty and Whisk will use a default or let you choose a quick pick.
Generate your remix
Click Generate and Whisk will use Gemini to caption your images, then Imagen 3 to produce a remixed output. Each generation costs AI credits from your daily or monthly allocation.
Refine and iterate
Review the result. You can edit the generated text prompt, swap input images, try different preset styles, or simply regenerate for a new variation. Iteration is key to getting the best results.
Try Whisk Animate (AI Pro and above)
If you have an AI Pro or AI Ultra subscription, use Whisk Animate to turn any static image into an 8 second video clip. AI Pro uses Veo 3 Fast at 20 AI credits per animation; AI Ultra uses Veo 3.
Download your creations
Download generated images and video clips to your device. All outputs include invisible SynthID watermarks identifying them as AI generated content.
Deep dive
Detailed guides with comparisons, tips, and visuals for each feature.
Image Remixing: Subject, Scene, and Style
How Whisk's three slot input system works, from image upload through Gemini captioning to Imagen 3 output.
Whisk Animate (Video from Images)
How Whisk Animate turns static images into 8 second video clips powered by Veo 3, and how the AI credit system works.
Whisk vs DALL-E vs Midjourney vs Nano Banana
How Whisk's image as prompt approach compares to text based AI image generators like DALL-E, Midjourney, and Google's own Nano Banana.
Similar Tools
Nano Banana 2
Image: Google's fastest AI image generator. Pro quality at Flash speed, with text rendering, 4K resolution, and image editing built into Gemini.
Veo
Video: A state of the art AI video model. Google DeepMind's Veo 3.1 produces cinematic quality clips with synchronized audio, dialogue, and sound effects.
Flow
Video: Google's dedicated AI filmmaking platform built on Veo, Imagen, and Gemini for creating cinematic video with camera controls, scene building, and native audio.
Last updated: 2026-02-21