Image Generation (Nano Banana / Nano Banana Pro) | Gemini

What is AI Image Generation?

AI image generation creates original images from text descriptions. You describe what you want ("a cozy cabin in the mountains at sunset, watercolor style") and the AI produces the image in seconds. Modern image generators can create photorealistic photos, illustrations, logos, UI mockups, and artistic compositions. You can also upload existing images and ask the AI to edit, extend, or restyle them.

Common use cases

Creating marketing visuals, social media graphics, or presentation illustrations on demand
Generating product mockups, concept art, or design variations quickly
Editing existing photos: removing backgrounds, changing styles, or extending an image beyond its original borders

Gemini includes native image generation and editing capabilities directly in the conversation. Two models are available, each built for different use cases.

Nano Banana vs Nano Banana Pro

Nano Banana

Gemini 2.5 Flash Image

Max resolution

~1 MP (1024x1024)

Speed

3 to 4x faster

Text rendering

Basic

Complex prompts

Simple prompts only

Thinking mode

Best for

Quick drafts, social media

Nano Banana Pro

Gemini 3.1 Pro Image Preview

Max resolution

Up to 4K (4096x4096)

Speed

Slower, higher quality

Text rendering

Best in class

Complex prompts

Lighting, angles, depth of field

Thinking mode

Yes (complex multi element compositions)

Best for

Professional assets, branding

Nano Banana Pro launched globally on November 20, 2025, and is the default model on paid plans. Free tier users get Nano Banana Pro with limited daily quota; when exhausted, they revert to the base Nano Banana model.

Key Capabilities

Text to ImageDescribe any image and Gemini creates it: photos, illustrations, logos, art, diagrams, and infographics.

Image EditingUpload an existing image and modify it using natural language ("make the sky sunset colored," "remove the person on the left").

Character ConsistencyUpload up to 5 reference images of a person or character, and Nano Banana Pro maintains their appearance across multiple generations.

Multi Image CompositionProvide multiple reference images and Nano Banana Pro blends them into a single coherent scene.

Multilingual Text GenerationGenerate text overlays in multiple languages within the same image.

Style TransferUpload a style reference and apply it to new content while preserving the subject.

World KnowledgeGemini uses its general knowledge during generation, so prompts like "the Eiffel Tower at sunset in the style of Monet" work without additional context.

Limits Per Plan

	Free	AI Plus	AI Pro	AI Ultra
Images per day	~2	More than Free	~50 to 100	~1,000
Max resolution	1 MP (1024×1024)	Higher than Free	2K	4K (4096×4096)
Watermark
Nano Banana Pro	Limited daily quota

Google uses approximate limits because actual capacity varies by server load. Plan for the lower end of the range to avoid hitting limits unexpectedly.

Comparison to Competitors

	Nano Banana Pro	ChatGPT GPT Image	Flux
Text rendering	Excellent	Excellent	Good
Character consistency	Strong (up to 5 refs)	Good (conversation context)	Requires LoRA training
Resolution	Up to 4K	Up to ~2K	Up to 4K+
Creative control	Strong (lighting, angles, style)	Good	Excellent (ControlNet, etc.)
Speed	Fast (base) / Moderate (Pro)	Moderate	Varies by provider
Integration	Built into Gemini chat	Built into ChatGPT	Standalone / API only
Natural language editing			No (requires separate tools)

Image Generation Tips

Be specific: "A golden retriever puppy sitting in a field of sunflowers, golden hour lighting, shallow depth of field, photorealistic" produces better results than "a dog in flowers."
Specify style explicitly: Always mention the artistic style (photorealistic, watercolor, pixel art, 3D render, flat design, Studio Ghibli, etc.).
Use reference images: Upload an existing image and say "Create something in this style but with [changes]" for more predictable results.
For character consistency: Upload clear, well lit reference photos from multiple angles. The more reference images (up to 5), the more consistent the output.
For text in images: Be explicit about font style, size, placement, and color. Nano Banana Pro handles text significantly better than the base model.
Iterate in conversation: Gemini remembers previous generations, so you can say "make the background darker" or "add a hat" without re describing the whole scene.

API Pricing

Separate image generation models (Imagen 4) are also available through the API for developers. Imagen 4 Fast is priced at $0.02/image, Imagen 4 Standard at $0.04/image, and Imagen 4 Ultra at $0.06/image.