Google shipped three Nano Banana image models to general availability on June 30, 2026. The Lite variant (gemini-3.1-flash-lite-image) covers high-speed, low-cost generation. The standard Nano Banana 2 (gemini-3.1-flash-image) handles the volume middle tier. Nano Banana Pro (gemini-3-pro-image-preview) is the top tier: a dedicated editing endpoint, 4K output, Search-grounded generation, and the strongest in-image text rendering of any model in the Gemini stack.
This guide covers what Nano Banana Pro adds over its siblings, when the cost difference is justified, and how to integrate both the generation and editing endpoints.
Model IDs
Nano Banana Pro ships with two distinct endpoints:
| Mode | Model ID |
|---|---|
| Generation | gemini-3-pro-image-preview |
| Editing | gemini-3-pro-image-preview-edit |
The “preview” suffix in the model ID is retained even at GA — use these IDs exactly; the preview variants do not point to the same models.
What Nano Banana Pro Adds
Text rendering in images
In-image text has been a reliable failure mode across image generation models. Nano Banana Pro is Google’s answer: it reliably produces readable text in generated images, including long passages and multilingual layouts. For builders generating infographics, data visualizations, UI mockups, or any asset where legible text matters, this is the first Gemini image model where text rendering is not a workaround problem.
Editing endpoint
The gemini-3-pro-image-preview-edit endpoint accepts an original image and a natural language instruction. It preserves everything not explicitly asked to change. This is useful for iterative creative workflows — adjust a background, swap a color scheme, add a product to an existing scene — without regenerating from scratch.
Reference image submission uses base64 encoding in the reference_images parameter:
from google import genai
from google.genai import types
client = genai.Client()
with open("original.png", "rb") as f:
image_bytes = f.read()
response = client.models.edit_image(
model="gemini-3-pro-image-preview-edit",
prompt="Make the background a sunset gradient",
reference_images=[
types.RawReferenceImage(
reference_image=types.Image(image_bytes=image_bytes),
reference_id=0,
)
],
config=types.EditImageConfig(
number_of_images=1,
output_mime_type="image/png",
),
)
4K output
Both Nano Banana 2 and Nano Banana Lite top out at lower resolutions. Nano Banana Pro supports custom dimensions up to 4096×4096, configured via output_image_config:
config=types.GenerateImagesConfig(
number_of_images=1,
output_mime_type="image/png",
aspect_ratio="16:9",
output_image_config={"width": 3840, "height": 2160} # 4K 16:9
)
4K generation bills at $0.24 per image rather than the standard $0.134.
Search grounding
Nano Banana Pro can ground image generation in real-world data via Search. This constrains outputs to factual accuracy — useful for location-specific imagery, current event illustration, or product environments — but limits surreal or highly stylized outputs. For most builder use cases, grounding is opt-in; leave it disabled when stylization matters more than accuracy.
Multi-subject identity preservation
Nano Banana Pro preserves consistent visual identity for up to five distinct subjects across a generation or editing session. For product photography workflows, character sheets, or multi-person scenes, this is the enabling capability that Nano Banana 2 does not fully support.
Generation API
Basic generation follows the same generate_images pattern as other Nano Banana models:
response = client.models.generate_images(
model="gemini-3-pro-image-preview",
prompt="An infographic comparing quarterly revenue across five product lines, clean sans-serif typography, white background",
config=types.GenerateImagesConfig(
number_of_images=1,
output_mime_type="image/png",
aspect_ratio="16:9",
),
)
image_data = response.generated_images[0].image.image_bytes
Supported aspect ratios: 1:1, 16:9, 9:16, 4:3, 3:4.
SDK minimums: Python v1.52+, JavaScript/TypeScript v1.30+.
Pricing
| Resolution | Per-image | Batch/Flex |
|---|---|---|
| 1K or 2K | $0.134 | $0.067 |
| 4K | $0.24 | $0.12 |
Input tokens (prompts, reference images): $2.00/M. Output tokens: $12.00/M. At 1K/2K, each image consumes approximately 1,120 output tokens; 4K images consume approximately 2,000 output tokens.
Batch/Flex mode (Vertex AI only) cuts rates in half with asynchronous delivery — useful for large overnight generation runs where latency is not a constraint.
Free tier: 200 images per day through Google AI Studio before billing begins.
Decision Guide: Nano Banana Pro vs. Nano Banana 2 vs. Imagen 4 Ultra
| Criterion | Nano Banana Pro | Nano Banana 2 | Imagen 4 Ultra |
|---|---|---|---|
| Price (2K) | $0.134 | ~$0.02–$0.04 | $0.06 |
| Speed | 2–5s | <2s | 15–30s |
| Text in image | Strong | Basic | Weak |
| Image editing | Dedicated endpoint | Not available | Not available |
| 4K output | Yes ($0.24) | No | No |
| Photorealism | Good | Moderate | Industry-best |
| Search grounding | Yes | No | No |
| Multi-subject identity | Up to 5 | Limited | Limited |
| Best for | Infographics, editing, complex scenes | High-volume iteration | Portrait, product photo |
Use Nano Banana Pro when: text legibility in the image matters; you need an editing endpoint to modify existing assets; 4K output is required; or you need factual accuracy from Search grounding.
Use Nano Banana 2 when: you are generating at volume and cost efficiency matters more than text rendering or editing capabilities.
Use Imagen 4 Ultra when: photorealistic portraits or product photography is the primary output, and speed is not a constraint.
SynthID Watermarking
All Nano Banana Pro outputs embed an invisible SynthID watermark in the pixel data. The watermark is non-optional, does not affect visual quality, and is detectable by Google’s verification tools. Workflows that pass images downstream should account for watermark persistence through compression and light editing operations.
Builder Checklist
- Update SDK to Python v1.52+ or JS/TS v1.30+ — earlier versions lack typed support for
gemini-3-pro-image-preview - Use
gemini-3-pro-image-previewfor generation,gemini-3-pro-image-preview-editfor editing — they are distinct model IDs, not interchangeable - For 4K output, set
output_image_configwith explicitwidth/height— theaspect_ratioparam alone caps at 2K - Enable Batch/Flex mode (Vertex AI) for offline generation runs to halve per-image costs
- Disable Search grounding when stylization matters more than factual accuracy
- Test text rendering quality against your specific font and language requirements before committing to Nano Banana Pro — if your use case does not embed text, Nano Banana 2 covers the same use case at 3–7x lower cost
- For multi-subject workflows, confirm identity preservation across the images you intend to generate — it is accurate for up to five subjects but may degrade beyond that
- Connect to the Veo 3.1 pipeline when you need video output: generate a keyframe with Nano Banana Pro, pass it as a start frame to Veo 3.1 for high-quality video continuation
What to Watch
Google has not announced a GA model ID without the “preview” suffix yet. If Google moves to gemini-3-pro-image at stable GA, existing integrations using gemini-3-pro-image-preview will need a model ID update — pin your model ID as a configuration constant rather than hardcoding it inline to make this migration a single-line change.