by Google
Google's native image generation via Gemini with exceptional prompt adherence and JSON prompt support.
Gemini Imagen 3 is Google DeepMind's state-of-the-art text-to-image generation model, available through the Gemini API and Vertex AI. It excels at photorealistic imagery and diverse art styles, and it follows complex compositional prompts with high fidelity.
Imagen 3 responds well to detailed natural language descriptions:
A weathered fisherman mending nets at a harbor during golden hour,
realistic photography, shallow depth of field, warm color palette
Ask for specific styles to get consistent results:
An impressionist oil painting of a Japanese garden in autumn,
thick brushstrokes, vibrant oranges and reds, Monet-inspired
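Both example prompts above follow the same shape: a subject followed by comma-separated style descriptors. A minimal sketch of assembling prompts that way is shown here; `compose_prompt` is a hypothetical helper written for illustration and is not part of any Google SDK.

```python
def compose_prompt(subject: str, *descriptors: str) -> str:
    """Join a subject and optional style descriptors into one prompt string.

    Hypothetical helper: it only formats text and does not call any API.
    Blank descriptors are skipped so the result has no empty segments.
    """
    parts = [subject.strip()]
    parts.extend(d.strip() for d in descriptors if d.strip())
    return ", ".join(parts)


# Rebuild the style example from the text above.
prompt = compose_prompt(
    "An impressionist oil painting of a Japanese garden in autumn",
    "thick brushstrokes",
    "vibrant oranges and reds",
    "Monet-inspired",
)
print(prompt)
```

Keeping the subject and the descriptors separate makes it easier to reuse one style across many subjects when consistent results matter.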
Pass negative_prompt as a separate API parameter; do not put exclusions in the main prompt text.
| Parameter | Description |
|---|---|
| aspect_ratio | 1:1, 16:9, 9:16, 4:3, 3:4 |
| number_of_images | Number of outputs (1-4) |
| negative_prompt | Separate API parameter for exclusions |
| person_generation | dont_allow or allow_adult |
| seed | Reproducibility seed |
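The parameters in the table can be checked before a request is sent. The sketch below is an assumption-laden illustration: `build_image_config` and the shape of the dictionary it returns are hypothetical, not the Gemini API or Vertex AI request format. Only the parameter names and allowed values come from the table.

```python
from typing import Any, Dict, Optional

# Allowed values as listed in the parameter table above.
ALLOWED_ASPECT_RATIOS = {"1:1", "16:9", "9:16", "4:3", "3:4"}
ALLOWED_PERSON_GENERATION = {"dont_allow", "allow_adult"}


def build_image_config(
    prompt: str,
    aspect_ratio: Optional[str] = None,
    number_of_images: int = 1,
    negative_prompt: Optional[str] = None,
    person_generation: Optional[str] = None,
    seed: Optional[int] = None,
) -> Dict[str, Any]:
    """Validate the table's parameters and return a plain dict.

    Hypothetical helper: the returned layout is for illustration only
    and would need mapping onto whichever client library is in use.
    """
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    if not 1 <= number_of_images <= 4:
        raise ValueError("number_of_images must be between 1 and 4")

    config: Dict[str, Any] = {"number_of_images": number_of_images}

    if aspect_ratio is not None:
        if aspect_ratio not in ALLOWED_ASPECT_RATIOS:
            raise ValueError(f"unsupported aspect_ratio: {aspect_ratio}")
        config["aspect_ratio"] = aspect_ratio
    if person_generation is not None:
        if person_generation not in ALLOWED_PERSON_GENERATION:
            raise ValueError(f"unsupported person_generation: {person_generation}")
        config["person_generation"] = person_generation
    if negative_prompt is not None:
        # Exclusions stay out of the main prompt text.
        config["negative_prompt"] = negative_prompt
    if seed is not None:
        config["seed"] = seed

    return {"prompt": prompt, "config": config}


request = build_image_config(
    "A weathered fisherman mending nets at a harbor during golden hour",
    aspect_ratio="16:9",
    number_of_images=2,
    negative_prompt="text, watermark",
    seed=42,
)
print(request["config"])
```

Validating locally surfaces a bad aspect ratio or image count as a clear `ValueError` instead of a failed remote call.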
