Skip to main content

Overview

OpenAI’s latest image generation model, built on GPT Image 1.5, a natively multimodal architecture that processes text and images through a single unified network rather than treating them as separate systems. The practical result is a model that follows complex instructions with unusual precision, handles dense text rendering accurately, and makes targeted edits to images without destabilizing everything else in the frame. It’s slower and more credit-intensive than the Fast Models, but for tasks that require exact prompt adherence, legible text within images, or controlled iterative editing, it’s one of the most capable options on the platform.

Getting Started

  1. Go to Image Generation — Navigate to krea.ai/image and select this model from the dropdown.
  2. Select ChatGPT 1.5 — Open the model picker and choose ChatGPT 1.5 from the Intelligent Models section.
  3. Write your prompt — Be as specific and descriptive as possible. ChatGPT 1.5 is built for precise instruction following, so detailed prompts produce noticeably better results than vague ones.
  4. Add reference images (optional) — Upload images to guide composition, style, or subject matter.
  5. Choose your aspect ratio — Select portrait, landscape, or square depending on your use case.
  6. Generate — Click Generate. ChatGPT 1.5 is slower than fast models, but the output quality reflects the added processing time.
  7. Iterate — Ask for specific changes to your result. ChatGPT 1.5 will modify only what you ask for while keeping the rest of the image consistent.

At a Glance

FeatureDetail
SpeedSlow (1/3)
Credits~150 per generation
Underlying modelGPT Image 1.5 (OpenAI)
Best atComplex prompts, text rendering, precise image editing
Supported sizes1:1 square, 3:2 landscape, 2:3 portrait
Style reference supportYes

When to Use ChatGPT 1.5

ChatGPT 1.5 is the right model to reach for when precision matters more than speed. Its natively multimodal architecture means it understands the relationship between text and image at a deeper level than most models, which translates into stronger prompt adherence and more reliable results on complex or layered requests. Its text rendering capability is particularly strong. Where many models struggle to produce legible, correctly spelled text within an image, ChatGPT 1.5 handles dense and small-scale text accurately, making it a solid choice for any prompt that includes signage, typography, labels, or diagrams. It also excels at iterative editing. When you ask it to change one specific thing in an image, it adjusts only what you specified while preserving facial likeness, lighting, composition, and color tone across the rest of the frame. This addresses one of the most common frustrations with AI image generation, where asking for a small edit causes the entire image to be regenerated from scratch.
Use WhenAvoid When
Your prompt is complex and requires precise interpretationYou need fast results or are in an early drafting phase
Your image needs to include legible textYou’re on a tight credit budget
You need to make specific edits without changing the whole imageYou want a heavily stylized or artistic output
You’re working on diagrams, characters, or detailed scenesYou need LoRA style support
Facial likeness or visual consistency across edits matters

Common Use Cases

  • Diagrams and infographics: Technical illustrations with accurate labels and text
  • Character design: Consistent character appearance across multiple iterations
  • Marketing visuals: Layouts with readable copy, logos, or product callouts
  • Photo editing: Targeted modifications to existing images without full regeneration
  • Complex scenes: Multi-element compositions that require precise spatial relationships

Prompting Tips

Writing effective prompts

  • Write prompts the way you would give a detailed creative brief — describe subject, style, lighting, composition, and mood explicitly
  • For text within images, specify the exact wording, font style, size, and placement
  • Describe spatial relationships clearly: “a red mug on the left side of a white table, window light from the right”
  • ChatGPT 1.5 handles long, detailed prompts well — don’t abbreviate when you can be specific

Iterating on results

  • When editing, describe only the change you want and leave everything else unspecified — the model will preserve what you don’t mention
  • For character work, establish the appearance in your first generation, then reference it explicitly in follow-up edits
  • If the result isn’t quite right, refine your prompt language rather than regenerating with the same text

Getting the most out of text rendering

  • Put any text you need in the image in quotation marks within your prompt
  • Specify font style if it matters: “sans-serif,” “handwritten,” “bold uppercase”
  • For dense text layouts like posters or diagrams, break the layout into clear sections in your prompt

Examples

A photorealistic night scene on a narrow Barcelona street, warm amber streetlights , Gothic Quarter architecture lining both sides. In the foreground, a small tapas stall with a glowing sign reading "EL RACÓ" in bold yellow letters, a handwritten menu board underneath listing "Patatas Bravas, Croquetas, Pan con Tomate." Locals and tourists passing by, neon signs in Spanish and Catalan in the background.
A Photorealistic Night Scene On A Narrow Barcelona Street Warm Amber Streetlights Gothic Quarter A W6gluq1p0vlaxp7cvtll 1

Infographics

ChatGPT 1.5 is one of the strongest models on Krea for infographic generation. Unlike most models that simply place text onto an image, it reasons about hierarchy, spacing, and visual organization, understanding the relationship between written content and layout at a structural level. Combined with its accurate dense text rendering, it can take a complex multi-section prompt and return something that looks considered rather than approximate. A step-by-step process infographic titled "How Sourdough Bread is Made," showing 8 stages from starter to finished loaf — feeding the starter, mixing the dough, autolyse, bulk fermentation, shaping, proofing, scoring, and baking — each with a small hand-drawn style illustration and a time indicator. Warm cream background, hand-lettered headings, rustic editorial feel.
Omni D44c51a4 2adc 48c1 A5ac 45f045a22ba1

Complex scenes

Multi-element compositions with specific spatial relationships, interactions between subjects, and layered environmental detail. A busy Berlin market hall at 5am, three vendors in rubber aprons arranging fresh fish on crushed ice in the foreground, a fourth vendor mid-negotiation with a restaurant buyer in the middle ground, wooden crates stacked to the left, hanging overhead lights casting warm pools of yellow light across wet concrete floors, steam rising from a small food cart in the background selling hot broth to early morning workers, exposed iron roof structure and brick walls characteristic of a 19th century German markthalle visible above, depth of field pulling focus from the foreground vendors to the hazy activity behind, photorealistic, shot on 35mm.
A Busy Berlin Market Hall At 5am Three Vendors In Rubber Aprons Arranging Fresh Fish On Crushed Ice 77hnonwv5pjlnu91pdfh 0

Explicit Edit Instructions

ChatGPT Image 1.5 is significantly better at following direct image edit instructions. You can now treat prompts like precise change requests instead of re-describing the entire image. Edit the uploaded image. Remove the person in the background on the left in the pink shirt. Keep the lighting unchanged. Preserve facial identity and skin texture of the main subjects. Maintain original camera angle and depth of field.
588f5b43b90a4a22e9a2a7a14f8b7a50
Img