AI image generators can do readable text in images now, I tested 8 to find the best one

Here are the best ones with 3 clear winners. My how things have changed.

Jan 31, 2025

Good morning! The winter sickness season is upon us. Both kiddos have colds, and my voice is gone, but I can still type! Lots of talk about DeepSeek in the news. Thought about writing a deep dive on it, but wanted to ask ya’ll!

A few years ago, AI image generators couldn’t do hands, and they couldn’t do text.

Hands were fixed in 2023, and text was fixed in mid 2024.

Now in the year of 2025, there are multiple AI generator tools and models that can do readable text well.

I did a ton of research and experimentation, here are the kings of readable text.

The Test: Make me a Pirate Map 🗺️

The base prompt was “a picture of a pirate holding a pirates map, with the words 'set sail for adventure' on the map”

I wanted something that focused on the text, but still left some room for interesting images.

The image generation models I tested were:

Dalle3 (via CharGPT)
Imagen 3 (via Gemini)
Ideogram 2
Midjourney v6
Flux
Aurora (via Grok)
Adobe Firefly Image Model 3.0
Leonardo.ai Phoenix

Dalle3 (via ChatGPT)

URL: chat.com

Free version: yes, you a certain number of free requests per day

Results: ✅✅ Excellent

A detailed illustration of a pirate holding a tattered pirate map in an adventurous pose. The pirate has a rugged look with a tricorn hat, an eyepatch, and a long coat with gold trim. The map is old and weathered, with 'Set Sail for Adventure' written prominently in elegant, hand-drawn script. The background features a dramatic seascape with a pirate ship in the distance and a tropical island with a treasure chest partially visible. The scene evokes a sense of excitement and exploration, with vibrant and dynamic colors.

Dalle3 has improved quite a bit in the last year. The text is well done, and the image itself is high quality. Some weird stuff like the sword is too short and branches, but that is common across all AI image gen.

Imagen 3 - Google Gemini

URL: gemini.google.com

Free version: yes, you a certain number of free requests per day

Results: ✅ great text, image as a whole is basic

The text is spot on but the overall image is lacking prompt coherence (where is the pirate?) and is just a bit boring. As of October 2024, image generation with Imagen 3 is available to all Gemini users worldwide or sites can use it via the Imagen 3 API through the Gemini API, which provides a Python SDK for generating images. You can also access it thru Googles labs here https://labs.google/fx/tools/image-fx

Ideogram 2.0

URL: ideogram.ai

Free version: yes, you get some initial free credits.

Results: ✅ Text is excellent. Picture as a whole is basic.

When Ideogram first launched on August 22, 2023, it was THE best model for getting accurate text in generated images. Basically the only one that would nail it. Ideogram still does text great, but the images a little basic.

Midjourney

URL: midjourney.com

Free version: yes you get 25 free credits, good for 50 images

Results: ❌ Fail

I love Midjourney. But it struggles with generating text in images. The images are dynamic and fun, and still my favorite, but the text just never works.

And sadly Midjourney did not improve with repeated prompting.

Flux

URL: blackforestlabs.ai

Free version: yes there are multiple version of the model, some are open source and so you can find lots of free sites.

Results: ✅✅ Excellent

Flux AI, developed by Black Forest Labs, launched on August 1, 2024. It has three versions: Pro (for commercial use) which is only available via their API or partners, Dev (open weights for non-commercial use), and Schnell (a faster open-weights version).

Black Forest Labs was founded by former Stability AI (makers of Stable Diffusion) employees including Robin Rombach and Patrick Esser. They raised $31 million in seed funding for the project.

Because the smaller models, dev and schnell, (~~what kind of a name is schnell?~~ Thank you to

Les Andrews

for teaching me that schnell is German for fast!) are open source, tons of websites and tools offer it. And to make it more confusing, there is a fork of dev called FLUX 1.0Pro (link here) where users are enhancing the dev model.

Here are a few sites which offer Flux models you can use, most have free tiers or starting credits:

freepik.com/ai/image-generator
flux1.ai
creator.nightcafe.studio/flux-image-generator
pinokio.computer - install and run Flux on your PC, whether you're using Windows, Mac, or Linux3
huggingface.co/black-forest-labs/FLUX.1-dev

Update - A sharp eyed reader noticed the pirate has an extra finger in the result image. So I ran a bunch more tests and in each, Flux nailed the fingers. In two of the results the text itself is a little messed up. But upon reflection and further tests, I still recommend Flux for text in images, but a good reminder that fingers are still hard sometimes in 2025.

Aurora (Grok)

URL: x.com/i/grok

Free version: yes, you get a bunch of daily requests.

Results: ❌ Fail

Grok used to use Flux, but it was replaced with X.ai’s own newer Aurora model. The text sometimes isn’t quite right, but surprisingly, you can make it do better by just asking it to do better. Which is always a weird quirk of AI. Sometimes you just ask it to try harder and it works.

Currently, Aurora is only on X.com. However the xAI API which is currently in public beta does give access to Grok Aurora, so developers could utilize it in their apps and sites. However I haven’t seen any sites offer it yet.

Adobe Firefly Image Model 3.0

URL: firefly.adobe.com

Free version: text to image is free on their website, they also built it into a bunch of their paid tools.

Results: ❌ Fail

Adobe has created a bunch of tools under their Firefly AI brand. They use various models. The model for image generation is Firefly Image Model 3.0. They primarily focus on AI in their paid tools, but they offer a free web version. Doesn’t work great for text in images however.

And Firefly didn’t improve with repeated prompting

I’ve heard that you can generate great text in images in their paid programs like Photoshop, because it is using your existing picture for context, but from 0-1 generation it doesn’t work.

Leonardo.ai Phoenix Image Generation Model

URL: leonardo.ai

Free version: You get 150 credits, each image takes 8 credits minimum so around 18 images for free.

Results: ✅✅ Excellent

Leonardo.ai has created their own image generation model. Not sure what they based it off of, most likely Stability AI or Flux. They actually offer Flux models as an option in their tool, as well as their own Phoenix model.

They offer prompt enhancement as well, so I enabled that and my prompt of “a picture of a pirate holding a pirates map, with the words 'set sail for adventure' on the map“ was turned into “A swashbuckling pirate, adorned with a worn leather eye patch and a feathered tricorn, stands proudly holding a yellowed, parchment-style map, with the words 'Set Sail for Adventure' scrawled in bold, crimson ink across the center, amidst scattered X's and crude illustrations of mythical sea creatures. The pirate's weathered skin sports a bushy beard, braided with a few silver trinkets, and a sly, toothy grin, as they gaze out at the viewer with a mischievous glint in their bright, blue eye. The map itself is torn and creased, with worn folds and a hint of sea salt lingering on its surface, as if it has seen many grand adventures on the high seas.”

The results are very nice.

This video covers some of their text generation capabilities.

Who is the readable text in an AI image king?

Dalle3 (via ChatGPT) ✅✅ Excellent - Great text with high quality images
Imagen 3 (via Gemini) ✅ Great text but basic images - Text is accurate but lacks prompt coherence
Ideogram 2.0 ✅ Text excellent but basic images - Among the first to master text but images are simple
Midjourney v6 ❌ Fail - Great images but consistently poor text rendering
Flux ✅✅ Excellent - Strong text rendering, available in multiple versions
Aurora (via Grok) ✅ - Inconsistent text quality, can sometimes improve with prompting
Adobe Firefly Image Model 3.0 ❌ Fail - Poor text rendering in from-scratch images
Leonardo.ai Phoenix ✅✅ Excellent - Great text generation

So there are clearly 3 main groups.

Text in Image Leaders

DALLE-3 (via ChatGPT): ✅✅ Produced excellent text quality with high-quality overall images
Flux (by Black Forest Labs): ✅✅ Delivered excellent text rendering, available in multiple versions including open-source options
Leonardo.ai Phoenix: ✅✅ Generated excellent text, especially when using their prompt enhancement feature

Mid

Imagen 3 (via Google Gemini): ✅ Good text quality but basic overall images
Ideogram 2.0: ✅ Accurate text but simple image composition
Aurora (via Grok): ✅ Produced inconsistent results, though sometimes improved with additional prompting

Fail

Midjourney v6: ❌ Despite creating dynamic images, consistently failed at text rendering
Adobe Firefly Image Model 3.0: ❌ Web platform struggled with text in from-scratch images

My recommendation

If you need a quick image with text in it use ChatGPT Dalle3.

If you need to frequently make text in an AI image, reach for Flux. Available on multiple tools and in multiple versions including open-source options.

And if you can afford the credits or are a power user of text in AI images, purchase a subscription to Leonardo.ai Phoenix.

Thanks for reading, see you next week!

-josh

Mythical AI

Discussion about this post