ChatGPT’s New Images 2.0 Model Is Surprisingly Good at Generating Text

3 days ago

It used to be easy enough to distinguish between human-made and AI-generated imagery. Just two years ago, you couldn’t use image models to create a menu for a Mexican restaurant without inventing new culinary delights like “enchuita,” “churiros,” “burrto,” and “margartas.”

Now, when I ask the brand new ChatGPT Images 2.0 model for a menu of Mexican food, it creates something that could immediately be used in a restaurant without customers noticing that something’s off. (However, ceviche priced at $13.50 might make me question the quality of the fish.)

For comparison, here’s the result I got from DALL-E 3 two years ago. (At the time, ChatGPT did not generate images.)

AI image generators have historically struggled to spell because they mostly used diffusion models, which work by reconstructing images from noise.

“The diffusion models […] are reconstructing a given input,” Asmelash Teka Hadgu, founder and CEO of Lesan AI, told TechCrunch in 2024. “We can assume writings on an image are a very, very tiny part, so the image generator learns the patterns that cover more of these pixels.”
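To put rough numbers on Hadgu’s point, here is a minimal sketch. It assumes a simple pixel-wise squared-error objective, and the image size, text-region size, and error values are all hypothetical, chosen only for illustration; none of them come from OpenAI or describe any specific model.

```python
# Illustrative sketch only: the sizes and error values below are assumptions
# chosen for the example, not measurements from any real image model.

def text_loss_share(image_w, image_h, text_w, text_h,
                    text_err, background_err):
    """Return (pixel_share, loss_share) for a rectangular text region.

    Each *_err value is an assumed mean squared error per pixel.
    """
    total_pixels = image_w * image_h
    text_pixels = text_w * text_h
    background_pixels = total_pixels - text_pixels

    # Total squared error contributed by each region.
    text_loss = text_pixels * text_err
    background_loss = background_pixels * background_err

    pixel_share = text_pixels / total_pixels
    loss_share = text_loss / (text_loss + background_loss)
    return pixel_share, loss_share


# Hypothetical 1024x1024 menu image containing one 256x64 line of text.
pixel_share, loss_share = text_loss_share(
    image_w=1024, image_h=1024,
    text_w=256, text_h=64,
    text_err=0.05,        # assumed per-pixel error inside the lettering
    background_err=0.05,  # assumed per-pixel error everywhere else
)
print(f"text covers {pixel_share:.2%} of pixels")
print(f"text contributes {loss_share:.2%} of the squared error")
```

Under these assumptions, the lettering’s share of the error is the same as its share of the pixels, so a model trained this way has little incentive to get every letter right.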

Researchers have since explored other mechanisms for image generation, like autoregressive models, which make predictions about what an image should look like and function more like an LLM.

Unfortunately, OpenAI declined to answer a question in a press briefing this week about what kind of model is powering ChatGPT Images 2.0.

The company did, however, explain that the new model has “thinking capabilities,” which give it the ability to search the web, make multiple images from one prompt, and double-check its creations. This allows Images 2.0 to create marketing assets in various sizes, as well as multi-paneled comic strips.

OpenAI also says that Images 2.0 has a stronger understanding of non-Latin text rendering in languages like Japanese, Korean, Hindi, and Bengali. The model’s knowledge cuts off in December 2025, which could impact how accurately it can generate certain prompts involving recent news.

“Images 2.0 brings an unprecedented level of specificity and fidelity to image creation. It can not only conceptualize more sophisticated images, but it actually brings that vision to life effectively, able to follow instructions, preserve requested details, and render the fine-grained elements that often break image models: small text, iconography, UI elements, dense compositions, and subtle stylistic constraints, all at up to 2K resolution,” OpenAI said in a press release.

These capabilities mean that image generation isn’t as fast as typing a question to ChatGPT, but generating something complex like a multi-paneled comic still takes just a few minutes.

All ChatGPT and Codex users will be able to access Images 2.0 starting Tuesday; paid users will be able to generate more advanced outputs. The company will also make the gpt-image-2 API available, with pricing dependent on the quality and resolution of outputs.
