
Nathaniel Whittemore

Speaker
17756 total appearances

Podcast Appearances

Many users pointed to very impressive generations from LM Arena that included things like handwritten notes, layouts of a YouTube page, and a simple, kind of janky, iPhone-style image of a retail store.

What people were noticing about these things is how little they felt like AI images.

They just seemed like a random iPhone photo or a screenshot.

And people also identified that they seemed to have good world knowledge.

They weren't just making stuff up in their images, they were actually bringing what the model knew into their ability to create.

Yesterday, as I mentioned at the end of the show, OpenAI teased that the new model would be coming in the afternoon, and indeed, on Tuesday around 3 p.m. Eastern, we got the new ChatGPT Images 2.0.

From a sheer quality standpoint alone, there is absolutely no denying that the model is fairly stunning.

Arena announced that not only did GPT Image 2 take the number one slot on their Elo-score human preference board, it absolutely dominated.

The number 2 through 15 image generators are all clustered basically within 100-130 points of each other.

Number 15, Flux 2 Dev, had a score of 1,149, whereas the previous leader, Nano Banana 2, had a score of 1,271.

GPT Image 2 came in over the top with a 1,512.

Arena points out that that is a record-breaking 242-point lead in the text-to-image category and the largest gap they've ever seen.

In their announcement post, OpenAI gets into a lot of what makes this model different and what it can do.

They write, or more accurately, generate in an image, an announcement post that argues that, quote, "this model is a step change in detailed instruction following, placing and relating objects accurately, and rendering dense text, with the ability to generate across aspect ratios."

They say that it has better composition and visual taste, meaning it feels less AI-generated; that it has, as people were speculating, more world knowledge; and that it has the ability to actually reason and think.

When a thinking model is selected in ChatGPT, Images 2.0 can search the web for real-time information, create multiple distinct images from one prompt, and double-check its own outputs.