GPT Image 1.5

charlierguo

Comments (255)

vunderba
Okay, results are in for GenAI Showdown with the new gpt-image 1.5 model for the editing portions of the site!

https://genai-showdown.specr.net/image-editing

Conclusions

- OpenAI has always had some of the strongest prompt understanding alongside the weakest image fidelity. This update goes some way towards addressing this weakness.
- It's leagues better at making localized edits without altering the entire image's aesthetic than gpt-image-1, doubling the previous score from 4/12 to 8/12 and the only model that legitimately passed the Giraffe prompt.
- It's one of the most steerable models with a 90% compliance rate.

Updates to GenAI Showdown

- Added outtakes sections to each model's detailed report in the Text-to-Image category, showcasing notable failures and unexpected behaviors.
- New models have been added including REVE and Flux.2 Dev (a new locally hostable model).
- Finally got around to implementing a weighted scoring mechanism which considers pass/fail, quality, and compliance for a more holistic model evaluation (click pass/fail icon to toggle between scoring methods).

If you just want to compare gpt-image-1, gpt-image-1.5, and NB Pro at the same time:

https://genai-showdown.specr.net/image-editing?models=o4,nbp...
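For readers wondering how a weighted score differs from a plain pass/fail tally, here is a minimal sketch. It is not the site's actual formula: the `PromptResult` fields, the 0 to 1 scales, and the 0.5/0.3/0.2 weights are all assumptions made up for illustration.

```python
from dataclasses import dataclass


@dataclass
class PromptResult:
    """One model's outcome on one prompt (hypothetical structure)."""
    passed: bool        # strict pass/fail verdict
    quality: float      # assumed 0..1 rating of visual quality
    compliance: float   # assumed 0..1 rating of prompt adherence


def pass_rate(results):
    """Plain pass/fail score: fraction of prompts passed."""
    if not results:
        return 0.0
    return sum(1 for r in results if r.passed) / len(results)


def weighted_score(results, w_pass=0.5, w_quality=0.3, w_compliance=0.2):
    """Blend pass/fail, quality, and compliance into one 0..1 score.

    The weights are illustrative assumptions, not the site's values.
    """
    if not results:
        return 0.0
    total = 0.0
    for r in results:
        total += (
            w_pass * (1.0 if r.passed else 0.0)
            + w_quality * r.quality
            + w_compliance * r.compliance
        )
    return total / len(results)
```

With a scheme like this, a model that fails a prompt but still produces a high-quality, mostly compliant image earns partial credit, which a strict 8/12 style tally cannot express.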
minimaxir
I have a Nano Banana Pro blog post in the works expanding on my experiments with Nano Banana (https://news.ycombinator.com/item?id=45917875). Running a few of my test cases from that post and the upcoming blog post through this new ChatGPT Image model, this new model is better than Nano Banana but MUCH worse than Nano Banana Pro, which now nails the test cases that previously showed issues. The pricing is unclear, but gpt-image-1.5 appears to be 20% cheaper than the current gpt-image-1 model, which would put a `high`-quality generation in the same price range as Nano Banana Pro.

One curious case demoed here in the docs is the grid use case. Nano Banana Pro can also generate grids, but for NBP grid adherence to the prompt collapses after going higher than 4x4 (there's only a finite amount of output tokens to correspond to each subimage), so I'm curious that OpenAI started with a 6x6 case, albeit the test prompt is not that nuanced.
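The grid argument above is simple arithmetic: a fixed output budget split across more cells leaves fewer tokens per subimage. A minimal sketch, where the total budget of 4608 tokens is a made-up number for illustration and not a documented figure for either model:

```python
def tokens_per_cell(total_output_tokens: int, rows: int, cols: int) -> int:
    """Evenly split a fixed output-token budget across grid cells.

    Assumes, purely for illustration, that every cell gets an equal share.
    """
    if rows <= 0 or cols <= 0:
        raise ValueError("rows and cols must be positive")
    return total_output_tokens // (rows * cols)


# Hypothetical budget; the real per-image token counts are not given in the thread.
ASSUMED_BUDGET = 4608

share_4x4 = tokens_per_cell(ASSUMED_BUDGET, 4, 4)  # 16 cells
share_6x6 = tokens_per_cell(ASSUMED_BUDGET, 6, 6)  # 36 cells
```

Going from 4x4 to 6x6 cuts each cell's share by more than half (16 cells versus 36), whatever the real budget is.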
oxag3n
If this was a farm of sweatshop Photoshoppers in 2010, who download all images from the internet and provide a service of combining them on your request, this would escalate pretty quickly.

Question: with copyright and authorship dead wrt AI, how do I make (at least) new content protected?

Anecdotal: I had a hobby of doing photos in quite a rare style and lived in a place that you'd find quite a few pictures of. When I asked gpt to generate a picture of that area in that style, it returned a highly modified, but recognizable, copy of a photo I published years ago.
agentifysh
I am very impressed. A benchmark I like to run is to have it create sprite maps and UV texture maps for an imagined 3D model.

Noticed it captured a Megaman Legends vibe: https://x.com/AgentifySH/status/2001037332770615302

And here it generated a texture map from a 3D character: https://x.com/AgentifySH/status/2001038516067672390/photo/1

However, I'm not sure if these are true, accurate UV maps, as I don't have the 3D models themselves. But I tried this in Nano Banana when it first came out and it couldn't do it.
blurbleblurble
It's really weird to see "make images from memories that aren't real" as a product pitch
sharkjacobs
Was it ever explained or understood why ChatGPT Images always has (had?) that yellow cast?
encroach
This outperforms Gemini 3 pro image (nano banana pro) on Text-to-Image Arena and Image Edit Arena. I'm surprised they didn't mention this leaderboard in the blog post.

I like this benchmark because it's based upon user votes, so overfitting is not as easy (after all, if users prefer your result, you've won).

https://lmarena.ai/leaderboard/text-to-image
https://lmarena.ai/leaderboard/image-edit
mingabunga
Did an experiment to give a software product a dark theme. Gave both (GPT and Gemini/Nano) a screenshot of the product and an example theme I found on Dribbble.

- Gemini/Nano did a pretty average job, only applying some grey to some of the panels. I tried a few different examples and got similar output.
- GPT did a great job and themed the whole app and made it look great. I think I'd still need a designer to finesse some things though.
abbycurtis33
I still use Midjourney, because all of these major players are so bad at stylistic and creative work. They're singularly focused on photorealism.
password-app
Impressive image quality improvements. Meanwhile, AI agents just crossed a milestone: Simular's Agent S hit 72.6% on OSWorld (human-level is 72.36%).

We're seeing AI get better at both creative tasks (images) and operational tasks (clicking through websites).

For anyone building AI agents: the security model is still the hard part. Prompt injection remains unsolved even with dedicated security LLMs.
rw2
Having used it compared to Nano Banana:

- The latency is still too high: under 10 seconds for Nano Banana and around 25 seconds for GPT Image 1.5.
- The quality is higher, but it's not a jump like the one from previous Google models to Nano Banana Pro. Nano Banana Pro is still at least equivalently good or better in my opinion.
anonfunction
So the announcement said the API works with the new model, so I updated my Golang SDK grail (https://github.com/montanaflynn/grail) to use it, but it returns a 500 server error when you try to use it, and if you change to a completely unknown model, the new one is not listed in the available models:

    POST "https://api.openai.com/v1/responses": 500 Internal Server Error
    {
      "message": "An error occurred while processing your request. You can retry your request, or contact us through our help center at help.openai.com if the error persists. Please include the request ID req_******************* in your message.",
      "type": "server_error",
      "param": null,
      "code": "server_error"
    }

    POST "https://api.openai.com/v1/responses": 400 Bad Request
    {
      "message": "Invalid value: 'blah'. Supported values are: 'gpt-image-1' and 'gpt-image-1-mini'.",
      "type": "invalid_request_error",
      "param": "tools[0].model",
      "code": "invalid_value"
    }
chakintosh
Can't wait to generate fake memories with my grandma who died 20 years ago
yuni_aigc
One thing I’ve noticed when comparing these models is that “quality” and “realism” don’t always move together.

Some models are very strong at sharp details and localized edits, but they can break global lighting consistency: shadows, reflections, or overall scene illumination drift in subtle ways. GPT-Image seems to trade a bit of micro-detail for better global coherence, especially in lighting, which makes composites feel more believable even if they’re not pixel-perfect.

It’s hard to capture this in benchmarks, but for real-world editing workflows it ends up mattering more than I initially expected.
aziis98
I know this is a bit out of scope for these image editing models, but I always try this experiment [1] of drawing a "random" triangle and then doing some geometric construction, and they mess up in very funny ways. These models can't "see" very well. I think [2] is still very relevant.

[1]: https://chatgpt.com/share/6941c96c-c160-8005-bea6-c809e58591...
[2]: https://vlmsareblind.github.io/
alasano
It's still not available in the API despite them announcing the availability.

They even linked to their Image Playground, where it's also not available.

I updated my local playground to support it and I'm just handling the 404 on the model gracefully.

https://github.com/alasano/gpt-image-1-playground
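One way to handle a not-yet-available model gracefully is to try a list of model names in order and fall back when one is rejected. The sketch below is not taken from the linked playground and makes no real API calls: `ModelUnavailableError`, `generate_with_fallback`, and `fake_generate` are hypothetical names, and a real client would need to map its own 404 (or the 400 "invalid value" error quoted earlier in the thread) onto that exception.

```python
class ModelUnavailableError(Exception):
    """Hypothetical error a client wrapper raises when the API rejects a model name."""


def generate_with_fallback(generate, prompt, models):
    """Try each model in order; return (model_used, result) for the first that works.

    `generate` is any callable taking (model, prompt). It stands in for a real
    API client and is an assumption of this sketch.
    """
    last_error = None
    for model in models:
        try:
            return model, generate(model, prompt)
        except ModelUnavailableError as err:
            last_error = err  # remember why this model failed, then try the next
    raise RuntimeError("no requested model is available") from last_error


def fake_generate(model, prompt):
    """Stand-in backend that mimics the rollout state described in the thread."""
    if model == "gpt-image-1.5":
        raise ModelUnavailableError(model)
    return f"{model} image for: {prompt}"


model_used, image = generate_with_fallback(
    fake_generate, "a giraffe", ["gpt-image-1.5", "gpt-image-1"]
)
```

Only the "model not available" case is caught here. Other failures, such as the 500 server error reported above, propagate to the caller, who may want to retry instead of silently switching models.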
aymenfurter
Big jump from gpt-image, but I would love to see the same kind of step change we have seen in recent language model releases.
xnx
Great to have continued competition in the different model types.

What angle is there for second tier models? Could the future for OpenAI be providing a cheaper option when you don't need the best? It seems like that segment would also be dominated by the leading models.

I would imagine the future shakes out as: first class hosted models, hosted uncensored models, local models.
ChrisArchitect
Post: https://openai.com/index/new-chatgpt-images-is-here/ (https://news.ycombinator.com/item?id=46291827)
smlavine
This is terrifying. Truth is dead.
zkmon
AI-generated images would remove all the trust in and admiration for human talent in art, similar to how text generation would remove trust in and admiration for human talent in writing. Same case for coding.

So, let's simulate that future. Since no one trusts your talent in coding, art or writing, you wouldn't care to do any of these. But the economy is built on products and services which get their value based on how much human talent and effort is required to produce them.

So, the value of these services and products goes down as demand and trust go down. No one knows or cares who is a good programmer on the team, who is a great thinker and writer, and who is a modern Picasso.

So, the motivation disappears for humans. There are no achievements to target, and there is no way to impress others with your talent. This should lead to a uniform workforce without much difference in talents. Pretty much a robot army.
neom
Anyone else have issues verifying with openai? I always get a "congrats you're done" screen with a green checkmark from Persona, nothing to click, and my account stays unverified. (Edit, mystically, it's fixed..!)
KaiserPro
Is there watermarking, or some other way for normal people to tell if it's fake?
sfmike
Hope to see more "red alert" status from the AI wars putting companies into all-hands-on-deck mode. This only helps the cost of tokens and efficacy. As always, competition only helps the end users.
Garlef
GPT images is the new MS Word "Arial + clip art"
gs17
> Still some scientific inaccuracies, but ~70% correct

That's still dangerously bad for the use-case they're proposing. We don't need better looking but completely wrong infographics.
fock
Good to see that hands are still not solved...
surrTurr
not super impressed. feels like 70% as good as nano banana pro.
sroussey
“Photo of a blond male in his 50s with half gray hair”

Still fails. Every photo of a man with half gray hair will have the other half black.
ezero
Even from their own curated examples, this looks quite a bit worse than Nano Banana in terms of preserving consistency on image edits.
ge96
I get that the tech implementation is amazing, but I wonder if it takes away from the genuineness of events, like the astronaut photo. I get that it's just a joke/funny too, but it's like a photo of you in a supercar vs. actually buying one. Or fake AI companions vs. real people. Beauty filters/skinny filters vs. actually being healthy.
celeryd
If it can't generate non-sexual content of a woman in a bikini, I am not interested.
andai
Sam Altman Christmas decoration isn't real, he can't hurt me...
catigula
Nano Banana Pro is so good that any other attempt feels 1-2 generations behind.
eterm
I have a "go to" prompt for images:

> In the style of a 1970s book sci-fi novel cover: A spacer walks towards the frame. In the background his spaceship crashed on an icy remote planet. The sky behind is dark and full of stars.

Nano banana pro via gemini did really well, although still way too detailed, and it then made a mess of different decades when I asked it to follow up: https://gemini.google.com/share/1902c11fd755

It's therefore really disappointing that GPT-image 1.5 did this:

https://chatgpt.com/share/6941ed28-ed80-8000-b817-b174daa922...

Completely generic, not at all like a book cover. It completely ignored that part of the prompt while it focused on the other elements.

Did it get the other details right? Sure, maybe even better, but the important part it just ignored completely.

And it's doing even worse when I try to get it to correct the mistake. It's just repeating the same thing with more "weathering".
raw_anon_1111
I still can’t get it to draw a “13 hour clock” correctly
dzonga
we seriously can't be burning GW of energy just to have sama in a GPT-Shirt Ad generated by A.I.

impressive stuff though - as you can give it a base image + prompt.
mohsen1
Unlike Nano Banana it allows generating photos of children. Always fun to ask AI to imagine children of a couple but it's also kinda concerning that there might be terrible use cases.
GaryBluto
God OpenAI are so far behind. Their own example shows that trying to only change specific parts of the image doesn't work without affecting the background.
sipsi
the combination of two images the last gpt-image (nano banana) generated seem to be inappropriate
gostsamo
Alt text is one of the nicest uses for ai and still Open AI didn't bother using it for something so basic. The dogfooding is not strong with their marketing team.
brador
Every person in every picture in their examples is white except for 1 Asian dude. Like a 46:1 ratio for the page (I counted). Not one Middle Eastern or Black or Jewish or Indian or South American person.

Not even one. And no one on the team said anything?

Come on Sam, do better.
v9v
Lots of em-dashes in this copy.
pdevr
> Now remove the two men, just keep the dog, and put them in an OpenAI livestream that looks like the attached image.

Where is the image given along with the prompt? If I didn't miss it: it would have been nice to show the attached image.
0dayman
nah Nano Banana Pro is much better
nightshift1
What is the endgame? Why is OpenAI throwing that much money on image/video generation? Is there a profitable market for AI-generated image slop? Do people choose ChatGPT instead of Gemini/Grok/Claude because of the image generation capabilities? To me, it looks like a huge fiery money pit.
StarterPro
In the image they showed for the new one, the mechanic was checking a dipstick... that was still in the vehicle.

I really hope everyone is starting to get disillusioned with OpenAI. They're just charging you more and more for what? Shitty images that are easy to sniff out?

In that case, I have a startup for you to invest in. It's a bridge-selling app.
enigma101
Really can't stand the image slop suffocating the internet.
randall
double popped collar ftw
ares623
My copium is that analog photography makes a comeback as a way to recover some level of trust and authenticity.
thumbsup-_-
now you can create good memories with your family without meeting them
rvz
Another bunch of "startups" have been eliminated.
jdthedisciple
Why is the emphasis of these promos always on creating fake social media pictures of people and things that didn't happen?

Aren't we plagued enough by all the fake bullshit out there?

Ffs!

/rant

Sorry, gotta be honest and blunt every one of those times...
adammarples
Still can't pass my image test:

Two women walking in single file

Although it tried very hard and had them staggered slightly.