Comments (10)
- valine: This is such a natural extension to LLMs. I’m shocked it hasn’t been tried before. When I ask a diffusion model to generate a chessboard, I’d expect the pieces to be placed randomly. We are getting closer to image generators that not only know what chess pieces look like but also where to place them.
- cosmicjedi: You can talk to the authors directly on alphaXiv! https://www.alphaxiv.org/abs/2408.11039v1
- BaculumMeumEst: Stupid question: is their 7B model available? Is there public inference code that we could run? Or do they not usually release them along with these kinds of papers?
- ilaksh: Hmm. I wonder if this is similar to Diffusion Transformers?
- littlestymaar: Would such a model be able to give more accurate descriptions of images as well?