
Given the bullet point "DALL·E 3 is built natively on ChatGPT", the tight integration between ChatGPT and the corresponding image generation, and the fact that no research paper was released with the announcement, I strongly suspect that DALL·E 3 is a trial run of GPT-4's multimodal capabilities and may run on similar infrastructure.


GPT-4 can only do text-to-text and image-to-text; it can't generate images itself. So it will simply call the image model through an API. Really nothing special: Bing does the same thing.


The art produced by GPT-4 so far hasn't been at this level, but this may be a newer version. See: https://arxiv.org/pdf/2303.12712.pdf



