I am amazed by level of comprehension that DALL-E shows. At first glance the results also look amazing. Zoomed in though it turns into the stuff of nightmares. Quite literally actually. I am an enthusiastic practicionor of lucid dreaming and this stuff really feels similar to what I see when I closly observe details in lucid dreams. Ostensibly everything looks real but this reality falls apart when actually observed.
I too noticed how "dream-like" it is. Like it seems to have similar capabilities and limitations to how my brain renders things in dreams... they look fine at a glance but if you focus on details things get weird.
Like what? I went back and looked at all of them closely again. Maybe the only weird thing is the mummies faces. Other than that it really is stellar. I mean just look at the detail on the “shaped like a heart” ice cream scoop, or the paper mache Godzilla.
FYI, openai has eased the rule on realistic face generation. Now you can generate and publish Photorealistic faces. They will internally filter those to make sure they don't match famous faces.
I had the weirdest reaction to that last sentence.
As an engineer, the first thing that went through my head was "lots of photos of their faces available, easy filter. My face specifically: hard filter"
But as a person it was "ok wow, f_ck you too then?"
I assume it's more so you can't do something like "Hilary Clinton stabs a child in a dark basement caught on webcam" and then pass it off as fake news somewhere.
I would love to see what comes out with certain aspects of the prompts negated.
- "lemon gelato that’s been shaped to look like a heart, on a handmade waffle cone being held up to the camera in a cobblestone courtyard somewhere in italy" ... what about "somewhere not in italy"?
- "Diorama made of clay of a group of computer programmers looking disapprovingly at their CMO who has just given them diet pepsi instead of mountain dew" ... "looking approvingly"?
- "friends gathering around a tabletop “shichirin” grill where an assortment of meats and seafoods are being grilled over glowing binchotan charcoal; everyone is happy." ... "everyone is unhappy"?
I think getting good Dall-E results wil end up being an "art" in its own. Dall-E is like a broad brush and honestly I've never been good at getting great results. I think figuring out how to push Dall-E in a way that aligns with what you want with the right descriptors really goes a long way.
I think to get there, we need a good dictionary or wall of examples that tell you what you can even do. I didn't even know you could have it create clay dioramas.
the way it generates specific art styles and textures is amazing. it's interesting to try and spot out subtle details that it misses/ignores entirely (who is drinking diet coke out of a mug? why are the mummies interpreted as skeletons in sock-raincoats?)
> How long were you on the waiting list before you got access? I only signed up a week or two ago, so I assume I'm in for a long wait.
It's not based on duration, but rather multi-faceted criteria which it seemingly determines based on email association, social media, Linkedin, Geographical location etc... it's not quite clear how it's weighed but those are all things it asks you for when you resister for the Beta [0].
The thing that was found out recently that their might be a paid public release coming soon because they integrated Stripe into Dall-E 2 [1].
probably an analytics plug-in trying to fingerprint you, I remember i've had a lot of trouble with websites interrupting audio on my phone when there's not anything playing
I'm curious what kind of camera you have that makes a sound when accessed