Hacker News

I saw a longer video of this that Ethan Mollick posted, and in that one the sequences are longer and do appear to demonstrate a fair amount of consistency. The clips in the summary video on the paper's home page don't backtrack because they're showing a number of distinct environments, but you only get a few seconds of each.

If I studied the longer one more closely I'm sure I'd find inconsistencies, but it seemed able to recall the presence/absence of destroyed items, dead monsters, etc. on subsequent loops around a central obstruction that completely obscured them for quite a while. That did seem pretty surprising to me, as I expected it to match how you'd described it.




Yes, it definitely is very good at simulating gameplay footage, don't get me wrong. Its input for predicting the next frame isn't just the previous frame; it has access to a whole sequence of prior frames.

But to say the model is simulating actual gameplay (i.e. that a person could actually play Doom in this) is far-fetched. It's definitely great that the model was able to remember that the gray wall was still there after we turned around, but it's untenable for actual gameplay that the wall completely changed location and orientation.


> it's untenable for actual gameplay that the wall completely changed location and orientation.

It would work in an SCP-themed game. Or a dreamscape/Inception-themed one.

Hell, "you're trapped in a Doom-like dreamscape, escape before you lose your mind" is a very interesting pitch for a game. Basically take this Doom thing and make walking through a specific, unique-looking doorway from the original game the victory condition - the player's job would be to coerce the model into generating it, while also not dying in the Doom fever dream itself. I'd play the hell out of this.

(Implementation-wise, just loop in a simple recognition model to continuously evaluate the victory condition from the last few frames, plus some OCR to detect when the player's hit-point indicator on the HUD drops to zero.)
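The wiring for that loop could look something like this - a sketch only, where `victory_model` (the doorway classifier) and `read_hp` (OCR over the HUD region) are hypothetical callables you'd plug in:

```python
import numpy as np

def game_loop(frames, victory_model, read_hp, window=4):
    """Scan a stream of generated frames; return 'win', 'dead', or 'timeout'.

    frames        -- iterable of HxWx3 uint8 arrays (the model's output)
    victory_model -- callable: list of recent frames -> True if the target
                     doorway is recognized (a small classifier in practice)
    read_hp       -- callable: frame -> int hit points (OCR on the HUD
                     region in practice)
    """
    recent = []
    for frame in frames:
        recent.append(frame)
        recent = recent[-window:]              # keep only the last few frames
        if read_hp(frame) <= 0:                # HUD says zero HP: dead
            return "dead"
        if len(recent) == window and victory_model(recent):
            return "win"                       # doorway seen across the window
    return "timeout"
```

Evaluating the victory condition over a window of frames rather than a single one is doing real work here: it filters out one-frame hallucinations of the doorway.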

(I'll happily pay $100 this year to the first project that gets this to work. I bet I'm not the only one. Doesn't have to be Doom specifically, just has to be interesting.)


Check out the actual modern Doom WAD MyHouse, which implements these ideas. It totally breaks our preconceptions of what the Doom engine is capable of.

https://en.wikipedia.org/wiki/MyHouse.wad


MyHouse is excellent, but it mostly breaks our perception of what the Doom engine is capable of by not really using the Doom engine. It leans heavily on engine features which were embellishments by the GZDoom project, and never existed in the original Doom codebase.


To be honest, I agree! That would be an interesting gameplay concept for sure.

Mainly I just wanted to temper the expectation I'm seeing throughout this thread that the model is actually simulating Doom. I don't know what will be required to get from here to there, but we're definitely not there yet.


Or what about training the model on many FPS games? Surviving in one nightmare that morphs into another, into another, into another ...


What you're pointing at mirrors the same kind of limitation in using LLMs for role-play/interactive fiction.


Maybe a hybrid approach would work: certain things, like inventory, stored as variables, lists, etc.

Wouldn't be as pure though.
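The hybrid idea might look like this - authoritative state kept in plain variables, with only the rendering left to the model. Everything here is invented for illustration (the `HardState` name, the fields, the conditioning hook):

```python
from dataclasses import dataclass, field

@dataclass
class HardState:
    """Game state the model is never trusted to remember."""
    hp: int = 100
    ammo: int = 50
    inventory: set = field(default_factory=set)

    def pickup(self, item):
        self.inventory.add(item)

    def hit(self, dmg):
        self.hp = max(0, self.hp - dmg)       # clamp at zero, never negative

# Each step, the hard state would be serialized and fed to the frame
# model as extra conditioning, rather than "remembered" in pixels:
#   frame = model(prev_frames, actions, encode(state))
```

Not as pure, as the parent says, but it means a picked-up blue key can never hallucinate itself back onto the floor.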


Give it state by having a rendered-but-offscreen pixel area that's fed back in as byte data for the next frame.
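A minimal sketch of that feedback loop, assuming made-up dimensions and a `model` callable that maps the extended canvas to the next one - only the top rows are shown to the player, while the scratch strip rides along as hidden state:

```python
import numpy as np

H, W, SCRATCH_ROWS = 120, 160, 8   # assumed sizes, for illustration only

def step(model, prev_canvas, action):
    """One autoregressive step over the extended canvas."""
    canvas = model(prev_canvas, action)        # (H + SCRATCH_ROWS, W, 3)
    onscreen = canvas[:H]                      # what the player sees
    # canvas[H:] is the offscreen scratch area; it is never displayed,
    # but the whole canvas is fed back as input to the next step.
    return onscreen, canvas

# Toy "model" that just echoes its input, to show the shapes:
dummy = lambda c, a: c
canvas0 = np.zeros((H + SCRATCH_ROWS, W, 3), np.uint8)
onscreen, canvas1 = step(dummy, canvas0, action=0)
```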


Huh.

Fun variant: give it hidden state by doing the offscreen scratch pixel buffer thing, but without grading its content in training. Train the model as before, grading only the "onscreen" output, and let it keep the side channel to do what it wants with. It'd be interesting to see how it would use it, what data it would store, and how that data would be encoded.
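Training-wise, "not grading the scratch area" could just be a loss mask: 1 over the onscreen region, 0 over the hidden strip, so gradient pressure never touches the side channel. A sketch with assumed dimensions:

```python
import numpy as np

H, W, SCRATCH_ROWS = 120, 160, 8   # assumed sizes, matching nothing real

def masked_l2(pred, target):
    """Mean squared error over the onscreen region only.

    pred, target -- float arrays of shape (H + SCRATCH_ROWS, W, 3)
    """
    mask = np.zeros((H + SCRATCH_ROWS, W, 1), np.float32)
    mask[:H] = 1.0                             # grade onscreen rows only
    return float(((pred - target) ** 2 * mask).sum() / mask.sum())
```

With this loss, anything the model writes into the bottom `SCRATCH_ROWS` rows is free: it contributes zero error, so whatever encoding emerges there is purely whatever helps predict the graded region.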


It's an empirical question, right? But they didn't do it...



