Has anyone seen any additional work in this vein? I’d like to think that, if this is just the starting point, the possibilities are relatively open for varied application. I’m drawing a blank on how this concept, in a general sense, could be expanded on and incorporated into a more traditional game context; but I know there should be.
Mostly there isn't. It's about as useful as writing a program in brainfuck. https://en.wikipedia.org/wiki/Brainfuck# You do it precisely because it's hard, not because it's a good way to do it.
There are things you can do with shaders that might be useful for games. For example large simulations. But in generally, getting a game to work the way they work on ShaderToy is a fun puzzle but extremely inefficient and slow.
Even if consider shaders, just new way to run C programs, they have totally different ground - another RAM organization, another variable types than CPU (or even GPU), another variables visibility (and special methods to deal with it), some very special limitations (on code size, on exec time), and non-guaranteed RAM integrity (yes, for VRAM is normal to lost data).
What I see, IF you could fit there, you will got at least one additional high-end CPU core speed for free, even on integrated graphics (for example, nearly all game examples successfully run on intel 630 GPU, and as I understand, CPU there used only to transfer data in-out and to JIT compile).
GPGPU was possible before compute shaders, but very painful. ShaderToy is the hard way today (and for the last ~15 years).
Simulations commonly target GPU, and a game is just an interactive simulation, so in some respect it's straightforward to write games for GPU. From a utilization perspective, you'll need minimally tens of thousands of dynamic game entities, which is one among many reasons why it's uncommon.
"the possibilities are relatively open for varied application."
They are actually very limited, because a GPU does not work, like many cpus in parallel (how I initially assumed).
So you will be limited and restricted in everything you do, making every step painful.
The trick is to find the parts that can be offloaded to the GPU - and come calculated back at the right time.
Games are usually performance critical - and communicating between the GPU and CPU can induce lots of lags as the coordination is tricky and not really under your control.
Well, I am not an expert but something that can perform the same simple operations massivly parallel. Meaning, you want to avoid branching for example and there are lots of footguns, otherwise everything will be slow, or not working. And debugging is a pain.
I guess it would be more accurate to say that GPUs are an array of massively parallel SIMD processors, which are their own kind of parallelism. It's parallelism all the way down