I decided to use a runtime DS to allow things like splicing and looping templates etc.
It's also possible to make the runtime render the compiled time DS directly, which should be even faster if your requirements are simple enough