The 'next word' is just intermediate state. Internal to the model, it knows where it is going. Each inference just revives the previous state.
The 'next word' is just intermediate state. Internal to the model, it knows where it is going. Each inference just revives the previous state.