So instead of chunks of tokens, we can input stream of tokens and then some poin...

		antupis on Oct 3, 2023 \| parent \| context \| favorite \| on: Efficient streaming language models with attention... So instead of chunks of tokens, we can input stream of tokens and then some point say "LLM take a wheel". So it is very nice but not revolutionary.