Apparently Google is capable of doing cool things. DeepMind’s speculative sampli...

		visarga on Feb 14, 2023 \| parent \| context \| favorite \| on: Google employees criticize CEO for “dumpster fire”... Apparently Google is capable of doing cool things. DeepMind’s speculative sampling achieves 2–2.5x decoding speedups in LLM. That brings cost down significantly, without degradation in quality.