[2402.11131] Speculative Streaming: Fast LLM Inference without Auxiliary Models