(Translated by https://www.hiragana.jp/)
[2402.15758] Chimera: A Lossless Decoding Method for Accelerating Large Language Models Inference by Fusing all Tokens