(Translated by https://www.hiragana.jp/)
[2309.07937] Voxtlm: unified decoder-only models for consolidating speech recognition/synthesis and speech/text continuation tasks