[2401.14717] Turn-taking and Backchannel Prediction with Acoustic and Large Language Model Fusion