[2205.05590] A neural prosody encoder for end-to-end dialogue act classification