[2110.06650v2] Multistage linguistic conditioning of convolutional layers for speech emotion recognition