(Translated by https://www.hiragana.jp/)
[2406.18871] DeSTA: Enhancing Speech Language Models through Descriptive Speech-Text Alignment