[2301.03344] Universal Multimodal Representation for Language Understanding