(Translated by https://www.hiragana.jp/)
[2312.09237] Pixel Aligned Language Models