(Translated by https://www.hiragana.jp/)
[2403.02330] RegionGPT: Towards Region Understanding Vision Language Model