[2404.07973] Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models