(Translated by https://www.hiragana.jp/)
[2404.09857] Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL