(Translated by https://www.hiragana.jp/)
[2403.03174] MOKA: Open-Vocabulary Robotic Manipulation through Mark-Based Visual Prompting