(Translated by https://www.hiragana.jp/)
[2112.09583] Align and Prompt: Video-and-Language Pre-training with Entity Prompts