[2004.13106] Learning to Rank in the Position Based Model with Bandit Feedback