[2110.00284] Learning Reward Functions from Scale Feedback