(Translated by https://www.hiragana.jp/)
[2403.03950] Stop Regressing: Training Value Functions via Classification for Scalable Deep RL