[1] 2020. Reward Redistribution as Align-RUDDER: Learning from a Few Demonstrations. International Journal of Reciprocal Symmetry and Theoretical Physics. 7, (Feb. 2020), 1–8.