Reward Redistribution as Align-RUDDER: Learning from a Few Demonstrations. (2020). International Journal of Reciprocal Symmetry and Theoretical Physics, 7, 1-8. https://ojs.bdtopten.com/3404.upright/index.php/ijrstp/article/view/52