(1) Reward Redistribution As Align-RUDDER: Learning from a Few Demonstrations. Int. J. Recipr. Symmetry Theor. Phys. 2020, 7, 1-8.