Discovering a Diverse Set of Near-Optimal Policies for Reinforcement Learning
hand-picked
walker.stand
walker.walk
walker.robustness
walker.discrimitation
dog.stand
dog.walk
cheetah.run
cheetah.robustness
cheetah.discrimination
Walker [discrimination].
Average
Min
None
Robustness