Table 1 Range of curriculum parameters.

From: Learning agility and adaptive legged locomotion via curricular hindsight reinforcement learning

 

Start

End

Unit

Reward factor

  

 Angle velocity

0

0.21

–

 Joint torque penalties

0

0.018

–

 Base acceleration penalties

0

0.1

–

 Energy

0

0.012

–

 Foot slip

0

0.3

–

Domain randomization

  

 Contact friction

[1.25, 1.25]

[0.4, 1.25]

–

 Payload mass

\([-1.0, -1.0]\)

\([-1.0, 5.0]\)

kg

 External disturbance force

[0, 0]

[0, 15]

N

 External disturbance torque

[0, 0]

[0, 5]

Nm

 Gravity direction shift

[0, 0]

[0, 15]

degree

Terrians

  

 Height fields range

[0, 0]

\([-2.5, 2.5]\)

cm

Commands

  

 Linear velocities

[0, 0.8]

[0, 3.5]

m/s

 Direction of velocities

\([-0.5, 0.5]\)

\([-\pi , \pi ]\)

rad