Reinforcement Learning for Quadrupedal Locomotion

Training robust walking policies for the Unitree Go2 robot using Proximal Policy Optimization (PPO)

Under construction


Ubuntu IsaacLab PyTorch NumPy wandb