Humanoid Whole-Body Locomotion on Narrow Terrain via Dynamic Balance and Reinforcement Learning

1Institute of Artificial Intelligence (TeleAI), 2Shanghai Jiao Tong University
*Corresponding Author

Video

Abstract

Humans possess delicate dynamic balance mechanisms that enable them to maintain stability across diverse terrains and under extreme conditions. However, despite significant advances recently, existing locomotion algorithms for humanoid robots are still struggle to traverse extreme environments, especially in cases that lack external perception (e.g., vision or LiDAR). This is because current methods often rely on gait-based or perception-condition rewards, lacking effective mechanisms to handle unobservable obstacles and sudden balance loss. To address this challenge, we propose a novel whole-body locomotion algorithm based on dynamic balance and Reinforcement Learning (RL) that enables humanoid robots to traverse extreme terrains, particularly narrow pathways and unexpected obstacles, using only proprioception. Specifically, we introduce a dynamic balance mechanism by leveraging an extended measure of Zero-Moment Point (ZMP)-driven rewards and task-driven rewards in a whole-body actor-critic framework, aiming to achieve coordinated actions of the upper and lower limbs for robust locomotion. Experiments conducted on a full-sized Unitree H1-2 robot verify the ability of our method to maintain balance on extremely narrow terrains and under external disturbances, demonstrating its effectiveness in enhancing the robot's adaptability to complex environments.

Method


Experiments

Track

Track Close-up

Dense Conical Obstacles

Obstacles Close-up

Push on Balance Beam

Push on Slope

Push on Unstructured Terrains

Push Close-up

Stumble

Carry Payload

BibTeX

        
          @article{xie2025humanoid,
            title={Humanoid Whole-Body Locomotion on Narrow Terrain via Dynamic Balance and Reinforcement Learning},
            author={Xie, Weiji and Bai, Chenjia and Shi, Jiyuan and Yang, Junkai and Ge, Yunfei and Zhang, Weinan and Li, Xuelong},
            journal={arXiv preprint arXiv:2502.17219},
            year={2025}
          }