Skip to content

[arXiv 2023] Hybrid Internal Model: Learning Agile Legged Locomotion with Simulated Robot Response

Notifications You must be signed in to change notification settings

KK-Zhou/HIMLoco

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 

Repository files navigation


Hybrid Internal Model: Learning Agile Legged Locomotion with Simulated Robot Response

Junfeng Long*Zirui Wang*Quanyi LiJiawei GaoLiu CaoJiangmiao Pang
Shanghai AI Laboratory Zhejiang University Tsinghua University

demo

🏠 About

Dialogue_Teaser
This paper presents a Hybrid Internal Model (HIM) based method for legged locomotion control in quadruped robots. The method aims to address the limitations of existing learning-based locomotion control paradigms, which suffer from information losses, noisy observations, sample efficiency, and difficulties in developing general locomotion policies for robots with different sensor configurations. The proposed HIM method leverages joint encoders and an Inertial Measurement Unit (IMU) as the only sensors for predicting robot states. Our framework consists of two components: the information extractor HIM and the policy network. Unlike previous methods that explicitly model environmental observations such as ground elevation, friction, restitution, etc, HIM only explicitly estimates velocity and implicitly simulates the system response as an implicit latent embedding, with velocity and this embedding, the policy can estimate the environmental disturbance then perform robust locomotion control. The embedding is learned through contrastive learning, which enhances robustness and adaptability in disturbed and unpredictable environments. The proposed method is validated through simulations in different terrains and real-world experiments on the Unitree robots. The results demonstrate that HIM achieves substantial agility over challenging terrains with minimal sensors and fast convergence.

🔥 News

  • [2023-12] We release the paper of HIMLoco. Please check the 👉 webpage 👈 and view our demos! 🎇;

🔍 Overview

Modeling

The classical Internal Model Control(IMC) suggests that we can perform robust control without directly modeling the disturbance. As shown in the above figure, it uses an internal model to simulate the system response and further estimate the system disturbance, increasing the closed-loop stability. The more accurate the internal model is, the more robust control it can perform. In the context of locomotion, the system disturbance from the environment can be estimated from the response of the robot. Therefore, we consider external environmental properties such as elevation maps, ground friction, and ground restitution as disturbances, and do not exploit them for modeling. As shown in the figure, we modify the original IMC for the locomotion task. The commands contain the reference velocity of our robot, however, there also exists an underlying command that requires the robot to keep stable through the whole process. To achieve a closed-loop control system, we need feedback containing the robot's velocity and an implicit response indicating stability that can not be directly accessed from the robot. Following the principles of the IMC framework, we can build an internal model that can simulate the robot's velocity and the implicit response indicating stability. With this model, we can estimate the disturbance brought by the environment and perform robust locomotion control.

Framework

The policy network receives partial observations and the hybrid internal embedding, which is optimized to be close to the robot's successor state where the response of the robot system is naturally embedded, we use contrastive learning in this process to utilize batch-level information and deal with noise.

🔗 Citation

If you find our work helpful, please cite:

@article{long2023himloco,
  author    = {Junfeng Long, Zirui Wang, Quanyi Li, Jiawei Gao, Liu Cao, Jiangmiao Pang},
  title     = {Hybrid Internal Model: Learning Agile Legged Locomotion with Simulated Robot Response},
  journal   = {Arxiv},
  year      = {2023},
}

📄 License

Creative Commons License
This work is under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

👏 Acknowledgements

  • legged_gym: Our codebase is built upon legged_gym.

About

[arXiv 2023] Hybrid Internal Model: Learning Agile Legged Locomotion with Simulated Robot Response

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published