RL2 Lab @ SJTU - Embodied Intelligence and Robot Learning

Research Interests

Humanoid Robots & Legged Locomotion

鲁棒与安全的足式智能：聚焦人形、四足及六足机器人在复杂非结构化环境中的高动态感知、控制与安全部署。我们的研究致力于突破机器人在负载突变、硬件退化以及未知物理扰动等极端工况下的运动极限，赋予其在复杂地形中稳定、可靠的作业能力。

Reinforcement Learning in Robotics

真实世界中的机器人技能学习：结合强化学习与表征学习，突破高效的 Sim-to-Real 迁移瓶颈。通过从人类示范、视频及轨迹数据中提取先验知识，赋予机器人平滑行走、复杂节律运动、精准全身模仿以及跨越极高难度地形的泛化与自适应能力。

Vision-Language-Action & Navigation

视觉-语言-动作与具身导航：面向开放世界环境，构建从大尺度场景理解到复杂任务执行的端到端框架。将多模态视觉与自然语言指令深度融合，实现高精度的视觉语言导航与长程任务下的精准动作生成，打通从语义理解到物理交互的闭环。

Affective Computing & Social Robotics

情感化具身交互与共情系统：探索人机共融的新范式，打造具备情感感知与情绪共鸣的具身机器人。融合多模态情绪识别、共情行为决策与细腻的情感姿态表达技术，为机器人赋予理解和反馈人类情感的能力，实现有温暖且富有同理心的陪伴。

Research Highlights

Representative works grouped by research direction. For the full list, see Publications.

01 Humanoid Robots & Legged Locomotion

Active Exploration and Online Perception of Terrain Physics with Legged Robots

Huangxuan Lin, Haoyang Li, Yue Gao*

RA-L 2025

Active exploration strategy coupled with online terrain-physics estimation for safe legged locomotion on varied surfaces.

Paper

Contrastive Forward Prediction Reinforcement Learning for Adaptive Fault-Tolerant Legged Robots

Yangqing Fu, Yang Zhang, Qiyue Yang, Liyun Yan, Zhanxiang Cao, Yue Gao*

CoRL 2025

A cerebellum-inspired dual-pathway architecture for legged robots that adapts gait and tolerates joint failures in real time.

Paper

Constrained Dirichlet Distribution Policy: Guarantee Zero Constraint Violation for Continuous Robotic Control

Jianming Ma, Zhanxiang Cao, Yue Gao*

RA-L 2024

A novel policy parameterization based on the Dirichlet distribution that guarantees zero constraint violations for safe sim-to-real transfer.

Paper

Select before Act: Spatially Decoupled Action Repetition for Continuous Control

Buqing Nie, Yangqing Fu, Yue Gao*

ICLR 2025

Spatially selective action repetition that improves sample efficiency and robustness in continuous locomotion control tasks.

arXiv

HiWET: Hierarchical World-Frame End-Effector Tracking for Long-Horizon Humanoid Loco-Manipulation

Zhanxiang Cao, Liyun Yan, Yang Zhang, Sirui Chen, Jianming Ma, Tianyue Zhan, Shengcheng Fu, Yufei Jia, Cewu Lu, Yue Gao*

RSS 2026

Hierarchical framework for long-horizon humanoid loco-manipulation; world-frame end-effector tracking decouples locomotion and manipulation, enabling robust whole-body control for complex real-world tasks.

arXiv

02 Reinforcement Learning in Robotics

A2CF: Adaptive Assistive Curriculum Force

A2CF: Learning Motion Skills with Adaptive Assistive Curriculum Force in Humanoid Robots

Zhanxiang Cao, Yang Zhang, Buqing Nie, Huangxuan Lin, Haoyang Li, Yue Gao*

ICRA 2026

Assistive-force curriculum co-training with the robot's motion policy: applies assistive forces during early learning and gradually withdraws them, enabling stable walking, dancing, and backflip skills.

arXiv

Robust Humanoid Motion Skills: Complex Terrain

Keep on Going: Learning Robust Humanoid Motion Skills via Selective Adversarial Training

Yang Zhang, Zhanxiang Cao, Buqing Nie, Haoyang Li, Zhong Jiangwei, Qiao Sun, Xiaoyi Hu, Xiaokang Yang, Yue Gao*

AAAI 2026

Non-zero-sum adversarial training with a budget-constrained attack policy; enables humanoid robots to traverse complex terrain and stairs in the real world.

arXiv

SE-Policy: Coordinated Humanoid Locomotion

Coordinated Humanoid Robot Locomotion with Symmetry Equivariant Reinforcement Learning Policy

Buqing Nie, Yang Zhang, Rongjun Jin, Zhanxiang Cao, Huangxuan Lin, Xiaokang Yang, Yue Gao*

AAAI 2026

Symmetry-equivariant actor and symmetry-invariant critic improve multi-directional tracking success; outperforms DreamWaQ by 1.3× on challenging scenarios.

arXiv

Robust Locomotion with Lipschitz Constraint

Robust Locomotion Policy with Adaptive Lipschitz Constraint for Legged Robots

Yang Zhang, Haoyang Li, Yue Gao*

RA-L 2024

Lipschitz-constrained reinforcement learning that produces smoother and more disturbance-robust locomotion policies.

Paper

Attention-based Terrain Encoder and Foothold Planner

Global-Local Attention Decomposition for Terrain Encoding in Humanoid Perceptive Locomotion

Shengcheng Fu, Yue Gao*

Under Review

Cross-attention over height-map features with proprioceptive queries; foothold-point features dynamically focus the policy on traversable regions, deployed on G1 with onboard LiDAR.

03 Vision-Language-Action & Navigation

GATER: Learning Grasp-Action-Target Embeddings and Relations for Task-Specific Grasping

Ming Sun, Yue Gao*

RA-L 2022

Jointly learns grasp-action-target embeddings and their relations to enable task-specific grasping, bridging the gap between object affordances and downstream manipulation goals.

arXiv

FocusNav: Spatial Selective Attention for Humanoid Navigation

FocusNav: Spatial Selective Attention with Waypoint Guidance for Humanoid Local Navigation

Yang Zhang, Jianming Ma, Liyun Yan, Zhanxiang Cao, Yazhou Zhang, Haoyang Li, Yue Gao*

Under Review

Spatial selective attention with waypoint guidance improves humanoid local navigation by focusing on task-relevant regions and planning robust traversal paths in cluttered environments.

arXiv

Hierarchical Humanoid Manipulation with Physically Grounded Motion Intention Representations

Sirui Chen, Yue Gao

Under Review, RA-L 2026

Physically grounded motion intention representations enable hierarchical decomposition of complex humanoid manipulation tasks for robust real-world execution.

04 Affective Computing & Social Robotics

Multi-Modal Hierarchical Empathetic Framework for Social Robots With Affective Body Control

Yue Gao*, Yangqing Fu, Ming Sun, Feng Gao

TAFFC 2024

Multimodal affective understanding and hierarchical empathetic body-behavior generation for social robots in real-world HRI.

Paper

DanceHAT: Generate Stable Dances for Humanoid Robots with Adversarial Training

Buqing Nie, Yue Gao*

ICRA 2022

Adversarial training framework for generating physically stable and expressive dance motions for humanoid social robots from human demonstrations.

Paper

View All Publications →

RL² Lab 具身机器人与表征学习实验室

About

Shaping the Future of Embodied AI

Research Interests

Humanoid Robots & Legged Locomotion

Reinforcement Learning in Robotics

Vision-Language-Action & Navigation

Affective Computing & Social Robotics

Research Highlights

Welcome to Join Us!

RL2 Lab 具身机器人与表征学习实验室

About

Shaping the Future of Embodied AI

Research Interests

Humanoid Robots & Legged Locomotion

Reinforcement Learning in Robotics

Vision-Language-Action & Navigation

Affective Computing & Social Robotics

Research Highlights

Welcome to Join Us!

RL² Lab 具身机器人与表征学习实验室