A Modular Residual Learning Framework to Enhance Model-Based Approach for Robust Locomotion

Korea Advanced Institute of Science and Technology (KAIST)
IEEE Robotics and Automation Letters (RA-L), July 2025

*Corresponding Author

Supplementary Video

Abstract

This paper presents a novel approach that combines the advantages of both model-based and learning-based frameworks to achieve robust locomotion. The residual modules are integrated with each corresponding part of the model-based framework, a footstep planner and dynamic model designed using heuristics, to complement performance degradation caused by a model mismatch. By utilizing a modular structure and selecting the appropriate learning-based method for each residual module, our framework demonstrates improved control performance in environments with high uncertainty, while also achieving higher learning efficiency compared to baseline methods. Moreover, we observed that our proposed methodology not only enhances control performance but also provides additional benefits, such as making nominal controllers more robust to parameter tuning. To investigate the feasibility of our framework, we demonstrated residual modules combined with model predictive control in a real quadrupedal robot. Despite uncertainties beyond the simulation, the robot successfully maintains balance and tracks the commanded velocity.

Method

Overall Framework

Overall modular residual learning framework

Residual Dynamics Module

Residual dynamics module diagram

The LPF smooths the residual dynamics, preventing high-frequency noise from destabilizing the controller. The supervised learning (SL)-based module learns a mapping from the proprioceptive information to the residual dynamics, providing a more stable compensation for model mismatch.

Residual Planner Module

Residual planner module diagram

A reinforcement-learning residual adjusts high-level nominal references (e.g., foothold and gait-phase references) to improve task performance and robustness.

Result

Effect of Residual Modules

Body roll, residual dynamics, and footstep residual under payload and disturbances
Detailed disturbance recovery with stance duration and footstep position adaptation
Red: A - Normal Green: B - With payload Purple: C - With disturbances

With a 6kg off-center payload and external pushes, the nominal controller cannot rely on its fixed model and heuristic contacts alone. The residual dynamics module reacts to the induced model mismatch, while the residual planner changes foothold timing and lateral placement, allowing the robot to recover balance under combined uncertainty.

Walking at 1 m/s with Disturbance

Velocity tracking, stance duration, footstep residual, and residual dynamics during walking
Green: In-place stepping Purple: Acceleration toward 1 m/s Gray: Disturbance recovery

From in-place stepping to acceleration, the residual planner shortens the gait phase duration, showing faster step transitions for high-speed locomotion. When the robot receives a lateral disturbance during acceleration, the gait sequence changes for recovery, the footstep residual shifts the foothold in the disturbance direction, and the residual dynamics captures the posture deviation induced by the disturbance.

Hyperparameter Sensitivity

Success rate and tracking error across MPC weight settings

MPC performance is strongly affected by cost-function weights, which are usually tuned by hand. Across varied position and velocity weight settings, the residual-augmented controller maintains more consistent success rates and tracking errors, showing that the learned modules can reduce sensitivity to imperfect nominal parameter choices.

Performance Consistency

Velocity tracking consistency on flat terrain
Velocity tracking consistency on rough terrain

Across 100 command trials, the proposed controller improves tracking consistency beyond simply avoiding failure. On flat terrain, both controllers succeed, but the residual modules reduce the spread of velocity-tracking errors; on rough terrain, the proposed controller keeps robust tracking where vanilla MPC becomes unreliable at more demanding commands.

OOD Generalization

Out-of-distribution payload scenario with asymmetric leg-mounted masses
Out-of-distribution velocity tracking comparison
Proposed method behavior under out-of-distribution sandbag condition

The asymmetric leg-mounted payload is outside the training distribution, where an end-to-end RL baseline loses tracking performance. The model-based backbone preserves consistent behavior, and the residual modules add physically meaningful adaptations in dynamics compensation, gait timing, and foothold placement toward the payload-induced imbalance.

BibTeX

@article{11091478,
  author={Kim, Min-Gyu and Kang, Dongyun and Kim, Hajun and Park, Hae-Won},
  journal={IEEE Robotics and Automation Letters}, 
  title={A Modular Residual Learning Framework to Enhance Model-Based Approach for Robust Locomotion}, 
  year={2025},
  volume={10},
  number={9},
  pages={9072-9079},
  keywords={Uncertainty;Training;Robots;Legged locomotion;Adaptation models;Vehicle dynamics;Degradation;Computational modeling;Payloads;Computational efficiency;Legged robots;machine learning for robot control;optimization and optimal control},
  doi={10.1109/LRA.2025.3592067}
}