Among existing trials for high-dimensional musculoskeletal control, deep reinforcement learning (DRL) are predominantly used, where the successes come with the following significant requirements:
• long training time to generate effective control
• reference trajectories to guide the learning process
• specific task and model to reduce training difficulty
Below we demonstrate the control performance from the current state-of-the-art DRL-based
method, DynSyn, on the MS-Human-700 model, where we observe that DynSyn:
• fails to learn a natural gait without reference trajectories
• fails over unexpected terrain conditions
• fails when the model suddenly changes
We propose Model Predictive Control with Morphology-aware Proportional Control
(MPC2), a hierarchical model-based planning algorithm to address the challenges of high-dimensional musculoskeletal control.
Our method has two major components:
• Model predictive position controller
A sampling-based model predictive controller
plans the target posture of the agent, with instant rollouts for rapid response to state changes.
• Morphology-aware proportional controller
A proportional
controller adaptively coordinates the actuators
to achieve the target joint positions, with gain parameters dynamically adjusted according to the system morphology.
We show control sequences over full-body motion tasks using MPC2.
Our proposed method demonstrates:
• near real-time stable control of the full-body musculoskeletal model, where no model-based planning algorithm has achieved
• zero-shot adaption to different tasks and terrain conditions, where no previous DRL-based method has demonstrated success in whole-body musculoskeletal systems.
We also demonstrate that the training-free, near-real-time control generation of MPC2 enables efficient reward engineering, facilitating stable control of sports imitation.
Compared to the fragile control policy of DRL-based methods, MPC2 is capable of:
• zero-shot adaption to sudden model changes
• achieving robust control even in the presence of actuator faults.
MPC2 is capable of rapid adaption to sudden perturbation forces to the pelvis, demonstrating robust control performance.
We demonstrate that MPC2 is capable of controlling the ostrich model (120 muscles) with the same cost function used for human model walking.
Combining Bayesian optimization with MPC2, we can optimize the cost function weights to improve the forward speed of the ostrich without manual tuning.
We demonstrate that MPC2 maintains larger polygon support than DynSyn during walking, enhancing the stability.
We demonstrate that MPC2 is capable of reducing the energy consumption by over 75% compared to DynSyn during walking.
We demonstrate that baseline MPC methods provided by Mujoco MPC fail to achieve walking over the MS-Human-700 model.
Non sampling-based MPC require long planning time due to the computation of the derivative of the high-dimensional dynamics, hinders real-time decision making.
Sampling based MPC struggles to sample effective control sequences due to the high-dimensional action space.
Below we show the control performance of MPO and DEP-RL on the MS-Human-700 model (choosing best from 3 random seeds). We found them fail to either stand or walk with the high-dimensional model.
We demonstrate that MPC2 is capable of controlling the arm musculoskeletal model (85 muscles) to manipulate cube to a sequence of target orientations.
We propose a high-dimensional control method, MPC2, that is capable of:
• achieving near real-time stable control of comprehensive musculoskeletal systems
• enabling training-free full-body motion control across a wide range of motion tasks, many of which have not been achieved by state-of-the-art DRL-based methods
• rapid adaption to sudden model changes, fully leveraging the over-actuated nature to achieve robust control.
@inproceedings{iclr2025mpc2,
title={Motion Control of High-Dimensional Musculoskeletal Systems with Hierarchical Model-Based Planning},
author={Wei, Yunyue and Zhuang, Shanning and Zhuang, Vincent and Sui, Yanan},
booktitle={The Thirteenth International Conference on Learning Representations},
year={2025}
}