Masked Proprioception Autoencoders for Humanoid Locomotion: Self-Supervised Learning to Reconstruct IMU Signals

This post explores a Masked Autoencoder (MAE) approach applied to proprioceptive signals — specifically IMU data from humanoid locomotion rollouts.

Inspired by masked image modeling, the idea is to mask a subset of IMU measurements across time and train a transformer encoder-decoder to reconstruct the missing signals. This forces the model to learn rich temporal and kinematic structure from locomotion data in a fully self-supervised manner, without any labels.

Read the full write-up on Medium: Masked Proprioception Autoencoders for Humanoid Locomotion




Enjoy Reading This Article?

Here are some more articles you might like to read next:

  • Rapid Motor Adaptation (Sensorimotor Adaptation) ~ Physical Intelligence?
  • Compression Progress ~ Computational Intelligence ?
  • Towards robust Humanoid Loco-Manipulation using Deep Reinforcement Learning
  • Mean Flow Matching for PushT Task: One-Step Action Generation
  • Building a Convolutional Neural Network(CNN) From Scratch