Discrete Latent Diffusion Motion Planning#

Abstract#

we introduce a framework that reduces computational costs while ensuring collision-free trajectory generation. The method integrates trajectory encoding, discrete diffusion, and transformer-based decoding to produce high-quality, goal-directed motion plans. Our approach demonstrates the potential of discrete generative models to enhance robotic planning performance in complex environments.

Introduction#

Trajectory planning is a core challenge in robotics, where the goal is to generate smooth, goal-directed paths that avoid collisions with obstacles. Traditional motion planners often rely on continuous optimization techniques, which can be computationally expensive and struggle with high-dimensional environments.

Recent advances in generative models, especially diffusion models, have shown strong performance in structured output generation tasks. Inspired by their success, we explore how discrete latent diffusion can be leveraged to efficiently generate feasible trajectories in robotic settings.

Our goal: generate goal-directed, collision-free trajectories with lower computational cost.

DLDMP Main

Methodology#

DLDMP Architecture

Latent Diffusion Models (LDM)#

LDMs show that moving diffusion to a compressed latent space (e.g., VQ-VAE codes) drastically reduces computational load while preserving fidelity.

Achieves state-of-the-art in high-res image tasks (inpainting, super-resolution) via cross-attention for flexible conditioning.

Key takeaway: Encode robot trajectories (e.g., sequences of joint angles or end-effector positions) into a discrete latent embedding (via VQ-VAE). Perform diffusion in this smaller, quantized space rather than in raw continuous waypoints.

Latent Diffusion Models

Collision Check#

We used Gilbert-Johnson-Keerthi (GJK) algorithm to perform the collision check.

  • Takes the new trajectory sample output from the decoder (sequence of robot poses).
  • Compares each pose or link of the robot arm with known obstacle shapes in the environment.
  • Checks for intersections (i.e., potential collisions) by computing the shortest distance between the robot and the obstacles using convex shape representations.
  • If a collision is detected, the new trajectory sample is iteratively refined until a collision-free trajectory is generated.
Collision Check

Policy Evaluation#

Without Obstacle

With Obstacle