ManipulateGaussian: 3D Gaussian Manipulation from Observed Vibrations

Carnegie Mellon University

ManipulateGaussian turns monocular videos into interactive 3D Gaussians with physics constraints.

By applying different external forces, the reconstructed 3D Gaussians exhibit diverse dynamics.

Abstract

Understanding and manipulating the vibration modes of 3D objects is critical for advancements in computer graphics, robotics, and physics-based simulation. A fundamental challenge in precise simulation is constructing interpretable modal representations that facilitate material property estimation, structural analysis, and physically grounded animation. Inspired by recent progress in learning-based dynamic reconstruction, this work extends vibration recovery from 2D dynamic observations into 3D particle simulation. We present a framework for learning and visualizing physically plausible vibration behaviors directly from video data using off-the-shelf 3D Gaussian Splatting (3DGS) models. By leveraging optical flow, deformation modeling, and differentiable rendering, our system estimates modal bases that accurately describe the object's intrinsic structural dynamics.

Modules Overview

Modules Overview

Input

We render synthetic data using our designed 3D assets and animations as the input, with camera parameter and pose.

Girl

Hand

Lamp

Reconstructed dynamic 3D Gaussians

With DG-Mesh we reconstruct the 3D Gaussians from the input videos, with cananical gaussian and their movements. The following visualization shows mesh image, mesh, point cloud.

Girl

Hand

Lamp

Trajectory FFT Analysis

We perform FFT analysis on the motion trajectory of each 3D Gaussian to extract its dominant frequency components.

Trajectory FFT Analysis

Physics Simulation with Motion Formulation

Each frequency modality is simulated with a position and speed, and the complex number represents the phase of the harmonic motion.

Output

After reconstructing 3D Gaussians and estimating their physical properties, we apply external forces to generate new dynamics.

Girl - Manipulated Dynamics

Hand - Manipulated Dynamics

Lamp - Manipulated Dynamics

Real World Data

Results on real-world video data showing input video, rendered Gaussian, and rendered point cloud.

Input Video

Input Video

Rendered Gaussian

Rendered Gaussian

Rendered Point Cloud

Rendered Point Cloud

Related Links

Our method is inspired by the paper “Visual Vibrometry: Estimating Material Properties from Small Motion in Video”.
Original work: https://ieeexplore.ieee.org/document/7728146

We also thank the authors of Dynamic Gaussian Mesh for providing their excellent open-source implementation:
https://github.com/Isabella98Liu/DG-Mesh