Bolt3D: Generating 3D Scenes in Seconds
Published on: 2025-06-08 08:30:56
Geometry VAE
The key to generating high-quality 3D scenes with a latent diffusion model is our Geometry VAE, capable of compressing pointmaps with high accuracy. We find empirically that our VAE with a transformer decoder is more appropriate for autoencoding pointmaps than a VAE with a convolutional decoder or a VAE pre-trained for autoencoding images. Below we visualize colored point clouds using (1) Pointmaps from data, (2) Pointmaps autoencoded with our VAE, (3) Pointmaps autoencoded with a VAE with a convolutional decoder and (4) Pointmaps autoencoded with a pre-trained Image VAE.
... Read full article.