Native and Compact Structured Latents for 3D Generation
trellis2.mp4
(Compressed version due to GitHub size limits. See the full-quality video on our project page!)
TRELLIS.2 is a state-of-the-art large 3D generative model (4B parameters) designed for high-fidelity image-to-3D generation. It leverages a novel "field-free" sparse voxel structure termed O-Voxel to reconstruct and generate arbitrary 3D assets with complex topologies, sharp features, and full PBR materials.
✨ Features
1. High Quality, Resolution & Efficiency
Our 4B-parameter model generates high-resolution fully textured assets with exceptional fidelity and efficiency using vanilla DiTs. It utilizes a Sparse 3D VAE with 16× spatial downsampling to encode assets into a compact latent space.
Resolution Total Time* Breakdown (Shape + Mat) 512³ ~3s 2s + 1s 1024³ ~17s 10s + 7s 1536³ ~60s 35s + 25s
*Tested on NVIDIA H100 GPU.
2. Arbitrary Topology Handling
... continue reading