Leveraging autoregressive models and a novel tokenization scheme, UniRig delivers state-of-the-art performance across diverse characters and objects, and is poised to break the rigging bottleneck in 3D animation.
The landscape of 3D content creation is exploding. Fueled by both sophisticated traditional workflows and the rapid rise of AI-powered generation tools (like our own at Tripo), the demand for high-quality 3D assets is surging. Yet, a critical bottleneck persists: rigging. Transforming a static 3D mesh into an animatable character with a skeleton and skinning weights remains a complex, time-consuming, and often manual process requiring significant expertise.
Existing automated solutions offer partial relief but often fall short. Template-based methods excel within their predefined structures (like standard bipeds) but lack the flexibility for the sheer diversity of models being created today. Template-free approaches offer more adaptability but frequently struggle with generating topologically valid skeletons or require complex post-processing, hindering practical adoption.
Today, Tripo is excited to introduce UniRig, a novel, unified framework for automatic skeletal rigging designed to overcome these limitations. As detailed in our latest research paper "One Model to Rig Them All: Diverse Skeleton Rigging with UniRig", UniRig presents a powerful model capable of generating high-quality skeletal rigs for an unprecedented variety of 3D models – from humans and animals to complex fictional characters and even inorganic structures.
At its core, UniRig leverages the power of large autoregressive models, akin to those driving advancements in language and image generation. Instead of predicting pixels or words, UniRig predicts the structure of a 3D skeleton, joint by joint. This sequential prediction process is key to ensuring the generation of topologically valid skeletons.
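To illustrate why sequential prediction helps with topological validity, here is a minimal sketch, not UniRig's actual decoder. It assumes a simplified setting in which each newly generated joint may only attach to a joint that has already been emitted. The random sampler stands in for the learned model, and the names `sample_skeleton` and `is_valid_tree` are hypothetical.

```python
import random

def sample_skeleton(num_joints, seed=0):
    """Stand-in for an autoregressive decoder (assumption: a random choice
    replaces the learned model). Each new joint picks its parent only among
    joints that were already emitted."""
    rng = random.Random(seed)
    parents = [-1]  # joint 0 is the root and has no parent
    for j in range(1, num_joints):
        parents.append(rng.randrange(j))  # parent index is always < j
    return parents

def is_valid_tree(parents):
    """Check for exactly one root and no cycles in any parent chain."""
    if parents.count(-1) != 1:
        return False
    for start in range(len(parents)):
        node, seen = start, set()
        while parents[node] != -1:
            if node in seen:
                return False  # revisited a joint: the chain loops
            seen.add(node)
            node = parents[node]
    return True

print(is_valid_tree(sample_skeleton(20)))
```

Because every parent index is smaller than the index of its child, no cycle can form and every joint traces back to the root, whatever the sampler chooses.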
A critical design enabling this is our Skeleton Tree Tokenization method. Representing a hierarchical skeleton structure with complex joint interdependencies as a linear sequence suitable for a transformer is non-trivial. Our tokenization scheme efficiently encodes:
This optimized tokenization (reducing sequence length by ~30% compared to naive approaches) allows the autoregressive model (based on the OPT architecture) to learn the underlying patterns of skeletal structures effectively, conditioned on the input mesh geometry processed by a shape encoder.
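The idea of flattening a skeleton tree into a token sequence can be sketched as follows. This is an illustrative scheme under stated assumptions, not the Skeleton Tree Tokenization from the paper: it walks the tree depth-first, emits only a child's quantized coordinates while a bone chain continues, and emits a hypothetical `<branch>` marker plus the parent's coordinates when a new branch starts. The bin count, the marker name, and the assumption that coordinates lie in [-1, 1] are all choices made for this example.

```python
BRANCH = "<branch>"  # hypothetical marker token, not from the paper
BINS = 256           # assumed quantization resolution

def quantize(point):
    """Map coordinates assumed to lie in [-1, 1] to integer bins."""
    return tuple(min(BINS - 1, int((c + 1.0) / 2.0 * BINS)) for c in point)

def tokenize(joints, children, root=0):
    """Depth-first serialization of a skeleton tree.
    joints: list of (x, y, z); children: dict joint index -> child indices."""
    tokens = list(quantize(joints[root]))

    def visit(j):
        for k, child in enumerate(children.get(j, [])):
            if k > 0:
                # New branch: restate the parent so a decoder knows where to attach.
                tokens.append(BRANCH)
                tokens.extend(quantize(joints[j]))
            tokens.extend(quantize(joints[child]))
            visit(child)

    visit(root)
    return tokens

def detokenize(tokens):
    """Invert tokenize(); assumes quantized joint positions are distinct."""
    coords, parents = [tuple(tokens[0:3])], [-1]
    i, last = 3, 0
    while i < len(tokens):
        if tokens[i] == BRANCH:
            parent = coords.index(tuple(tokens[i + 1:i + 4]))
            i += 4
        else:
            parent = last  # chain continues from the joint emitted just before
        coords.append(tuple(tokens[i:i + 3]))
        parents.append(parent)
        last = len(coords) - 1
        i += 3
    return coords, parents

# Toy skeleton: 0 -> 1 -> 2, with extra branches 1 -> 3 and 0 -> 4.
joints = [(0, 0, 0), (0, 0.5, 0), (0, 0.9, 0), (0.3, 0.6, 0), (0, -0.5, 0)]
children = {0: [1, 4], 1: [2, 3]}
tokens = tokenize(joints, children)
coords, parents = detokenize(tokens)
print(parents)
```

In this sketch, long unbranched chains cost only three tokens per joint, which is where a scheme of this kind saves length over restating both endpoints of every bone.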
Once a valid skeleton is predicted, UniRig employs a Bone-Point Cross Attention mechanism to predict per-vertex skinning weights. This module effectively captures the complex influence of each bone on the surrounding mesh surface, incorporating geometric features from the mesh and skeleton, crucially augmented by geodesic distance information for improved spatial awareness.
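As a rough illustration of how cross attention between mesh points and bones can yield skinning weights, consider the sketch below. It is not the Bone-Point Cross Attention module itself. It assumes the per-vertex query vectors and per-bone key vectors have already been produced by some encoder, and it folds geodesic distance in as a simple additive penalty scaled by a hypothetical `alpha` parameter, which is one plausible choice rather than the paper's formulation.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def skinning_weights(vertex_queries, bone_keys, geo_dist, alpha=1.0):
    """Scaled dot-product attention from vertices (queries) to bones (keys).
    geo_dist[v][b] is an assumed precomputed geodesic distance between
    vertex v and bone b; larger distances lower the attention score."""
    scale = math.sqrt(len(bone_keys[0]))
    weights = []
    for v, query in enumerate(vertex_queries):
        scores = [
            dot(query, key) / scale - alpha * geo_dist[v][b]
            for b, key in enumerate(bone_keys)
        ]
        weights.append(softmax(scores))  # each row sums to 1
    return weights

# Toy input: three vertices, two bones, 2-D features.
queries = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
keys = [[1.0, 0.0], [0.0, 1.0]]
distances = [[0.1, 2.0], [2.0, 0.1], [1.0, 1.0]]
for row in skinning_weights(queries, keys, distances):
    print([round(w, 3) for w in row])
```

Applying the softmax across bones gives each vertex a non-negative set of weights that sums to one, which is the usual constraint for linear blend skinning.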
Furthermore, UniRig predicts bone-specific attributes (such as stiffness or gravity influence for spring bones), enabling more physically plausible secondary motion directly from the learned parameters. During training, these predicted attributes are evaluated via differentiable physics simulation for enhanced realism.
A model is only as good as its data. To train UniRig for broad applicability, we curated Rig-XL, a new large-scale dataset containing over 14,000 diverse, rigged 3D models. Derived from resources like Objaverse-XL and meticulously cleaned, Rig-XL spans multiple categories (bipeds, quadrupeds, birds, insects, static objects, etc.) and provides the scale and variety needed to train a truly generalizable rigging model. We complemented this with a VRoid dataset to refine performance on detailed anime-style characters with spring bones.
UniRig significantly advances the state-of-the-art in automatic rigging:
UniRig represents a significant step towards solving the rigging bottleneck in modern 3D pipelines. By providing a fast, accurate, and versatile automated solution, it has the potential to:
In line with Tripo's commitment to advancing the field, we are open-sourcing UniRig. We believe this technology can significantly benefit the creator community and foster further innovation.
We invite you to dive deeper:
UniRig is more than just an algorithm; it's a foundational piece for the next generation of 3D content creation, making animation more accessible, efficient, and versatile than ever before.