How to Use TRELLIS 2: Complete AI 3D Generation Guide

Last updated: April 14, 2026

TRELLIS 2 by Microsoft Research generates high-quality 3D assets from a single image or text prompt in about 3 seconds. The model uses a 4-billion parameter architecture with Sparse 3D VAE that achieves 16x spatial compression, as described in the research paper published at CVPR 2025. This guide covers every step of the process — from preparing your input to exporting production-ready models — whether you're using the open-source repository or an online platform.

What You Need Before Starting

TRELLIS 2 offers two ways to generate 3D models:

Approach	Requirements	Best For
Online platform	A web browser	Quick generation, no setup
Local installation	NVIDIA GPU (16GB+ VRAM), Python 3.10+	Custom pipelines, batch processing

For most users, an online platform is the fastest way to start. If you need local installation, see our TRELLIS 2 installation guide.

Try TRELLIS 2 online — no installation required

Method 1: Image-to-3D Generation

Image-to-3D is the most popular workflow. You upload a single photo and TRELLIS 2 reconstructs a full 3D model.

Step 1: Prepare Your Input Image

The quality of your output depends heavily on the input image. Follow these guidelines:

Ideal input images have:

A single, clearly defined subject
Clean background (solid color or transparent PNG)
Even, diffused lighting
Resolution of 512x512 pixels or higher
The subject centered and fully visible

Avoid:

Multiple overlapping objects
Blurry or low-resolution photos
Harsh shadows or extreme lighting
Significant occlusion (parts hidden behind other objects)

A well-prepared image can improve output quality by 30-50% compared to a random snapshot.

Step 2: Upload Your Image

On our platform:

Navigate to the 3D generation page
Upload your image via drag-and-drop or file picker
The system automatically detects the subject and prepares it for 3D generation

If you're running TRELLIS 2 locally, place your image in the project directory and run:

python infer.py --image_path your_image.png

Step 3: Configure Generation Parameters

TRELLIS 2 exposes several parameters that control output quality:

Parameter	Default	Range	Effect
Resolution	512	256-1536	Higher = more detail, slower generation
Sampling Steps	12	4-50	More steps = better quality, slower
Guidance Scale	7.5	1.0-20.0	Higher = more faithful to input
Seed	Random	Any integer	Fixed seed = reproducible results

Recommended settings by use case:

Use Case	Resolution	Steps	Guidance Scale
Quick preview	256	4	5.0
Standard quality	512	12	7.5
High quality	1024	25	10.0
Maximum quality	1536	40	12.0

Step 4: Generate and Review

Click generate and wait approximately 3-10 seconds (depending on resolution and hardware). After generation, review your model:

Rotate the model — check all angles, not just the front
Check the back side — AI estimates the unseen portions, which are less accurate
Inspect fine details — look for symmetry issues or distorted features
Evaluate texture quality — textures should match the original image

If the result needs improvement, try:

A different source image angle
Increasing sampling steps
Adjusting the guidance scale

Step 5: Export Your Model

TRELLIS 2 supports multiple export formats:

Format	Extension	Best For
GLB	`.glb`	Game engines (Unity, Unreal), Web viewers, AR/VR
OBJ	`.obj`	Universal compatibility, 3D editing software
STL	`.stl`	3D printing (geometry only, no textures)
3D Gaussian Splatting	`.ply`	Real-time rendering, web-based 3D viewers
NeRF	`.npz`	Photorealistic visualization

Choose GLB for game development, STL for 3D printing, and Gaussian Splatting for web-based 3D experiences.

Export your 3D model in any format — try it free

Method 2: Text-to-3D Generation

Text-to-3D lets you describe what you want in natural language and TRELLIS 2 generates it.

Step 1: Write an Effective Prompt

Good prompts are specific and descriptive. Here's a formula that works well:

[Subject] + [Key Features] + [Style/Material] + [Optional: Color, Pose, etc.]

Example prompts:

Quality	Prompt
Basic	"a sword"
Good	"a medieval longsword with a leather-wrapped handle"
Excellent	"a medieval longsword with a double-edged steel blade, leather-wrapped handle, brass crossguard, and a ruby set in the pommel"

Step 2: Generate from Text

On our platform:

Switch to Text-to-3D mode
Enter your prompt
Optionally add a negative prompt (what to avoid)
Click generate

Locally:

python infer.py --text_prompt "a medieval longsword with a ruby pommel"

Step 3: Iterate and Refine

Text-to-3D often requires iteration. If the first result isn't what you envisioned:

Add more detail to the prompt
Use a reference image — combine text with an image for more control
Try different seeds — the same prompt can produce varied results
Use negative prompts — specify what you don't want ("blurry", "low quality", "deformed")

Advanced Techniques

Multi-View Generation

TRELLIS 2 can accept multiple views of the same object to improve reconstruction quality. If you have photos from the front, side, and back, upload all of them:

python infer.py --image_path front.png --image_path side.png --image_path back.png

Multi-view input significantly improves back-side accuracy and overall geometric fidelity.

Local Editing

One of TRELLIS 2's unique features is local editing — modify specific parts of a generated 3D model without regenerating everything:

Generate the initial 3D model
Select the region you want to modify
Provide new instructions (text or image patch)
The model updates only the selected region

This is particularly useful for:

Fixing artifacts in specific areas
Changing materials on certain parts
Adding details to a base model

Batch Processing

For generating multiple models, use batch mode:

python infer.py --batch_dir ./input_images/ --output_dir ./output_models/

This processes all images in the input directory sequentially, saving results to the output directory.

Parameter Tuning Deep Dive

Sampling Steps

Sampling steps control the denoising process. More steps produce cleaner geometry and sharper textures:

Steps	Quality	Speed	Use Case
4	Draft	~1s	Quick preview
12	Good	~3s	Standard use
25	Very Good	~6s	Production assets
40+	Excellent	~10s	Final output

Guidance Scale

Guidance scale controls how closely the output follows the input. Think of it as "creativity vs. accuracy":

Low (1-5): More creative, may deviate from input
Medium (5-10): Balanced creativity and accuracy
High (10-20): Strict adherence to input, less variation

Resolution Trade-offs

Higher resolution means more detail but requires more VRAM and time:

Resolution	VRAM Required	Generation Time	Detail Level
256	8 GB	~1s	Basic shapes
512	12 GB	~3s	Good detail
1024	16 GB	~6s	High detail
1536	24 GB	~10s	Maximum detail

Output Format Guide

For Game Development

Export as GLB with these considerations:

Polygon count: AI models often generate 50k-200k polygons — decimate to 10k-50k for real-time rendering
Textures: Check UV mapping quality in Blender
Scale: Set real-world units in your 3D software

For 3D Printing

Export as STL or OBJ:

Run mesh repair to ensure watertight geometry
Check for non-manifold edges and flipped normals
Verify scale matches your intended print size

For Web/AR

Export as 3D Gaussian Splatting or GLB:

Gaussian Splatting provides real-time rendering in browsers
GLB is widely supported in WebXR frameworks
Optimize file size for web delivery

Resources

Source code: github.com/microsoft/TRELLIS.2 (MIT license)
Model weights: Hugging Face — TRELLIS.2-4B
Research paper: "Native and Compact Structured Latents for 3D Generation" — arXiv:2512.14692
v1 paper: "Structured 3D Latents for Scalable and Versatile 3D Generation" — arXiv:2412.01506
Online demo: Hugging Face Spaces

Common Issues and Solutions

Issue	Cause	Solution
Blurry textures	Low sampling steps	Increase to 25+
Distorted geometry	Poor input image	Use a cleaner, well-lit photo
Missing details	Low resolution	Increase to 1024+
Artifacts on back	Single-view input	Provide multiple views
Slow generation	High resolution + steps	Use online platform with optimized hardware
Out of memory	High resolution on limited GPU	Reduce resolution or use cloud generation

TRELLIS 2 vs Other 3D Generation Tools

According to community testing on Reddit and benchmarks published by 3D AI Studio, TRELLIS 2 currently leads in generation speed and overall output quality among open-source 3D generation models.

Feature	TRELLIS 2	Tripo3D	Meshy AI	Hunyuan3D
Generation speed	~3s	~10s	~30s	~15s
Max resolution	1536³	1024³	1024³	1024³
Output formats	GLB, OBJ, STL, GS, NeRF	GLB, OBJ, FBX	OBJ, FBX, STL, GLB	OBJ, GLB
Local editing	Yes	No	No	No
Open source	Yes (MIT)	No	No	Yes
Multi-view input	Yes	Yes	No	Yes
Text-to-3D	Yes	Yes	Yes	Yes

Best Practices Summary

Prepare your input — a clean, well-lit image dramatically improves results
Start with default settings — generate a preview before tweaking parameters
Review from all angles — the back side is estimated and may need attention
Choose the right format — match your export format to your end goal
Iterate — AI generation is fast enough to try multiple approaches
Post-process — use Blender or MeshLab for final cleanup and optimization

FAQ

What is TRELLIS 2?

TRELLIS 2 is a 4-billion parameter AI model developed by Microsoft Research that generates high-quality 3D assets from text prompts or images. It uses a Sparse 3D VAE and DiT architecture to produce 3D models in approximately 3 seconds. The source code is available on GitHub under the MIT license.

Is TRELLIS 2 free to use?

Yes. TRELLIS 2 is open source under the MIT license. You can run it locally for free if you have a compatible NVIDIA GPU. For those without GPU hardware, our online platform offers a free tier for 3D generation.

What file formats does TRELLIS 2 export?

TRELLIS 2 supports GLB, OBJ, STL, 3D Gaussian Splatting (.ply), and NeRF (.npz) export formats. GLB is recommended for game engines, STL for 3D printing, and Gaussian Splatting for web-based 3D viewers.

Can I use TRELLIS 2 generated models commercially?

Yes. TRELLIS 2 is released under the MIT license, which permits commercial use. However, always verify the license of any input images you use and check for potential trademark issues in generated content. See the GitHub license discussion for details.

How much VRAM does TRELLIS 2 need?

TRELLIS 2 requires a minimum of 8 GB VRAM for 256-resolution generation. For standard 512-resolution output, 12 GB VRAM is recommended. High-quality 1024-1536 resolution generation requires 16-24 GB VRAM. If your GPU doesn't meet these requirements, use the online platform instead.

How does TRELLIS 2 compare to Tripo3D and Meshy AI?

TRELLIS 2 generates 3D models in approximately 3 seconds, significantly faster than Tripo3D (~10s) and Meshy AI (~30s). It supports higher resolution (up to 1536³), offers local editing, and is the only fully open-source option among the three. See the detailed comparison table above for specifics.

Get Started with TRELLIS 2

The fastest way to try TRELLIS 2 is through our online platform:

Generate Your First 3D Model — Free

No GPU needed. No Python installation. Upload an image or describe what you want and get a production-ready 3D model in seconds.

Feature	Self-Hosted	Our Platform
Setup time	2-4 hours	0 minutes
GPU required	Yes (16GB+ VRAM)	No
Technical knowledge	Python, CUDA	None
Max resolution	Limited by your GPU	Up to 1536³
Batch processing	Yes	Yes