https://github.com/comput3ai/c3-trellis-gradio

Science Score: 36.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
○
DOI references
✓
Academic publication links
Links to: arxiv.org
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (13.9%) to scientific vocabulary

Last synced: 10 months ago · JSON representation

Repository

Basic Info

Host: GitHub
Owner: comput3ai
Language: Python
Default Branch: main
Size: 30.3 MB

Statistics

Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Releases: 0

Created about 1 year ago · Last pushed 12 months ago

Metadata Files

Readme

🎨 C3-TRELLIS-Gradio

A comprehensive comparison and implementation of different TRELLIS model variants for 3D asset generation from images. This project analyzes and consolidates multiple TRELLIS implementations to create an optimized Gradio interface.

🎯 Project Overview

This project examines four different TRELLIS implementations: - 🔧 FurkanGozukara-TRELLIS: Extended implementation with API and additional features - ⚡ TRELLIS-Imagen3D: Simplified, working implementation by cavargas10 - 🎯 TRELLIS-innoai: Multi-image support with both single and multi-image tabs - 🌐 microsoft-TRELLIS: The original official implementation

The main contribution is creating a unified app.py for TRELLIS-innoai that combines the best features from all implementations while removing unnecessary dependencies.

🚀 Key Features

✨ Unified Implementation

🎭 Dual-mode interface: Single image and multi-image 3D generation in one app
🔧 No LitModel3D dependency: Uses standard Gradio components for better compatibility
✂️ Split image functionality: Automatically splits concatenated multi-view images
✨ Clean UI: Based on the working TRELLIS-Imagen3D interface design

🖼️ Multi-Image Support

📷 Upload multiple views of the same object for improved 3D reconstruction
🎲 Supports both stochastic and multidiffusion algorithms
🧪 Experimental feature that works best with consistent object views

📦 Export Options

🎮 GLB files: Industry-standard 3D format with texture
✨ Gaussian splats: PLY format for 3D Gaussian representation
🏛️ Adjustable mesh simplification and texture resolution

📋 Implementation Details

🔄 Key Changes Made

🚫 Removed gradio_litmodel3d dependency: Replaced with standard gr.Model3D
✅ Preserved split_image functionality: Essential for processing multi-view images
🔗 Unified configuration: Server runs on 0.0.0.0:7860 like Imagen3D
🧤 Simplified codebase: Removed unnecessary docstrings while maintaining functionality

⚙️ Technical Architecture

```python

Core pipeline configuration

pipeline = TrellisImageTo3DPipeline.from_pretrained("jetx/trellis-image-large") pipeline.cuda()

Two-stage generation process

Stage 1: Sparse Structure Generation (guidance: 7.5, steps: 12)

Stage 2: Detail Enhancement (guidance: 3.0, steps: 12)

```

🔧 Installation

Clone the repository: bash git clone https://github.com/yourusername/c3-trellis-gradio.git cd c3-trellis-gradio
Install dependencies: bash pip install -r requirements.txt
Run the application: bash python app.py

The application will be available at http://localhost:7860

🏆 Honorable Mentions

This project builds upon and references several TRELLIS implementations:

🏛️ Original Implementation

microsoft/TRELLIS: The official TRELLIS repository with full training code and documentation
Paper: Structured 3D Latents for Scalable and Versatile 3D Generation

🤗 Hugging Face Spaces

cavargas10/TRELLIS-Imagen3D: Clean, working implementation that served as the base for our unified approach
innoai/TRELLIS: Implementation with multi-image support that inspired our dual-mode interface

🧠 Model Weights

jetx/trellis-image-large: Mirror of the TRELLIS model weights on Hugging Face

💡 Usage Example

```python import os os.environ['SPCONV_ALGO'] = 'native'

from PIL import Image from trellis.pipelines import TrellisImageTo3DPipeline from trellis.utils import renderutils, postprocessingutils

Load pipeline

pipeline = TrellisImageTo3DPipeline.from_pretrained("jetx/trellis-image-large") pipeline.cuda()

Single image generation

image = Image.open("path/to/image.png") outputs = pipeline.run(image, seed=1)

Extract outputs

glb = postprocessingutils.toglb( outputs['gaussian'][0], outputs['mesh'][0], simplify=0.95, texture_size=1024 ) glb.export("output.glb") ```

🛠️ Project Structure

c3-trellis-gradio/ ├── TRELLIS-innoai/ # Unified implementation │ ├── app.py # Main Gradio interface │ ├── requirements.txt # Dependencies (no litmodel3d) │ └── assets/ # Example images ├── TRELLIS-Imagen3D/ # Reference implementation ├── FurkanGozukara-TRELLIS/ # Extended features └── microsoft-TRELLIS/ # Original implementation

📄 License

This project is licensed under the MIT License, following the original TRELLIS licensing.

🙏 Acknowledgments

Special thanks to: - 💙 The Microsoft Research team for creating TRELLIS - ⭐ cavargas10 for the clean Imagen3D implementation - 🌟 innoai for the multi-image support implementation - 🤗 The Hugging Face community for hosting the demos and models

📜 Citation

If you use this project, please cite the original TRELLIS paper:

bibtex @article{xiang2024structured, title = {Structured 3D Latents for Scalable and Versatile 3D Generation}, author = {Xiang, Jianfeng and Lv, Zelong and Xu, Sicheng and Deng, Yu and Wang, Ruicheng and Zhang, Bowen and Chen, Dong and Tong, Xin and Yang, Jiaolong}, journal = {arXiv preprint arXiv:2412.01506}, year = {2024} }

Owner

Name: comput3.AI
Login: comput3ai
Kind: organization
Email: hello@comput3.ai

Website: https://comput3.ai
Twitter: comput3ai
Repositories: 1
Profile: https://github.com/comput3ai

Cloud infrastructure for the future of AI.

GitHub Events

Total

Push event: 4
Create event: 2

Last Year

Push event: 4
Create event: 2

Dependencies

Dockerfile docker

nvidia/cuda 12.8.1-cudnn-devel-ubuntu22.04 build

extensions/nvdiffrast/setup.py pypi

numpy *

requirements.txt pypi

easydict ==1.13
gradio *
igraph ==0.11.8
imageio ==2.36.1
imageio-ffmpeg ==0.5.1
onnxruntime ==1.20.1
opencv-python-headless ==4.10.0.84
pillow ==10.4.0
pymeshfix ==0.17.0
pyvista ==0.44.2
rembg ==2.0.60
scipy ==1.14.1
spaces *
spconv-cu120 ==2.3.6
torch ==2.4.0
torchvision ==0.19.0
tqdm ==4.67.1
transformers ==4.46.3
trimesh ==4.5.3
xatlas ==0.0.9
xformers ==0.0.27.post2