https://github.com/comput3ai/c3-trellis-gradio

https://github.com/comput3ai/c3-trellis-gradio

Science Score: 36.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: arxiv.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (13.9%) to scientific vocabulary
Last synced: 10 months ago · JSON representation

Repository

Basic Info
  • Host: GitHub
  • Owner: comput3ai
  • Language: Python
  • Default Branch: main
  • Size: 30.3 MB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created about 1 year ago · Last pushed 12 months ago
Metadata Files
Readme

README.md

🎨 C3-TRELLIS-Gradio

A comprehensive comparison and implementation of different TRELLIS model variants for 3D asset generation from images. This project analyzes and consolidates multiple TRELLIS implementations to create an optimized Gradio interface.

🎯 Project Overview

This project examines four different TRELLIS implementations: - 🔧 FurkanGozukara-TRELLIS: Extended implementation with API and additional features - ⚡ TRELLIS-Imagen3D: Simplified, working implementation by cavargas10 - 🎯 TRELLIS-innoai: Multi-image support with both single and multi-image tabs - 🌐 microsoft-TRELLIS: The original official implementation

The main contribution is creating a unified app.py for TRELLIS-innoai that combines the best features from all implementations while removing unnecessary dependencies.

🚀 Key Features

✨ Unified Implementation

  • 🎭 Dual-mode interface: Single image and multi-image 3D generation in one app
  • 🔧 No LitModel3D dependency: Uses standard Gradio components for better compatibility
  • ✂️ Split image functionality: Automatically splits concatenated multi-view images
  • Clean UI: Based on the working TRELLIS-Imagen3D interface design

🖼️ Multi-Image Support

  • 📷 Upload multiple views of the same object for improved 3D reconstruction
  • 🎲 Supports both stochastic and multidiffusion algorithms
  • 🧪 Experimental feature that works best with consistent object views

📦 Export Options

  • 🎮 GLB files: Industry-standard 3D format with texture
  • Gaussian splats: PLY format for 3D Gaussian representation
  • 🏛️ Adjustable mesh simplification and texture resolution

📋 Implementation Details

🔄 Key Changes Made

  1. 🚫 Removed gradio_litmodel3d dependency: Replaced with standard gr.Model3D
  2. Preserved split_image functionality: Essential for processing multi-view images
  3. 🔗 Unified configuration: Server runs on 0.0.0.0:7860 like Imagen3D
  4. 🧤 Simplified codebase: Removed unnecessary docstrings while maintaining functionality

⚙️ Technical Architecture

```python

Core pipeline configuration

pipeline = TrellisImageTo3DPipeline.from_pretrained("jetx/trellis-image-large") pipeline.cuda()

Two-stage generation process

Stage 1: Sparse Structure Generation (guidance: 7.5, steps: 12)

Stage 2: Detail Enhancement (guidance: 3.0, steps: 12)

```

🔧 Installation

  1. Clone the repository: bash git clone https://github.com/yourusername/c3-trellis-gradio.git cd c3-trellis-gradio

  2. Install dependencies: bash pip install -r requirements.txt

  3. Run the application: bash python app.py

The application will be available at http://localhost:7860

🏆 Honorable Mentions

This project builds upon and references several TRELLIS implementations:

🏛️ Original Implementation

🤗 Hugging Face Spaces

🧠 Model Weights

💡 Usage Example

```python import os os.environ['SPCONV_ALGO'] = 'native'

from PIL import Image from trellis.pipelines import TrellisImageTo3DPipeline from trellis.utils import renderutils, postprocessingutils

Load pipeline

pipeline = TrellisImageTo3DPipeline.from_pretrained("jetx/trellis-image-large") pipeline.cuda()

Single image generation

image = Image.open("path/to/image.png") outputs = pipeline.run(image, seed=1)

Extract outputs

glb = postprocessingutils.toglb( outputs['gaussian'][0], outputs['mesh'][0], simplify=0.95, texture_size=1024 ) glb.export("output.glb") ```

🛠️ Project Structure

c3-trellis-gradio/ ├── TRELLIS-innoai/ # Unified implementation │ ├── app.py # Main Gradio interface │ ├── requirements.txt # Dependencies (no litmodel3d) │ └── assets/ # Example images ├── TRELLIS-Imagen3D/ # Reference implementation ├── FurkanGozukara-TRELLIS/ # Extended features └── microsoft-TRELLIS/ # Original implementation

📄 License

This project is licensed under the MIT License, following the original TRELLIS licensing.

🙏 Acknowledgments

Special thanks to: - 💙 The Microsoft Research team for creating TRELLIS - ⭐ cavargas10 for the clean Imagen3D implementation - 🌟 innoai for the multi-image support implementation - 🤗 The Hugging Face community for hosting the demos and models

📜 Citation

If you use this project, please cite the original TRELLIS paper:

bibtex @article{xiang2024structured, title = {Structured 3D Latents for Scalable and Versatile 3D Generation}, author = {Xiang, Jianfeng and Lv, Zelong and Xu, Sicheng and Deng, Yu and Wang, Ruicheng and Zhang, Bowen and Chen, Dong and Tong, Xin and Yang, Jiaolong}, journal = {arXiv preprint arXiv:2412.01506}, year = {2024} }

Owner

  • Name: comput3.AI
  • Login: comput3ai
  • Kind: organization
  • Email: hello@comput3.ai

Cloud infrastructure for the future of AI.

GitHub Events

Total
  • Push event: 4
  • Create event: 2
Last Year
  • Push event: 4
  • Create event: 2

Dependencies

Dockerfile docker
  • nvidia/cuda 12.8.1-cudnn-devel-ubuntu22.04 build
extensions/nvdiffrast/setup.py pypi
  • numpy *
requirements.txt pypi
  • easydict ==1.13
  • gradio *
  • igraph ==0.11.8
  • imageio ==2.36.1
  • imageio-ffmpeg ==0.5.1
  • onnxruntime ==1.20.1
  • opencv-python-headless ==4.10.0.84
  • pillow ==10.4.0
  • pymeshfix ==0.17.0
  • pyvista ==0.44.2
  • rembg ==2.0.60
  • scipy ==1.14.1
  • spaces *
  • spconv-cu120 ==2.3.6
  • torch ==2.4.0
  • torchvision ==0.19.0
  • tqdm ==4.67.1
  • transformers ==4.46.3
  • trimesh ==4.5.3
  • xatlas ==0.0.9
  • xformers ==0.0.27.post2