https://github.com/cvi-szu/mg-motionllm

[CVPR 2025] MG-MotionLLM: A Unified Framework for Motion Comprehension and Generation across Multiple Granularities

https://github.com/cvi-szu/mg-motionllm

Science Score: 23.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: arxiv.org, scholar.google
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (11.1%) to scientific vocabulary
Last synced: 10 months ago · JSON representation

Repository

[CVPR 2025] MG-MotionLLM: A Unified Framework for Motion Comprehension and Generation across Multiple Granularities

Basic Info
  • Host: GitHub
  • Owner: CVI-SZU
  • Default Branch: main
  • Homepage:
  • Size: 1.01 MB
Statistics
  • Stars: 1
  • Watchers: 2
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created over 1 year ago · Last pushed about 1 year ago
Metadata Files
Readme

README.md

(CVPR 2025) MG-MotionLLM: A Unified Framework for Motion Comprehension and Generation across Multiple Granularities

[Bizhu Wu](https://scholar.google.com/citations?user=u7nZ3bgAAAAJ&hl=en) · [Jinheng Xie](https://scholar.google.com/citations?user=smbRMokAAAAJ&hl=en) · [Keming Shen]() · [Zhe Kong](https://scholar.google.com/citations?user=4X3yLwsAAAAJ&hl=en) [Jianfeng Ren*](https://scholar.google.com/citations?user=ZZ928OgAAAAJ&hl=en) · [Ruibin Bai](https://scholar.google.com/citations?user=oP6AThIAAAAJ&hl=en) · [Rong Qu](https://scholar.google.com/citations?user=ErszCRMAAAAJ&hl=en) · [Linlin Shen*](https://scholar.google.com/citations?user=AZ_y9HgAAAAJ&hl=en) *Corresponding Authors [![arXiv](https://img.shields.io/badge/arXiv-MGMotionLLM-A10717.svg?logo=arXiv)](https://arxiv.org/abs/2504.02478)

Description

MG-MotionLLM can address diverse motion-relevant tasks at multiple granularities by giving different instructions in a unified manner. - coarse-grained: e.g. text-to-motion and motion captioning (upper block) - fine-grained: e.g. motion-to-detailed text and motion localization (bottom block).

teaser

To achieve this, we propose multi-granularity training scheme with novel auxiliary tasks captures motion-related features at different levels, improving understanding across a wide range of tasks. Specifically, we pretrain the model with a total of 28 distinct motion-relevant tasks, including 12 existing classical coarse-grained tasks and 16 newly proposed fine-grained ones. Here, we display examples of prompt templates for a part of tasks used during training.

tasks_template

Visualization

We display some novel applications of our MG-MotionLLM. - text-driven fine-grained motion editing: Temporal Editing (left), Spatial Editing (middle), and Spatial-Temporal Editing (right).

edit
  • fine-grained captioning of both whole (up) and partial (bottom) motion sequences, and motion localization via fine-grained textual description (middle).
novel_apps

More Information (code, weights, etc)

For code, weights, etc, please see here.

Bibtex

If you use our code in your research, kindly cite our work:

bibtex @article{wu2025mg, title={MG-MotionLLM: A Unified Framework for Motion Comprehension and Generation across Multiple Granularities}, author={Wu, Bizhu and Xie, Jinheng and Shen, Keming and Kong, Zhe and Ren, Jianfeng and Bai, Ruibin and Qu, Rong and Shen, Linlin}, journal={arXiv preprint arXiv:2504.02478}, year={2025} }

Owner

  • Name: Computer Vision Institute, SZU
  • Login: CVI-SZU
  • Kind: organization
  • Location: Shenzhen Univeristy, Shenzhen, China

Computer Vision Institute, Shenzhen University

GitHub Events

Total
  • Issues event: 1
  • Watch event: 26
  • Push event: 4
  • Create event: 2
Last Year
  • Issues event: 1
  • Watch event: 26
  • Push event: 4
  • Create event: 2