https://github.com/cliangyu/mmaction2
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
Science Score: 10.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
○codemeta.json file
-
○.zenodo.json file
-
○DOI references
-
✓Academic publication links
Links to: arxiv.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (10.2%) to scientific vocabulary
Last synced: 9 months ago
·
JSON representation
Repository
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
Basic Info
- Host: GitHub
- Owner: cliangyu
- License: apache-2.0
- Language: Python
- Default Branch: master
- Homepage: https://mmaction2.readthedocs.io
- Size: 63.8 MB
Statistics
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
- Releases: 0
Fork of open-mmlab/mmaction2
Created about 4 years ago
· Last pushed almost 4 years ago
https://github.com/cliangyu/mmaction2/blob/master/
English | [](/README_zh-CN.md) ## Introduction MMAction2 is an open-source toolbox for video understanding based on PyTorch. It is a part of the [OpenMMLab](http://openmmlab.org/) project. The master branch works with **PyTorch 1.5+**.
OpenMMLab website HOT OpenMMLab platform TRY IT OUT[](https://mmaction2.readthedocs.io/en/latest/) [](https://github.com/open-mmlab/mmaction2/actions) [](https://codecov.io/gh/open-mmlab/mmaction2) [](https://pypi.org/project/mmaction2/) [](https://github.com/open-mmlab/mmaction2/blob/master/LICENSE) [](https://github.com/open-mmlab/mmaction2/issues) [](https://github.com/open-mmlab/mmaction2/issues) [Documentation](https://mmaction2.readthedocs.io/en/latest/) | [Installation](https://mmaction2.readthedocs.io/en/latest/install.html) | [Model Zoo](https://mmaction2.readthedocs.io/en/latest/modelzoo.html) | [Update News](https://mmaction2.readthedocs.io/en/latest/changelog.html) | [Ongoing Projects](https://github.com/open-mmlab/mmaction2/projects) | [Reporting Issues](https://github.com/open-mmlab/mmaction2/issues/new/choose)
Action Recognition Results on Kinetics-400
Skeleton-base Action Recognition Results on NTU-RGB+D-120
Skeleton-based Spatio-Temporal Action Detection and Action Recognition Results on Kinetics-400
## Major Features - **Modular design**: We decompose a video understanding framework into different components. One can easily construct a customized video understanding framework by combining different modules. - **Support four major video understanding tasks**: MMAction2 implements various algorithms for multiple video understanding tasks, including action recognition, action localization, spatio-temporal action detection, and skeleton-based action detection. We support **27** different algorithms and **20** different datasets for the four major tasks. - **Well tested and documented**: We provide detailed documentation and API reference, as well as unit tests. ## What's New - (2022-03-04) We support **Multigrid** on Kinetics400, achieve 76.07% Top-1 accuracy and accelerate training speed. - (2021-11-24) We support **2s-AGCN** on NTU60 XSub, achieve 86.06% Top-1 accuracy on joint stream and 86.89% Top-1 accuracy on bone stream respectively. - (2021-10-29) We provide a demo for skeleton-based and rgb-based spatio-temporal detection and action recognition (demo/demo_video_structuralize.py). - (2021-10-26) We train and test **ST-GCN** on NTU60 with 3D keypoint annotations, achieve 84.61% Top-1 accuracy (higher than 81.5% in the [paper](https://www.aaai.org/ocs/index.php/AAAI/AAAI18/paper/viewPaper/17135)). - (2021-10-25) We provide a script(tools/data/skeleton/gen_ntu_rgbd_raw.py) to convert the NTU60 and NTU120 3D raw skeleton data to our format. - (2021-10-25) We provide a [guide](https://github.com/open-mmlab/mmaction2/blob/master/configs/skeleton/posec3d/custom_dataset_training.md) on how to train PoseC3D with custom datasets, [bit-scientist](https://github.com/bit-scientist) authored this PR! - (2021-10-16) We support **PoseC3D** on UCF101 and HMDB51, achieves 87.0% and 69.3% Top-1 accuracy with 2D skeletons only. Pre-extracted 2D skeletons are also available. **Release**: v0.24.0 was released in 05/05/2022. Please refer to [changelog.md](docs/changelog.md) for details and release history. ## Installation MMAction2 depends on [PyTorch](https://pytorch.org/), [MMCV](https://github.com/open-mmlab/mmcv), [MMDetection](https://github.com/open-mmlab/mmdetection) (optional), and [MMPose](https://github.com/open-mmlab/mmdetection)(optional). Below are quick steps for installation. Please refer to [install.md](docs/install.md) for more detailed instruction. ```shell conda create -n open-mmlab python=3.8 pytorch=1.10 cudatoolkit=11.3 torchvision -c pytorch -y conda activate open-mmlab pip3 install openmim mim install mmcv-full mim install mmdet # optional mim install mmpose # optional git clone https://github.com/open-mmlab/mmaction2.git cd mmaction2 pip3 install -e . ``` ## Get Started Please see [getting_started.md](docs/getting_started.md) for the basic usage of MMAction2. There are also tutorials: - [learn about configs](docs/tutorials/1_config.md) - [finetuning models](docs/tutorials/2_finetune.md) - [adding new dataset](docs/tutorials/3_new_dataset.md) - [designing data pipeline](docs/tutorials/4_data_pipeline.md) - [adding new modules](docs/tutorials/5_new_modules.md) - [exporting model to onnx](docs/tutorials/6_export_model.md) - [customizing runtime settings](docs/tutorials/7_customize_runtime.md) A Colab tutorial is also provided. You may preview the notebook [here](demo/mmaction2_tutorial.ipynb) or directly [run](https://colab.research.google.com/github/open-mmlab/mmaction2/blob/master/demo/mmaction2_tutorial.ipynb) on Colab. ## Supported Methods
Spatio-Temporal Action Detection Results on AVA-2.1
| Action Recognition | ||||
| C3D (CVPR'2014) | TSN (ECCV'2016) | I3D (CVPR'2017) | I3D Non-Local (CVPR'2018) | R(2+1)D (CVPR'2018) |
| TRN (ECCV'2018) | TSM (ICCV'2019) | TSM Non-Local (ICCV'2019) | SlowOnly (ICCV'2019) | SlowFast (ICCV'2019) |
| CSN (ICCV'2019) | TIN (AAAI'2020) | TPN (CVPR'2020) | X3D (CVPR'2020) | OmniSource (ECCV'2020) |
| MultiModality: Audio (ArXiv'2020) | TANet (ArXiv'2020) | TimeSformer (ICML'2021) | ||
| Action Localization | ||||
| SSN (ICCV'2017) | BSN (ECCV'2018) | BMN (ICCV'2019) | ||
| Spatio-Temporal Action Detection | ||||
| ACRN (ECCV'2018) | SlowOnly+Fast R-CNN (ICCV'2019) | SlowFast+Fast R-CNN (ICCV'2019) | LFB (CVPR'2019) | |
| Skeleton-based Action Recognition | ||||
| ST-GCN (AAAI'2018) | 2s-AGCN (CVPR'2019) | PoseC3D (ArXiv'2021) | ||
| Action Recognition | |||
| HMDB51 (Homepage) (ICCV'2011) | UCF101 (Homepage) (CRCV-IR-12-01) | ActivityNet (Homepage) (CVPR'2015) | Kinetics-[400/600/700] (Homepage) (CVPR'2017) |
| SthV1 (Homepage) (ICCV'2017) | SthV2 (Homepage) (ICCV'2017) | Diving48 (Homepage) (ECCV'2018) | Jester (Homepage) (ICCV'2019) |
| Moments in Time (Homepage) (TPAMI'2019) | Multi-Moments in Time (Homepage) (ArXiv'2019) | HVU (Homepage) (ECCV'2020) | OmniSource (Homepage) (ECCV'2020) |
| FineGYM (Homepage) (CVPR'2020) | |||
| Action Localization | |||
| THUMOS14 (Homepage) (THUMOS Challenge 2014) | ActivityNet (Homepage) (CVPR'2015) | ||
| Spatio-Temporal Action Detection | |||
| UCF101-24* (Homepage) (CRCV-IR-12-01) | JHMDB* (Homepage) (ICCV'2015) | AVA (Homepage) (CVPR'2018) | |
| Skeleton-based Action Recognition | |||
| PoseC3D-FineGYM (Homepage) (ArXiv'2021) | PoseC3D-NTURGB+D (Homepage) (ArXiv'2021) | PoseC3D-UCF101 (Homepage) (ArXiv'2021) | PoseC3D-HMDB51 (Homepage) (ArXiv'2021) |
Owner
- Name: Liangyu Chen
- Login: cliangyu
- Kind: user
- Location: Singapore
- Company: Nanyang Technological University
- Website: cliangyu.com
- Twitter: cliangyu_
- Repositories: 1
- Profile: https://github.com/cliangyu



