mmaction2-master

https://github.com/zjyyjz556/mmaction2-master

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓
CITATION.cff file
Found CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
○
DOI references
○
Academic publication links
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (14.7%) to scientific vocabulary

Last synced: 11 months ago · JSON representation ·

Repository

Basic Info

Host: GitHub
Owner: zjyyjz556
License: apache-2.0
Language: Python
Default Branch: main
Size: 23.7 MB

Statistics

Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Releases: 0

Created almost 3 years ago · Last pushed almost 3 years ago

Metadata Files

Readme Contributing License Code of conduct Citation Support

Introduction

English | 简体中文

MMAction2 is an open-source toolbox for video understanding based on PyTorch. It is a part of the OpenMMLab project.

The master branch works with PyTorch 1.3+.

Action Recognition Results on Kinetics-400

Spatio-Temporal Action Detection Results on AVA-2.1

Skeleton-base Action Recognition Results on NTU-RGB+D-120

Major Features

Modular design

We decompose the video understanding framework into different components and one can easily construct a customized video understanding framework by combining different modules.

Support for various datasets

The toolbox directly supports multiple datasets, UCF101, Kinetics-[400/600/700], Something-Something V1&V2, Moments in Time, Multi-Moments in Time, THUMOS14, etc.

Support for multiple video understanding frameworks

MMAction2 implements popular frameworks for video understanding:

For action recognition, various algorithms are implemented, including TSN, TSM, TIN, R(2+1)D, I3D, SlowOnly, SlowFast, CSN, Non-local, etc.
For temporal action localization, we implement BSN, BMN, SSN.
For spatial temporal detection, we implement SlowOnly, SlowFast.
- Well tested and documented

We provide detailed documentation and API reference, as well as unittests.

Changelog

v0.18.0 was released in 02/09/2021. Please refer to changelog.md for details and release history.

Benchmark

| Model |input| io backend | batch size x gpus | MMAction2 (s/iter) | MMAction (s/iter) | Temporal-Shift-Module (s/iter) | PySlowFast (s/iter) | | :--- | :---------------:|:---------------:| :---------------:| :---------------: | :--------------------: | :----------------------------: | :-----------------: | | TSN| 256p rawframes |Memcached| 32x8|0.32 | 0.38| 0.42| x | | TSN| 256p dense-encoded video |Disk| 32x8|0.61| x | x | TODO | |I3D heavy|256p videos|Disk |8x8| 0.34 | x | x | 0.44 | | I3D|256p rawframes|Memcached|8x8| 0.43 | 0.56 | x | x | | TSM |256p rawframes|Memcached| 8x8|0.31 | x | 0.41 | x | | Slowonly|256p videos|Disk|8x8 | 0.32 | TODO | x | 0.34 | | Slowfast|256p videos|Disk|8x8 | 0.69 | x | x | 1.04 | | R(2+1)D|256p videos |Disk| 8x8|0.45 | x | x | x |

Details can be found in benchmark.

ModelZoo

Supported methods for Action Recognition:

(click to collapse)

- ✅ [TSN](configs/recognition/tsn/README.md) (ECCV'2016) - ✅ [TSM](configs/recognition/tsm/README.md) (ICCV'2019) - ✅ [TSM Non-Local](configs/recognition/tsm/README.md) (ICCV'2019) - ✅ [R(2+1)D](configs/recognition/r2plus1d/README.md) (CVPR'2018) - ✅ [I3D](configs/recognition/i3d/README.md) (CVPR'2017) - ✅ [I3D Non-Local](configs/recognition/i3d/README.md) (CVPR'2018) - ✅ [SlowOnly](configs/recognition/slowonly/README.md) (ICCV'2019) - ✅ [SlowFast](configs/recognition/slowfast/README.md) (ICCV'2019) - ✅ [CSN](configs/recognition/csn/README.md) (ICCV'2019) - ✅ [TIN](configs/recognition/tin/README.md) (AAAI'2020) - ✅ [TPN](configs/recognition/tpn/README.md) (CVPR'2020) - ✅ [C3D](configs/recognition/c3d/README.md) (CVPR'2014) - ✅ [X3D](configs/recognition/x3d/README.md) (CVPR'2020) - ✅ [OmniSource](configs/recognition/omnisource/README.md) (ECCV'2020) - ✅ [MultiModality: Audio](configs/recognition_audio/resnet/README.md) (ArXiv'2020) - ✅ [TANet](configs/recognition/tanet/README.md) (ArXiv'2020) - ✅ [TRN](configs/recognition/trn/README.md) (CVPR'2015) - ✅ [Timesformer](configs/recognition/timesformer/README.md) (ICML'2021)

Supported methods for Temporal Action Detection:

(click to collapse)

- ✅ [BSN](configs/localization/bsn/README.md) (ECCV'2018) - ✅ [BMN](configs/localization/bmn/README.md) (ICCV'2019) - ✅ [SSN](configs/localization/ssn/README.md) (ICCV'2017)

Supported methods for Spatial Temporal Action Detection:

(click to collapse)

- ✅ [ACRN](configs/detection/acrn/README.md) (ECCV'2018) - ✅ [SlowOnly+Fast R-CNN](configs/detection/ava/README.md) (ICCV'2019) - ✅ [SlowFast+Fast R-CNN](configs/detection/ava/README.md) (ICCV'2019) - ✅ [Long-Term Feature Bank](configs/detection/lfb/README.md) (CVPR'2019)

Supported methods for Skeleton-based Action Recognition:

(click to collapse)

- ✅ [PoseC3D](configs/skeleton/posec3d/README.md) (ArXiv'2021) - ✅ [STGCN](configs/skeleton/stgcn/README.md) (AAAI'2018)

Results and models are available in the README.md of each method's config directory. A summary can be found in the model zoo page.

We will keep up with the latest progress of the community, and support more popular algorithms and frameworks. If you have any feature requests, please feel free to leave a comment in Issues.

Dataset

Supported datasets:

Supported datasets for Action Recognition:

(click to collapse)

- ✅ [UCF101](/tools/data/ucf101/README.md) \[ [Homepage](https://www.crcv.ucf.edu/research/data-sets/ucf101/) \] (CRCV-IR-12-01) - ✅ [HMDB51](/tools/data/hmdb51/README.md) \[ [Homepage](https://serre-lab.clps.brown.edu/resource/hmdb-a-large-human-motion-database/) \] (ICCV'2011) - ✅ [Kinetics-[400/600/700]](/tools/data/kinetics/README.md) \[ [Homepage](https://deepmind.com/research/open-source/kinetics) \] (CVPR'2017) - ✅ [Something-Something V1](/tools/data/sthv1/README.md) \[ [Homepage](https://20bn.com/datasets/something-something/v1) \] (ICCV'2017) - ✅ [Something-Something V2](/tools/data/sthv2/README.md) \[ [Homepage](https://20bn.com/datasets/something-something) \] (ICCV'2017) - ✅ [Moments in Time](/tools/data/mit/README.md) \[ [Homepage](http://moments.csail.mit.edu/) \] (TPAMI'2019) - ✅ [Multi-Moments in Time](/tools/data/mmit/README.md) \[ [Homepage](http://moments.csail.mit.edu/challenge_iccv_2019.html) \] (ArXiv'2019) - ✅ [HVU](/tools/data/hvu/README.md) \[ [Homepage](https://github.com/holistic-video-understanding/HVU-Dataset) \] (ECCV'2020) - ✅ [Jester](/tools/data/jester/README.md) \[ [Homepage](https://20bn.com/datasets/jester/v1) \] (ICCV'2019) - ✅ [GYM](/tools/data/gym/README.md) \[ [Homepage](https://sdolivia.github.io/FineGym/) \] (CVPR'2020) - ✅ [ActivityNet](/tools/data/activitynet/README.md) \[ [Homepage](http://activity-net.org/) \] (CVPR'2015) - ✅ [Diving48](/tools/data/diving48/README.md) \[ [Homepage](http://www.svcl.ucsd.edu/projects/resound/dataset.html) \] (ECCV'2018) - ✅ [OmniSource](/tools/data/omnisource/README.md) \[ [Homepage](https://kennymckormick.github.io/omnisource/) \] (ECCV'2020)

Supported datasets for Temporal Action Detection

(click to collapse)

- ✅ [ActivityNet](/tools/data/activitynet/README.md) \[ [Homepage](http://activity-net.org/) \] (CVPR'2015) - ✅ [THUMOS14](/tools/data/thumos14/README.md) \[ [Homepage](https://www.crcv.ucf.edu/THUMOS14/download.html) \] (THUMOS Challenge 2014)

Supported datasets for Spatial Temporal Action Detection

(click to collapse)

- ✅ [AVA](/tools/data/ava/README.md) \[ [Homepage](https://research.google.com/ava/index.html) \] (CVPR'2018) - 🔲 [UCF101-24](/tools/data/ucf101_24/README.md) \[ [Homepage](http://www.thumos.info/download.html) \] (CRCV-IR-12-01) - 🔲 [JHMDB](/tools/data/jhmdb/README.md) \[ [Homepage](http://jhmdb.is.tue.mpg.de/) \] (ICCV'2013)

Supported datasets for Skeleton-based Action Detection

(click to collapse)

- ✅ [PoseC3D-FineGYM](/tools/data/skeleton/README.md) \[ [Homepage](https://kennymckormick.github.io/posec3d/) \] (arXiv'2021)

Datasets marked with 🔲 are not fully supported yet, but related dataset preparation steps are provided.

Installation

Please refer to install.md for installation.

Data Preparation

Please refer to data_preparation.md for a general knowledge of data preparation. The supported datasets are listed in supported_datasets.md

Get Started

Please see getting_started.md for the basic usage of MMAction2. There are also tutorials:

A Colab tutorial is also provided. You may preview the notebook here or directly run on Colab.

FAQ

Please refer to FAQ for frequently asked questions.

License

This project is released under the Apache 2.0 license.

Citation

If you find this project useful in your research, please consider cite:

BibTeX @misc{2020mmaction2, title={OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark}, author={MMAction2 Contributors}, howpublished = {\url{https://github.com/open-mmlab/mmaction2}}, year={2020} }

Contributing

We appreciate all contributions to improve MMAction2. Please refer to CONTRIBUTING.md in MMCV for more details about the contributing guideline.

Acknowledgement

MMAction2 is an open source project that is contributed by researchers and engineers from various colleges and companies. We appreciate all the contributors who implement their methods or add new features, as well as users who give valuable feedbacks. We wish that the toolbox and benchmark could serve the growing research community by providing a flexible toolkit to reimplement existing methods and develop their own new models.

Projects in OpenMMLab

MMCV: OpenMMLab foundational library for computer vision.
MIM: MIM Installs OpenMMLab Packages.
MMClassification: OpenMMLab image classification toolbox and benchmark.
MMDetection: OpenMMLab detection toolbox and benchmark.
MMDetection3D: OpenMMLab's next-generation platform for general 3D object detection.
MMSegmentation: OpenMMLab semantic segmentation toolbox and benchmark.
MMAction2: OpenMMLab's next-generation video understanding toolbox and benchmark.
MMTracking: OpenMMLab video perception toolbox and benchmark.
MMPose: OpenMMLab pose estimation toolbox and benchmark.
MMEditing: OpenMMLab image and video editing toolbox.
MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and Understanding.
MMGeneration: OpenMMLab image and video generative models toolbox.

Owner

Name: zengjiayu
Login: zjyyjz556
Kind: user

Repositories: 1
Profile: https://github.com/zjyyjz556

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
  - name: "MMAction2 Contributors"
title: "OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark"
date-released: 2020-07-21
url: "https://github.com/open-mmlab/mmaction2"
license: Apache-2.0

GitHub Events

Total

Last Year

Dependencies

.github/workflows/build.yml actions

actions/checkout v2 composite
actions/setup-python v2 composite
codecov/codecov-action v1.0.14 composite

.github/workflows/deploy.yml actions

actions/checkout v2 composite
actions/setup-python v2 composite

docker/Dockerfile docker

pytorch/pytorch ${PYTORCH}-cuda${CUDA}-cudnn${CUDNN}-devel build

requirements/build.txt pypi

numpy *
torch >=1.3

requirements/docs.txt pypi

docutils ==0.16.0
einops *
myst-parser *
opencv-python *
recommonmark *
scipy *
sphinx ==4.0.2
sphinx_copybutton *
sphinx_markdown_tables *
sphinx_rtd_theme ==0.5.2

requirements/mminstall.txt pypi

mmcv-full >=1.3.1

requirements/optional.txt pypi

PyTurboJPEG *
av *
decord >=0.4.1
einops *
imgaug *
librosa *
lmdb *
moviepy *
onnx *
onnxruntime *
pims *
timm *

requirements/readthedocs.txt pypi

mmcv *
titlecase *
torch *
torchvision *

requirements/runtime.txt pypi

Pillow *
decord *
einops *
matplotlib *
numpy *
opencv-contrib-python *
scipy *

requirements/tests.txt pypi

coverage * test
flake8 * test
interrogate * test
isort ==4.3.21 test
pytest * test
pytest-runner * test
xdoctest >=0.10.0 test
yapf * test

requirements.txt pypi

setup.py pypi

tools/data/activitynet/environment.yml pypi

decorator ==4.4.2
intel-openmp ==2019.0
joblib ==0.15.1
mkl ==2019.0
numpy ==1.18.4
olefile ==0.46
pandas ==1.0.3
python-dateutil ==2.8.1
pytz ==2020.1
six ==1.14.0
youtube-dl ==2020.5.8

tools/data/gym/environment.yml pypi

decorator ==4.4.2
intel-openmp ==2019.0
joblib ==0.15.1
mkl ==2019.0
numpy ==1.18.4
olefile ==0.46
pandas ==1.0.3
python-dateutil ==2.8.1
pytz ==2020.1
six ==1.14.0
youtube-dl ==2020.5.8

tools/data/hvu/environment.yml pypi

decorator ==4.4.2
intel-openmp ==2019.0
joblib ==0.15.1
mkl ==2019.0
numpy ==1.18.4
olefile ==0.46
pandas ==1.0.3
python-dateutil ==2.8.1
pytz ==2020.1
six ==1.14.0
youtube-dl ==2020.5.8

tools/data/kinetics/environment.yml pypi

decorator ==4.4.2
intel-openmp ==2019.0
joblib ==0.15.1
mkl ==2019.0
numpy ==1.18.4
olefile ==0.46
pandas ==1.0.3
python-dateutil ==2.8.1
pytz ==2020.1
six ==1.14.0
youtube-dl ==2020.5.8

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science

mmaction2-master

Science Score: 44.0%

Repository

Basic Info

Statistics

Metadata Files

README.md

Introduction

Major Features

Changelog

Benchmark

ModelZoo

Dataset

Installation

Data Preparation

Get Started

FAQ

License

Citation

Contributing

Acknowledgement

Projects in OpenMMLab

Owner

Citation (CITATION.cff)

GitHub Events

Total

Last Year

Dependencies