2024videoactionrec
Science Score: 54.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
✓Academic publication links
Links to: arxiv.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (6.9%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: VisualMediaLaboratory
- License: apache-2.0
- Language: Python
- Default Branch: hnu_action_demo
- Size: 68.3 MB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 2
- Open Issues: 1
- Releases: 0
Metadata Files
README.md
쓰레기 불법 투기 감지를 위한 영상처리 알고리즘 연구
Introduction
Features
- 행동 인식(Action Recognition) 신경망을 활용한 불법 쓰레기 무단 투기 행동 감지
- MMaction2에서 지원하는 Kinetics-400으로 사전학습된 I3D(Inflated 3D ConvNet) 신경망을 미세 조정(Fine Tuning) 학습을 진행하여 성능 개선
- AI-HUB에서 제공하는 이상행동 CCTV 영상의 투기(dump) 데이터 셋을 사용하여 미세 조정 학습 진행 ### I3D(Inflated 3D ConvNet)
- 비디오 데이터의 행동 인식을 위해 설계된 딥러닝 모델로, 2D CNN을 기반으로 3D CNN으로 확장한 모델
- 각 컨볼루션 필터가 2D 대신 3D로 작동하며, 동영상 데이터를 처리할 수 있도록 시간 축 정보도 포함됨
- RGB 데이터와 Optical Flow 데이터를 3D Conv로 처리하여 시간 흐름에 따른 공간적 변화를 학습
- Google의 Inception V1 아키텍처를 3D로 확장하여 설계되었음
Installation
가상환경 생성
python3.8 -m venv venv
source ./venv/bin/activate
pip install -U pip
Dependencies
- CUDA11.8
- Python3.8
- pytorch 2.0.0+cu118
- torchvision 0.15.1+cu118
mmaction 관련 패키지 설치
pip install -U openmim
mim install mmengine
mim install "mmcv==2.1.0"
mim install mmdet
Project build
git clone "this repository" project
cd project
pip install -v -e .
Run
Video input Video output download link (weight file) : i3dimagenet-pretrained-r50-heavy8xb8-32x2x1-100ekinetics400-rgbtrainedbyhnuepoch40.pth
download link (demo video, dump) : i3dimagenet-pretrained-r50-heavy8xb8-32x2x1-100ekinetics400-rgbtrainedbyhnuepoch40.pth
python demo/demo_visualize.py configs/recognition/i3d/i3d_dense_trained_by_hnu.py \
i3d_imagenet-pretrained-r50-heavy_8xb8-32x2x1-100e_kinetics400-rgb_trained_by_hnu_epoch_40.pth \
demo/167-3_cam02_dump02_place04_day_summer.mp4 tools/data/kinetics/label_map_hnu_label.txt \
--out-filename output.mp4
|
|
|
|:-----------------:|:---------------:|
| 투기행위 신뢰도 : 0.78 (야간) | 투기행위 신뢰도 : 0.98 (주간) |
|
|
|
| 투기행위 신뢰도 : 0.91 (주간) | 투기행위 신뢰도 : 0.92 (주간) |
Related linkes
- I3D : Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
- MMDetection: OpenMMLab detection toolbox and benchmark.
- MMAction2: OpenMMLab Video Action Recognition toolbox and benchmark.
Owner
- Login: VisualMediaLaboratory
- Kind: user
- Repositories: 1
- Profile: https://github.com/VisualMediaLaboratory
Citation (CITATION.cff)
cff-version: 1.2.0 message: "If you use this software, please cite it as below." authors: - name: "MMAction2 Contributors" title: "OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark" date-released: 2020-07-21 url: "https://github.com/open-mmlab/mmaction2" license: Apache-2.0
GitHub Events
Total
- Pull request event: 5
- Fork event: 2
- Create event: 2
Last Year
- Pull request event: 5
- Fork event: 2
- Create event: 2
Dependencies
- actions/checkout v2 composite
- actions/setup-python v2 composite
- actions/checkout v3 composite
- actions/setup-python v4 composite
- codecov/codecov-action v1.0.14 composite
- actions/checkout v3 composite
- actions/setup-python v4 composite
- codecov/codecov-action v1.0.14 composite
- actions/checkout v2 composite
- actions/setup-python v2 composite
- pytorch/pytorch ${PYTORCH}-cuda${CUDA}-cudnn${CUDNN}-devel build
- pytorch/pytorch ${PYTORCH}-cuda${CUDA}-cudnn${CUDNN}-devel build
- pytorch/pytorch ${PYTORCH}-cuda${CUDA}-cudnn${CUDNN}-devel build
- Pillow *
- decord >=0.4.1
- einops *
- matplotlib *
- numpy *
- opencv-contrib-python *
- scipy *
- torch >=1.3
- docutils ==0.18.1
- einops *
- modelindex *
- myst-parser *
- opencv-python *
- scipy *
- sphinx ==6.1.3
- sphinx-notfound-page *
- sphinx-tabs *
- sphinx_copybutton *
- sphinx_markdown_tables *
- sphinxcontrib-jquery *
- tabulate *
- mmcv >=2.0.0rc4,<2.2.0
- mmengine >=0.7.1,<1.0.0
- transformers >=4.28.0
- PyTurboJPEG *
- av >=9.0
- future *
- imgaug *
- librosa *
- lmdb *
- moviepy *
- openai-clip *
- packaging *
- pims *
- soundfile *
- tensorboard *
- wandb *
- mmcv *
- titlecase *
- torch *
- torchvision *
- coverage * test
- flake8 * test
- interrogate * test
- isort ==4.3.21 test
- parameterized * test
- pytest * test
- pytest-runner * test
- xdoctest >=0.10.0 test
- yapf * test
- ca-certificates 2020.1.1.*
- certifi 2020.4.5.1.*
- ffmpeg 2.8.6.*
- libcxx 10.0.0.*
- libedit 3.1.20181209.*
- libffi 3.3.*
- ncurses 6.2.*
- openssl 1.1.1g.*
- pip 20.0.2.*
- python 3.7.7.*
- readline 8.0.*
- setuptools 46.4.0.*
- sqlite 3.31.1.*
- tk 8.6.8.*
- wheel 0.34.2.*
- xz 5.2.5.*
- zlib 1.2.11.*
- ca-certificates 2020.1.1.*
- certifi 2020.4.5.1.*
- ffmpeg 2.8.6.*
- libcxx 10.0.0.*
- libedit 3.1.20181209.*
- libffi 3.3.*
- ncurses 6.2.*
- openssl 1.1.1g.*
- pip 20.0.2.*
- python 3.7.7.*
- readline 8.0.*
- setuptools 46.4.0.*
- sqlite 3.31.1.*
- tk 8.6.8.*
- wheel 0.34.2.*
- xz 5.2.5.*
- zlib 1.2.11.*
- ca-certificates 2020.1.1.*
- certifi 2020.4.5.1.*
- ffmpeg 2.8.6.*
- libcxx 10.0.0.*
- libedit 3.1.20181209.*
- libffi 3.3.*
- ncurses 6.2.*
- openssl 1.1.1g.*
- pip 20.0.2.*
- python 3.7.7.*
- readline 8.0.*
- setuptools 46.4.0.*
- sqlite 3.31.1.*
- tk 8.6.8.*
- wheel 0.34.2.*
- xz 5.2.5.*
- zlib 1.2.11.*
- ca-certificates 2020.1.1.*
- certifi 2020.4.5.1.*
- ffmpeg 2.8.6.*
- libcxx 10.0.0.*
- libedit 3.1.20181209.*
- libffi 3.3.*
- ncurses 6.2.*
- openssl 1.1.1g.*
- pip 20.0.2.*
- python 3.7.7.*
- readline 8.0.*
- setuptools 46.4.0.*
- sqlite 3.31.1.*
- tk 8.6.8.*
- wheel 0.34.2.*
- xz 5.2.5.*
- zlib 1.2.11.*