instance-segmentation

https://github.com/carleton-comps-cv4ad/instance-segmentation

Science Score: 36.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
○
DOI references
✓
Academic publication links
Links to: arxiv.org
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (12.5%) to scientific vocabulary

Last synced: 10 months ago · JSON representation

Repository

Basic Info

Host: GitHub
Owner: Carleton-Comps-CV4AD
License: apache-2.0
Language: Python
Default Branch: main
Size: 176 MB

Statistics

Stars: 0
Watchers: 0
Forks: 0
Open Issues: 0
Releases: 0

Created over 1 year ago · Last pushed over 1 year ago

Metadata Files

Readme Contributing License Code of conduct Citation

Instance Segmentation on CARLA Data for 2025 Carleton College Comps Project

Adapted from MMDetection

Contributers

| Name | Email | | -------------- | --------------------- | | Ben Zhao | benzhao90@gmail.com | | Nathaniel Li | lin@carleton.edu | | Julian Tanguma | tangumaj@carleton.edu | | David Toledo | toledod@carleton.edu | | Ethan Masadde(Indirectly) | masaddee@carleton.edu | | Josh Meier(Indirectly) | meierj@carleton.edu |

Forked from the MMDetection repository, which is an open source object detection toolbox that provides access to many different models.

We chose to adapt the implementation of MaskTrackRCNN, which is a video instance segmentation model, that was originally trained on the YoutubeVIS2021 dataset, to be trained on a custom made dataset of image sequences from the simulator CARLA to investigate how different weather situations impacted the performance of the model and how we could mitigate those changes in performance.

Environment Setup
Training
Visualization/Inference

Environment Setup

If desired, you can use the original documentation from OpenMMLab to go through the installation process. Otherwise, you can follow these simplified directions.

MMDetection works on Linux, Windows, and macOS. It requires Python 3.7+, CUDA 9.2+, and PyTorch 1.8+. We also had 1 GPU, though more would likely improve training and inference. MMDetection says that you can run their models on CPU only, but we did not try this.

Step 1: Install Miniconda

Step 2: Create and activate conda environment

bash conda create --name openmmlab python=3.8 -y conda activate openmmlab

Step 3: Install dependencies

bash conda install pytorch torchvision -c pytorch conda install fsspec pip install -U openmim mim install mmengine conda install cuda -c nvidia mim install mmcv=="2.1.0"

bash cd mmdetection pip install -v -e . pip install seaborn

Training

To train the model, first you must have the correct video data. Go to our CARLA data repo to produce that or email ___@gmail.com.

Once you have the data, you must create the proper annotations for it. First, you need your data to be organized by weather and frouped by videos. If you ran our custom datset generating script, you will need to do to the datageneration/generatevis_annotations.py file and edit the main portion of the code to specify the path to your data and the exact seeds you used to generate your data. You will then have to add the following lines of code:

```bash if name == 'main': path = '/Data/video_data' # substitute with your data path

unwrap_images(path, random_seeds)
#make sure data is organized into folders
enforce_video_organization(path)
split_train_val(path_test)
generate_vis_annotations(path) # this will generate annotations for your data

```

To generate annotations for your dataset you will need to run python datageneration/generatevis_annotations.py

Once your annotations are complete, go into the config of the model for the type of data that you want to train on, and change the dataroot variable at near the top of the file to point to the location of the weather type within the videodata directory. For example, if I want to train on clear day sequences, then set dataroot = 'pathtodata/videodata/clearday'. For training purposes, the datasettobetested variable inside each config does not matter, so use any of the config files within that weather's directory to train.

Then, you can run python tools/train.py [path to config] and it will create a work directory where logs and checkpoints at the end of each epoch will be saved. With 1 GPU our models took about 5 hours each to train, so it is likely best to set up a screen session (if on linux) first so nothing will interrupt the training.

Here is an example of what you would run if you wanted to train your model on clear day sequences:

bash python tools/train.py ./masktrack-rcnn_models/masktrack-rcnn_carla_clear_day/masktrack-rcnn_carla_clear_day_on_clear_day.py

To train all on all 4 weather types, make sure this train command is repeated for clearday, clearnight, rainyday, and foggyday by pointing to a config that corresponds to that weather type.

Evaluation

To evaluate the performance of the model on a particular dataset, we have the configs set up to calculate the COCO metrics for Average Precision and Average Recall. We primarily used Mean Average Precision to compare our models.

Run python tools/testtracking.py [path to config file] --checkpoint [path to checkpoint] to calculate these metrics. It should take 3-5 minutes and the output will be put to the terminal unless a text file is specified. It is also important to note that the config now matters because each config for each weather has the datasettobetested variable set individually. The naming of each config tells you which dataset that config is set up to test on: masktrack-rcnncarlaxony.py where x is the training set and y is the test set.

Here is an example of testing a model trained on clear day on foggy day scenes with the output to a text file:

bash python tools/test_tracking.py masktrack-rcnn_models/masktrack-rcnn_carla_clear_day/masktrack-rcnn_carla_clear_day_on_foggy_day.py --checkpoint masktrack-rcnn_models/masktrack-rcnn_carla_clear_day/epoch_12.pth > clear_day_on_foggy_day.txt

If you have all 4 model checkpoints trained for each weather and in the correct location (as specified in the script), you can also run this script to run all 16 tests (4 models on all 4 weather types):

bash ./tools/masktrack-rcnn_test.sh

Visualization/Inference

Step 1: create the mp4s

To visualize the output of each different model on the weather situations you need to have an mp4 (video) to give as input to the model instead of the sequences of 18 images that create a 3 second clip at 6 fps. We found the easiest way to do this was to use ffmpeg to create an mp4 from a video sequence. Run ffmpeg -framerate 6 -startnumber [number of first filename in video sequence] -i <pathtodata>/videodata//val/rgb//%04d.png -c:v mpeg4 -q:v 5 .mp4

Here's an example of this command being run:

bash ffmpeg -framerate 6 -start_number 7993 -i /Data/video_data/rainy_day/val/rgb/video_149/%04d.png -c:v mpeg4 -q:v 5 /Data/video_data/videos/video_149_rainy_day.mp4

We used this script to generate mp4s of 6 different videos for all 4 weathers. If you wish to use it, make sure the paths to the images and output paths are correct to your use case.

bash ./tools/visualizations/video_conversion.sh

Step 2: run your model on the mp4

Run python demo/mot_demo.py [path to mp4] [path to config] --checkpoint [path to checkpoint] --score-thr 0.5 --out [output file].mp4 to output another mp4 with the instance segmentation masks and bounding boxes from the model visualized on the video.

An example of this full command of the model trained on clear day being visualized on video49clear_day is:

bash python demo/mot_demo.py /Data/video_data/videos/video_49_clear_day.mp4 masktrack-rcnn_models/masktrack-rcnn_carla_clear_day/masktrack-rcnn_carla_clear_day.py --checkpoint masktrack-rcnn_models/masktrack-rcnn_carla_clear_day/epoch_12.pth --score-thr 0.5 --out /Data/video_data/video_results/clear_day_on_video_49_clear_day.mp4

We used this script to create visualizations of all 4 models on all 6 videos in all 4 weather situations. Feel free to modify it to your file paths as necessary:

bash ./tools/visualizations/model_vis.sh

Step 3 (optional): convert visualization mp4s into gifs

Run ffmpeg -i [path to mp4] -vf fps=15,scale=1080:-1:flags=lanczos,split[s0][s1];[s0]palettegen=statsmode=diff[p];[s1][p]paletteuse=dither=bayer:bayerscale=1:diff_mode=rectangle [path to output file].gif to create a gif from the mp4.

If you are looking to convert many mp4s to gifs, you can use this script to generate the file path names for all the videos by modifying it to look in the correct directory of your videos:

bash ./tools/video_filenames.sh

Then copy the list of video filenames into this script and run it to create all the gifs:

bash ./tools/gif_conversion.sh

Acknowledgement

MMDetection is an open source project that is contributed by researchers and engineers from various colleges and companies. We appreciate all the contributors who implement their methods or add new features, as well as users who give valuable feedbacks. We wish that the toolbox and benchmark could serve the growing research community by providing a flexible toolkit to reimplement existing methods and develop their own new detectors.

Citations

See the paper for the object detection toolbox/frame work we utilized: MMDetection

@article{mmdetection, title = {{MMDetection}: Open MMLab Detection Toolbox and Benchmark}, author = {Chen, Kai and Wang, Jiaqi and Pang, Jiangmiao and Cao, Yuhang and Xiong, Yu and Li, Xiaoxiao and Sun, Shuyang and Feng, Wansen and Liu, Ziwei and Xu, Jiarui and Zhang, Zheng and Cheng, Dazhi and Zhu, Chenchen and Cheng, Tianheng and Zhao, Qijie and Li, Buyu and Lu, Xin and Zhu, Rui and Wu, Yue and Dai, Jifeng and Wang, Jingdong and Shi, Jianping and Ouyang, Wanli and Loy, Chen Change and Lin, Dahua}, journal= {arXiv preprint arXiv:1906.07155}, year={2019} }

See the paper for the model we chose: MaskTrack-RCNN

@inproceedings{yang2019video, title={Video instance segmentation}, author={Yang, Linjie and Fan, Yuchen and Xu, Ning}, booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision}, pages={5188--5197}, year={2019} }

License

This project is released under the Apache 2.0 license.

Owner

Name: Carleton-Comps-CV4AD
Login: Carleton-Comps-CV4AD
Kind: organization

Repositories: 1
Profile: https://github.com/Carleton-Comps-CV4AD

GitHub Events

Total

Push event: 2
Create event: 2

Last Year

Push event: 2
Create event: 2

Dependencies

.github/workflows/deploy.yml actions

actions/checkout v2 composite
actions/setup-python v2 composite

.circleci/docker/Dockerfile docker

pytorch/pytorch ${PYTORCH}-cuda${CUDA}-cudnn${CUDNN}-devel build

docker/Dockerfile docker

pytorch/pytorch ${PYTORCH}-cuda${CUDA}-cudnn${CUDNN}-devel build

docker/serve/Dockerfile docker

pytorch/pytorch ${PYTORCH}-cuda${CUDA}-cudnn${CUDNN}-devel build

docker/serve_cn/Dockerfile docker

pytorch/pytorch ${PYTORCH}-cuda${CUDA}-cudnn${CUDNN}-devel build

requirements/albu.txt pypi

albumentations >=0.3.2

requirements/build.txt pypi

cython *
numpy *

requirements/docs.txt pypi

docutils ==0.16.0
myst-parser *
sphinx ==4.0.2
sphinx-copybutton *
sphinx_markdown_tables *
sphinx_rtd_theme ==0.5.2
urllib3 <2.0.0

requirements/mminstall.txt pypi

mmcv >=2.0.0rc4,<2.2.0
mmengine >=0.7.1,<1.0.0

requirements/multimodal.txt pypi

fairscale *
jsonlines *
nltk *
pycocoevalcap *
transformers *

requirements/optional.txt pypi

cityscapesscripts *
emoji *
fairscale *
imagecorruptions *
scikit-learn *

requirements/readthedocs.txt pypi

mmcv >=2.0.0rc4,<2.2.0
mmengine >=0.7.1,<1.0.0
scipy *
torch *
torchvision *
urllib3 <2.0.0

requirements/runtime.txt pypi

matplotlib *
numpy *
pycocotools *
scipy *
shapely *
six *
terminaltables *
tqdm *

requirements/tests.txt pypi

asynctest * test
cityscapesscripts * test
codecov * test
flake8 * test
imagecorruptions * test
instaboostfast * test
interrogate * test
isort ==4.3.21 test
kwarray * test
memory_profiler * test
nltk * test
onnx ==1.7.0 test
onnxruntime >=1.8.0 test
parameterized * test
prettytable * test
protobuf <=3.20.1 test
psutil * test
pytest * test
transformers * test
ubelt * test
xdoctest >=0.10.0 test
yapf * test

requirements/tracking.txt pypi

mmpretrain *
motmetrics *
numpy <1.24.0
scikit-learn *
seaborn *

requirements.txt pypi

requirements_1.txt pypi

setup.py pypi

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science