cs231n-3d-segmentation

Stanford CS231n Deep Learning for Computer Vision Class Project

https://github.com/jamesbraza/cs231n-3d-segmentation

Keywords

computer-vision deep-learning medical-imaging segmentation stanford

Last synced: 9 months ago · JSON representation ·

Repository

Stanford CS231n Deep Learning for Computer Vision Class Project

Basic Info

Host: GitHub
Owner: jamesbraza
License: mit
Language: Python
Default Branch: main
Homepage:
Size: 38.7 MB

Statistics

Stars: 1
Watchers: 3
Forks: 0
Open Issues: 0
Releases: 0

Topics

computer-vision deep-learning medical-imaging segmentation stanford

Created about 3 years ago · Last pushed almost 3 years ago

Metadata Files

Readme License Citation

cs231n-3d-segmentation

Stanford CS231N Deep Learning for Computer Vision: Class Project

Summary

3-D U-Net

A 3-D U-Net gets passed batches of four MRI scans (B x 4 x D x H x W), and processes them using 3-D convolution and max pooling layers. The idea is the model can utilize 3-D spatial information, not just 2-D (per-slice) info.

Trained weights are available in unet_zoo/checkpoints. The binary threshold used on the 3-D U-Net model's raw predictions was 0.25, we found this maximized the IoU within the validation dataset's binarized predictions and labels.

Here are several cross-sections from the test dataset's example 2:

| Angle | Image | | :---: | :-------------------------------------------------------------------: | | 0 | cross-section angle 1 | | 1 | | | 2 | cross-section angle 2 |

Here is the same example rendered in 3-D. The color code is grey is healthy tissue, green is non-enhancing tumor core, blue is peritumoral edema, and purple is Gd-enhancing tumor.

2-D U-Net

On the branch experiment/unet2d, we trained a 2-D U-Net. A 2-D U-Net gets passed batches of four MRI slices (B x 4 x H x W), where the batch dimension B is actually the original MRI's depth dimension D.

This slicing detail effectively turns a batch of MRI volumes into a batch of slices from MRI volumes. This means a 2-D U-Net doesn't learn upon data containing 3-D spatial information. The 2-D U-Net model was trained using the slices dataset found in data/loaders.py.

The 2-D U-Net uses 2-D convolution and max pooling layers (as opposed to 3-D ones), so it has 1/3rd the weights of a 3-D U-Net. This lightweight model runs substantially faster, but, given the same training data, generally under-performs a 3-D U-Net when looking at IoU with labels.

Trained weights are available in unet_zoo/checkpoints on the experiment/unet2d branch. The binary threshold used on the 2-D U-Net model's raw predictions was 0.06, using the IoU procedure detailed in the 3-D U-Net summary.

Here are several cross-sections from the test dataset's example 2:

| Angle | Image | | :---: | :-------------------------------------------------------------------: | | 0 | cross-section angle 1 | | 1 | | | 2 | cross-section angle 2 |

Here is the same example rendered in 3-D, its color code corresponds with the 3-D U-Net summary's colors.

Dataset

We used the BraTS2020 Dataset (Training + Validation) dataset from Kaggle. All iterations of the BraTS challenge can be found here.

Here's how to easily download the dataset with the Kaggle API:

console kaggle datasets download -p data/brats2020-training-validation-data --unzip awsaf49/brats20-dataset-training-validation

Development

Development began in spring 2023 using Python 3.11.

AWS

AWS granted access to G and VT instances, and at the time, AWS's Deep Learning AMI supported G3, P3, P3dn, P4d, P4de, G5, G4dn instances. Thus, the AMI used was Deep Learning AMI GPU PyTorch 2.0.0 (Ubuntu 20.04) 20230530 (release notes) with instance type g4dn.2xlarge and 120 GiB of gp3 (general purpose SSD) storage.

For your reference, the below commands take less than 20 minutes to run.

Step 1: check GPU is present.

shell nvcc --version # cuda_11.8 nvidia-smi # NVIDIA Tesla T4 sudo apt update && sudo apt upgrade -y sudo apt install -y ubuntu-drivers-common alsa-utils ubuntu-drivers devices # Drivers: nvidia-driver-525, nvidia-driver-525-server

Step 2: install and configure Python 3.11.

shell python3 --version # 3.8.10 sudo apt update && sudo apt upgrade -y sudo add-apt-repository -y ppa:deadsnakes/ppa sudo apt install -y python3.11 python3.11-dev python3.11-venv sudo update-alternatives --install /usr/bin/python3 python3 /usr/bin/python3.8 1 sudo update-alternatives --install /usr/bin/python3 python3 /usr/bin/python3.11 2 python3 --version # 3.11.3

Step 3: git clone and install requirements into a venv.

shell git clone https://github.com/jamesbraza/cs231n-3d-segmentation.git cd cs231n-3d-segmentation python3 -m venv venv source venv/bin/activate python -m pip install --no-cache-dir --progress-bar off -r requirements.txt

Step 4: download BraTS 2020 dataset using the Kaggle API.

```shell

Run these from local (non-VM) machine

export SEG01= scp -pr ~/.kaggle/ ubuntu@$SEG01:~/.kaggle/ ssh ubuntu@$SEG01 cd cs231n-3d-segmentation source venv/bin/activate kaggle datasets download -p data/brats2020-training-validation-data \ --unzip awsaf49/brats20-dataset-training-validation ```

TensorBoard

Local Training

Here is how you kick off TensorBoard:

shell tensorboard --logdir <path> --port 6006

Afterwards, go to the URL: http://localhost:6006/.

Remote Training

If training on a remote machine, make sure you expose port 6006 in the AWS security group:

IP version: IPv4
Type: Custom TCP
Protocol: TCP
Port range: 6006
Source: 0.0.0.0/0
Description: TensorBoard

First, start TensorBoard on the remote machine:

shell tensorboard --logdir <path> --host 0.0.0.0 --port 6006

Then on the local machine:

```shell export SEG01= python -m webbrowser http://$SEG01:6006/

or

open http://$SEG01:6006/ ```

Owner

Name: James Braza
Login: jamesbraza
Kind: user
Location: San Francisco Bay Area, CA, USA
Company: AI Research @Future-House

Website: jamesbraza.com
Repositories: 45
Profile: https://github.com/jamesbraza

Just a man eating spaghetti

Citation (CITATION.cff)

---
# This CITATION.cff file was generated with cffinit.
# Visit https://bit.ly/cffinit to generate yours today!

cff-version: 1.2.0
title: 'Neuro-Oncological Volumetric (3-D) Segmentation'
message: >-
  If you use this software, please cite it using the
  metadata from this file.
type: software
authors:
  - given-names: James
    family-names: Braza
    email: jamesbraza@stanford.edu
    orcid: 'https://orcid.org/0000-0002-1444-5313'
    affiliation: Stanford University School of Engineering
repository-code: 'https://github.com/jamesbraza/cs231n-3d-segmentation'
abstract: >-
  We experiment on existing 2-D and 3-D U-Net models to decrease model size
  and computation time, enabling these models to run on cheaper hardware and
  perform real-time inference.
keywords:
  - volumetric segmentation
  - 3-D segmentation
  - U-Net
  - BraTS
  - MRI analysis
license: MIT

GitHub Events

Total

Last Year

Issues and Pull Requests

Last synced: about 1 year ago

All Time

Total issues: 0
Total pull requests: 35
Average time to close issues: N/A
Average time to close pull requests: 1 minute
Total issue authors: 0
Total pull request authors: 2
Average comments per issue: 0
Average comments per pull request: 0.0
Merged pull requests: 35
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 0
Pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Issue authors: 0
Pull request authors: 0
Average comments per issue: 0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

cs231n-3d-segmentation

Science Score: 31.0%

Keywords

Repository

Basic Info

Statistics

Topics

Metadata Files

README.md

cs231n-3d-segmentation

Summary

3-D U-Net

2-D U-Net

Dataset

Development

AWS

Run these from local (non-VM) machine

TensorBoard

Local Training

Remote Training

or

Owner

Citation (CITATION.cff)

GitHub Events

Total

Last Year

Issues and Pull Requests

All Time

Past Year

Top Authors

Issue Authors

Pull Request Authors

Top Labels

Issue Labels

Pull Request Labels