dt_segnet
A comprehensive, two-tiered deep learning approach designed for precise object detection and segmentation in electron microscopy (EM) images.
Science Score: 67.0%
This score indicates how likely this project is to be science-related, based on the following indicators:
- ✓ CITATION.cff file: found
- ✓ codemeta.json file: found
- ✓ .zenodo.json file: found
- ✓ DOI references: found 5 DOI reference(s) in README
- ✓ Academic publication links: links to rsc.org
- ○ Academic email domains: not found
- ○ Institutional organization owner: not found
- ○ JOSS paper metadata: not found
- ○ Scientific vocabulary similarity: low similarity (13.4%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: xiazeyu
- License: MIT
- Language: Python
- Default Branch: main
- Homepage: https://doi.org/10.1039/d3cp00402c
- Size: 16.7 MB
Statistics
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
- Releases: 5
Metadata Files
README.md
Accurate identification and measurement of the precipitate area by two-stage deep neural networks in novel chromium-based alloys
About The Project
The performance of advanced materials for extreme environments is underpinned by their microstructure, such as the size and distribution of nano- to micro-sized reinforcing phase(s). Chromium-based superalloys are a recently proposed alternative to conventional face-centred-cubic superalloys for high-temperature applications, e.g., Concentrated Solar Power. Their development requires the determination of precipitate volume fraction and size distribution using Electron Microscopy (EM), as these properties are crucial for the thermal stability and mechanical properties of chromium superalloys. Traditional approaches to EM image processing utilise filtering with a fixed contrast threshold, leading to weak robustness to background noise and poor generalisability to different materials; they also require an enormous amount of time for manual object measurements. Efficient and accurate object detection and segmentation are therefore highly desirable to accelerate the development of novel materials like chromium-based superalloys. To address these bottlenecks, this study proposes an end-to-end, two-stage deep learning scheme, DT-SegNet, built on the YOLOv5 and SegFormer architectures, to perform object detection and segmentation for EM images. The approach thus benefits from the training efficiency of Convolutional Neural Networks at the detection stage (i.e., only a small number of training images is required) and the accuracy of the Vision Transformer at the segmentation stage. Extensive numerical experiments demonstrate that DT-SegNet significantly outperforms the state-of-the-art segmentation tools offered by Weka and ilastik on a large number of metrics, including accuracy, precision, recall and F1-score. This model will be a meaningful tool for accelerating alloy development and microstructure examination.
Getting Started
Quick Start
You can use Inference_Colab.ipynb (Colab link) to perform online inference, or follow the instructions in Finetune_Colab.ipynb (Colab link) to perform online fine-tuning.
Hardware requirement
- Operating System: Windows or Linux
- Platform: AutoDL / Google Colab Pro
- GPU: NVIDIA RTX A5000
- Google Drive space: 10GB
Software requirement
- Programming language: Python (3.8 or higher)
- Package Management: Anaconda (Miniconda recommended)
- Machine Learning Framework: PyTorch and PaddlePaddle
The Anaconda environment for labelling on Windows is provided in dtsegnet.yaml. The environment can be restored by executing `conda create --name dtsegnet --file dtsegnet.yaml` in the console.
Two machine learning frameworks need to be installed following the tutorials on their websites. The necessary environment for training and inferring is stored as a pip requirement file in 1_Detection_Model/requirements.txt and 3_Segmentation_Model/requirements.txt.
All the requirements for training and inferring will be installed in the 0_Prepare.ipynb notebook.
Dataset
All data for this project are stored in the Dataset/ folder. All images are numbered for the DT-SegNet pipeline, and the data mapping is stored in dataset_mapping.csv.
The dataset contains the original images, segmentation labels and detection labels. Detection labels can be used directly by the detection network, but the segmentation labels need to be cropped, using the code provided in the notebook, before being passed to the segmentation network.
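The notebook's cropping code is not reproduced in this README; the idea can be sketched as follows. This is a minimal sketch assuming YOLO-normalised detection labels of the form `class cx cy w h`; the function name `crop_segmentation_rois` and the `pad` parameter are illustrative, not the project's actual API.

```python
import numpy as np

def crop_segmentation_rois(image, seg_label, yolo_lines, pad=0):
    """Crop each detected box out of the image and its segmentation label,
    yielding (image_roi, label_roi) pairs for the segmentation network.
    yolo_lines are YOLO-format strings "cls cx cy w h" with coordinates
    normalised to [0, 1]."""
    h, w = image.shape[:2]
    rois = []
    for line in yolo_lines:
        _, cx, cy, bw, bh = map(float, line.split())
        # Convert the normalised centre/size box to pixel corners,
        # clamped to the image bounds.
        x0 = max(int((cx - bw / 2) * w) - pad, 0)
        y0 = max(int((cy - bh / 2) * h) - pad, 0)
        x1 = min(int((cx + bw / 2) * w) + pad, w)
        y1 = min(int((cy + bh / 2) * h) + pad, h)
        rois.append((image[y0:y1, x0:x1], seg_label[y0:y1, x0:x1]))
    return rois
```

For example, a box `0 0.5 0.5 0.2 0.2` on a 100×100 image yields a 20×20 crop centred on the image.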
The detection dataset is separated into three sets: test, train, and val.
The segmentation annotation is stored in Dataset/segmentation_labels/. The Regions of Interest for the segmentation network with their annotations will be generated before the segmentation stage.
Implementation
0 Prepare
Follow the cells in 0_Prepare.ipynb to prepare required environments.
0 Label the dataset
0_Labelling_Tools/ contains the scripts used to label the dataset. Follow these steps:

1. Execute `conda create --name dtsegnet-label --file 0_Labelling_Tools/dtsegnet-label.yaml` to import the Anaconda environment for labelling.
2. Execute `conda activate dtsegnet-label` to activate the labelling environment.
3. Download the model for EISeg labelling from https://paddleseg.bj.bcebos.com/eiseg/0.5/statichrnet18socr48_aluminium.zip.
4. Execute `python 0_Labelling_Tools/0_EISeg/exe.py` to start labelling for the segmentation stage.
5. Execute `python 0_Labelling_Tools/3_gray2pseudo_color.py <Dataset/label>` to convert the grey-scale segmentation labels to pseudo-colour annotation images. `<Dataset/label>` should be replaced by the path of the segmentation annotations generated by EISeg.
6. Execute `python 0_Labelling_Tools/1_Segmentation_Label_Flood_Fill.py <Dataset/label> <Dataset/Detection_Label>` to generate the detection labels from the segmentation labels. `<Dataset/label>` should be replaced by the path of the segmentation annotations generated by EISeg, which contains files like `1.png`, `1_cutout.png` and `1_pseudo.png`. `<Dataset/Detection_Label>` should be replaced by the output folder for the detection labels.
7. Execute `python 0_Labelling_Tools/2_labelImg/labelImg.py` to fine-tune the detection labels.
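The flood-fill step derives detection boxes from the segmentation masks. Conceptually (a hedged sketch, not the actual `1_Segmentation_Label_Flood_Fill.py` code; `seg_to_yolo_boxes` is an illustrative name), each foreground region is flood-filled and its bounding box is written out as a YOLO-format line:

```python
import numpy as np
from collections import deque

def seg_to_yolo_boxes(mask):
    """Flood-fill each foreground region of a binary segmentation mask and
    emit a YOLO-format detection line ("class cx cy w h", all normalised
    to [0, 1]) for its bounding box."""
    h, w = mask.shape
    seen = np.zeros(mask.shape, dtype=bool)
    lines = []
    for sy in range(h):
        for sx in range(w):
            if mask[sy, sx] and not seen[sy, sx]:
                # BFS flood fill from this seed pixel, tracking the
                # region's bounding box as we go.
                q = deque([(sy, sx)])
                seen[sy, sx] = True
                y0 = y1 = sy
                x0 = x1 = sx
                while q:
                    y, x = q.popleft()
                    y0, y1 = min(y0, y), max(y1, y)
                    x0, x1 = min(x0, x), max(x1, x)
                    for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                        if 0 <= ny < h and 0 <= nx < w and mask[ny, nx] and not seen[ny, nx]:
                            seen[ny, nx] = True
                            q.append((ny, nx))
                # Box centre and size, normalised by image dimensions.
                cx = (x0 + x1 + 1) / 2 / w
                cy = (y0 + y1 + 1) / 2 / h
                lines.append(f"0 {cx:.4f} {cy:.4f} {(x1 - x0 + 1) / w:.4f} {(y1 - y0 + 1) / h:.4f}")
    return lines
```

A single rectangular blob therefore yields a single box whose centre and extent match the blob's bounding rectangle.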
1 Train the model
Follow the cells in 1_Train.ipynb to train the detection model. The trained detection model will be stored in <Google Drive>/DT-SegNet/Detection_Model_Output. The notebook will also automatically generate the dataset for the segmentation network; the generated segmentation dataset will be compressed and stored in <Google Drive>/DT-SegNet/Segmentation_Dataset.zip. The trained segmentation model will be stored in <Google Drive>/DT-SegNet/Segmentation_Model_Output.
2 Inference
Follow the cells in 2_Inference.ipynb to infer using DT-SegNet. The output from the detection model will be stored in <Google Drive>/DT-SegNet/Detection_Output. The output from the segmentation model will be held in <Google Drive>/DT-SegNet/Segmentation_Output. The original-size segmentation mask will be stored in <Google Drive>/DT-SegNet/Output.
3 Validation
Follow the cells in 3_Validation.ipynb to validate the trained models.
4 Analysis
Follow the cells in 4_Analysis.ipynb to analyse the trained models. To compare software and algorithms, we performed experiments with Weka trainable segmentation and ilastik pixel classification. The notebook uses our output in Output/.
Main Models
Our best-trained models are stored on the GitHub release page.
Machine-friendly download links are also provided for automatic scripts:
https://github.com/xiazeyu/DT_SegNet/releases/latest/download/detection.pt
https://github.com/xiazeyu/DT_SegNet/releases/latest/download/segmentation.pdparams
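For automatic scripts, fetching both weight files can be sketched with the standard library (an illustrative helper, not part of the repository; `download_weights` and `WEIGHT_FILES` are assumed names):

```python
from urllib.request import urlretrieve

RELEASE_BASE = "https://github.com/xiazeyu/DT_SegNet/releases/latest/download"
WEIGHT_FILES = {
    "detection": "detection.pt",              # YOLOv5 detection weights
    "segmentation": "segmentation.pdparams",  # SegFormer segmentation weights
}

def download_weights(dest_dir="."):
    """Download both released weight files into dest_dir and return their paths."""
    paths = []
    for fname in WEIGHT_FILES.values():
        path = f"{dest_dir}/{fname}"
        urlretrieve(f"{RELEASE_BASE}/{fname}", path)
        paths.append(path)
    return paths
```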
Output
The Output/ folder holds this project's output images and NumPy metrics, including results from DT-SegNet, Weka and ilastik. Each experiment has output in two formats: a .png image and a .npy NumPy matrix. Each file in the folder is named following the pattern <test id>_<software used>_<algorithm>.
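Parsing that naming convention back into its parts is straightforward (a small illustrative helper, assuming the test id and software name themselves contain no underscores):

```python
from pathlib import Path

def parse_output_name(path):
    """Split an Output/ file name of the form
    <test id>_<software used>_<algorithm>.<ext> into its parts."""
    p = Path(path)
    # Split on the first two underscores; anything after them is the algorithm.
    test_id, software, algorithm = p.stem.split("_", 2)
    return {"test_id": test_id, "software": software,
            "algorithm": algorithm, "format": p.suffix}
```

For example, `Output/3_weka_FRF.npy` parses to test id `3`, software `weka`, algorithm `FRF`, format `.npy`.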
| Software used | Algorithm | Remark |
| :-----------: | :-------: | :----: |
| groundtruth | groundtruth | Manually annotated |
| ilastik | LDA | ilastik LDA (scikit-learn) |
| ilastik | RF | ilastik Random Forest (scikit-learn) |
| ilastik | SVC | ilastik SVM C-Support (scikit-learn) |
| weka | FRF | Weka hr/irb/fastRandomForest/FastRandomForest |
| weka | MLP | weka/classifiers/functions/MultilayerPerceptron with trainingTime=100 and validationSetSize=20 |
| PaddleSeg | unet | U-Net |
| PaddleSeg | unet3plus | UNet 3+ |
| PaddleSeg | deeplabv3presnet50 | DeepLabV3+ with ResNet-50 backbone |
| PaddleSeg | B0 | SegFormer B0 |
| PaddleSeg | B1 | SegFormer B1 |
| dtsegnet | DT-SegNet | DT-SegNet, with overlapping ROIs joined |
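The last row notes that DT-SegNet's final output joins overlapping ROIs. A minimal sketch of such a merge step (assumed behaviour for illustration; `join_roi_masks` is not the repository's function):

```python
import numpy as np

def join_roi_masks(image_shape, roi_masks):
    """Paste per-ROI binary masks back onto a full-size canvas.
    Overlapping regions are merged with a pixel-wise maximum, so a
    precipitate split across neighbouring ROIs ends up as one object.
    roi_masks: list of ((y0, x0), mask) pairs giving each ROI's
    top-left corner in the original image."""
    full = np.zeros(image_shape, dtype=np.uint8)
    for (y0, x0), m in roi_masks:
        h, w = m.shape
        full[y0:y0 + h, x0:x0 + w] = np.maximum(full[y0:y0 + h, x0:x0 + w], m)
    return full
```

Using a maximum (rather than overwriting) means a pixel marked as precipitate by any ROI stays marked in the joined, original-size mask.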
Reference
```bibtex
@article{xia2023Accurate,
  author    = {Zeyu Xia and Kan Ma and Sibo Cheng and Thomas Blackburn and Ziling Peng and Kewei Zhu and Weihang Zhang and Dunhui Xiao and Alexander J Knowles and Rossella Arcucci},
  copyright = {CC BY-NC 3.0},
  doi       = {10.1039/d3cp00402c},
  issn      = {1463-9076},
  journal   = {Physical Chemistry Chemical Physics},
  language  = {English},
  month     = {6},
  number    = {23},
  pages     = {15970--15987},
  pmid      = {37265373},
  publisher = {Royal Society of Chemistry (RSC)},
  title     = {Accurate Identification and Measurement of the Precipitate Area by Two-Stage Deep Neural Networks in Novel Chromium-Based Alloy},
  url       = {https://pubs.rsc.org/en/content/articlelanding/2023/CP/D3CP00402C},
  volume    = {25},
  year      = {2023}
}
```
License
MIT License. See LICENSE for more information.
Contact
Zeyu Xia - zeyu.xia@connect.qut.edu.au
Kan Ma - arnaud.masysu@gmail.com
Sibo Cheng - sibo.cheng@imperial.ac.uk
Owner
- Name: KokomiCat
- Login: xiazeyu
- Kind: user
- Location: Brisbane, Queensland, Australia
- Website: https://xiaze.pro/
- Twitter: kokomiCat
- Repositories: 30
- Profile: https://github.com/xiazeyu
Innovating at the Edge of Science and Technology
Citation (CITATION.cff)
cff-version: 1.2.0
message: If you use this software, please cite it as below.
authors:
  - family-names: Xia
    given-names: Zeyu
    orcid: https://orcid.org/0000-0003-0234-5857
  - family-names: Ma
    given-names: Kan
    orcid: https://orcid.org/0000-0001-5729-5477
  - family-names: Cheng
    given-names: Sibo
    orcid: https://orcid.org/0000-0002-8707-2589
  - family-names: Blackburn
    given-names: Thomas
    orcid: https://orcid.org/0000-0001-9160-3285
  - family-names: Peng
    given-names: Ziling
  - family-names: Zhu
    given-names: Kewei
  - family-names: Zhang
    given-names: Weihang
  - family-names: Xiao
    given-names: Dunhui
  - family-names: Knowles
    given-names: Alexander J
  - family-names: Arcucci
    given-names: Rossella
title: DT_SegNet
identifiers:
  - type: doi
    value: 10.1039/d3cp00402c
version: v1.1.2
date-released: 2023-01-06
GitHub Events
Total
Last Year
Issues and Pull Requests
Last synced: about 1 year ago
All Time
- Total issues: 4
- Total pull requests: 1
- Average time to close issues: 2 months
- Average time to close pull requests: less than a minute
- Total issue authors: 1
- Total pull request authors: 1
- Average comments per issue: 0.75
- Average comments per pull request: 0.0
- Merged pull requests: 1
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 1
- Average time to close issues: N/A
- Average time to close pull requests: less than a minute
- Issue authors: 0
- Pull request authors: 1
- Average comments per issue: 0
- Average comments per pull request: 0.0
- Merged pull requests: 1
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- xiazeyu (4)
Pull Request Authors
- xiazeyu (1)
Top Labels
Issue Labels
Pull Request Labels
Dependencies
- albumentations *
- cython *
- easydict *
- opencv-python *
- paddleseg *
- protobuf ==3.20.0
- pyqt5 *
- pyyaml *
- qtpy *
- requests *
- scikit-image *
- scipy *
- wget *
- lxml ==4.9.1
- pyqt5 ==5.14.1
- Pillow >=7.1.2
- PyYAML >=5.3.1
- ipython *
- matplotlib >=3.2.2
- numpy >=1.18.5
- opencv-python >=4.1.1
- pandas >=1.1.4
- psutil *
- pycocotools >=2.0
- requests >=2.23.0
- scipy >=1.4.1
- seaborn >=0.11.0
- tensorboard >=2.4.1
- thop >=0.1.1
- torch >=1.7.0
- torchvision >=0.8.1
- tqdm *
- filelock *
- opencv-python *
- prettytable *
- pyyaml >=5.1
- scipy *
- sklearn ==0.0
- tqdm *
- visualdl >=2.2.0