multiresolution-htc

Multiresolution Learning-based Hybrid Transformer-CNN Model for Anatomical Landmark Detection

https://github.com/seriee/multiresolution-htc

Science Score: 57.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓
CITATION.cff file
Found CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
✓
DOI references
Found 4 DOI reference(s) in README
○
Academic publication links
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (9.8%) to scientific vocabulary

Last synced: 9 months ago · JSON representation ·

Repository

Multiresolution Learning-based Hybrid Transformer-CNN Model for Anatomical Landmark Detection

Basic Info

Host: GitHub
Owner: seriee
Language: Python
Default Branch: main
Size: 10.6 MB

Statistics

Stars: 10
Watchers: 1
Forks: 1
Open Issues: 4
Releases: 0

Created about 3 years ago · Last pushed over 2 years ago

Metadata Files

Readme Contributing Code of conduct Citation

Anatomical Landmark Detection Using a Multiresolution Learning Approach with a Hybrid Transformer-CNN Model (MICCAI 2023)

This is the official pytorch implementation repository of our Anatomical Landmark Detection Using a Multiresolution Learning Approach with a Hybrid Transformer-CNN Model framework - Link to paper: https://doi.org/10.1007/978-3-031-43987-2_42 - Link to github: https://github.com/seriee/Multiresolution-HTC.git

Abstract

Accurate localization of anatomical landmarks has a critical role in clinical diagnosis, treatment planning, and research. Most existing deep learning methods for anatomical landmark localization rely on heatmap regression-based learning, which generates label representations as 2D Gaussian distributions centered at the labeled coordinates of each of the landmarks and integrates them into a single spatial resolution heatmap. However, the accuracy of this method is limited by the resolution of the heatmap, which restricts its ability to capture finer details. In this study, we introduce a multiresolution heatmap learning strategy that enables the network to capture semantic feature representations precisely using multiresolution heatmaps generated from the feature representations at each resolution independently, resulting in improved localization accuracy. Moreover, we propose a novel network architecture called hybrid transformer-CNN (HTC), which combines the strengths of both CNN and vision transformer models to improve the network's ability to effectively extract both local and global representations. Extensive experiments demonstrated that our approach outperforms state-of-the-art deep learning-based anatomical landmark localization networks on the numerical XCAT 2D projection images and two public X-ray landmark detection benchmark datasets.

Hybrid Transformer-CNN (HTC) with Multiresolution Learning

Hybrid Transformer-CNN (HTC) Architecture

Dataset

We have used the following datasets:
- 4D XCAT Head CBCT dataset: Segars, W.P., Sturgeon, G., Mendonca, S., Grimes, J., Tsui, B.M.: 4d xcat phantom for multimodality imaging research. Medical physics 37(9), 4902–4915 (2010)
- ISBI2023 challenge dataset: Anwaar Khalid, M., Zulfiqar, K., Bashir, U., Shaheen, A., Iqbal, R., Rizwan, Z., Rizwan, G., Moazam Fraz, M.: Cepha29: Automatic cephalometric landmark detection challenge 2023. arXiv e-prints pp. arXiv–2212 (2022)
- Hand X-ray dataset: Payer, C., ˇStern, D., Bischof, H., Urschler, M.: Integrating spatial configuration into heatmap regression based CNNs for landmark localization. Medical Image Analysis 54, 207–219 (2019)

Results

Prerequesites

Python 3.7
MMpose 0.23

Usage of the code

Dataset preparation
- The dataset structure should be in the following structure:

Inputs: .PNG images and JSON file └── <dataset name> ├── 2D_images | ├── 001.png │ ├── 002.png │ ├── 003.png │ ├── ... | └── JSON ├── train.json └── test.json - Example json format { "images": [ { "id": 0, "file_name": "0.png", "height": 420, "width": 620 }, ... ], "annotations": [ { "image_id": 0, "id": 0, "category_id": 1, "keypoints": [ 604.5070198755171, 289.1590783982888, 2, 592.8121081534473, 261.62600827462876, 2, 428.0154934462112, 301.24809471563935, 2, 604.9223114040644, 234.45993184950234, 2, 570.296873380625, 182.90429052972533, 2, 456.97751121306436, 208.8105499707776, 2, 369.95414168150415, 239.07609878665616, 2, 307.83364934785106, 229.91052362204155, 2, 373.5995213621739, 353.599939601835, 2, 499.50552505239256, 453.1111418891231, 2, 493.50543334239256, 456.12341418891231, 2 ], "num_keypoints": 11, "iscrowd": 0 }, ... ]

Output: 2D landmark coordinates
- Train the model
To train our HTC model with a multiresolution learning approach, run sh train.sh: ```

sh train.sh

CUDAVISIBLEDEVICES=gpuids PORT=PORTNUM ./tools/disttrain.sh \ configfilepath numgpus ```
- Evaluation
To evaluate the trained HTC model, run sh test.sh: ```

sh test.sh

CUDAVISIBLEDEVICES=gpuid PORT=29504 ./tools/disttest.sh configfilepath \ modelweightpath numgpus \ # For evaluation of the Head XCAT dataset, use: --eval 'MREh','MREstdh','SDR2h','SDR2.5h','SDR3h','SDR4h' # For evaluation of ISBI2023 and Hand X-ray dataset, use: # --eval 'MREi2','MREstdi2','SDR2i2','SDR2.5i2','SDR3i2','SDR4_i2' ```

Author

Thanaporn Viriyasaranon, Serie Ma, and Jang-Hwan Choi

Citation

Viriyasaranon, T., Ma, S., Choi, JH. (2023). Anatomical Landmark Detection Using a Multiresolution Learning Approach with a Hybrid Transformer-CNN Model. In: Greenspan, H., et al. Medical Image Computing and Computer Assisted Intervention – MICCAI 2023. MICCAI 2023. Lecture Notes in Computer Science, vol 14225. Springer, Cham. https://doi.org/10.1007/978-3-031-43987-2_42

Owner

Login: seriee
Kind: user

Repositories: 1
Profile: https://github.com/seriee

Citation (CITATION.cff)

message: "(Citaion will be upated) If you use this code, please cite it as below."
authors:
  - name: "Thanaporn Viriyasaranon, Serie Ma, and Jang-Hwan Choi"
title: "Anatomical Landmark Detection Using a Multiresolution Learning Approach with a Hybrid Transformer-CNN Model"
date-released: 2020-05-30
url: "https://github.com/seriee/Multiresolution-Learning-based-Hybrid-Transformer-CNN-Model-for-Anatomical-Landmark-Detection"

GitHub Events

Total

Watch event: 3

Last Year

Watch event: 3

Dependencies

.github/workflows/build.yml actions

actions/checkout v2 composite
actions/setup-python v2 composite
codecov/codecov-action v2 composite

.github/workflows/deploy.yml actions

actions/checkout v2 composite
actions/setup-python v2 composite

.github/workflows/lint.yml actions

actions/checkout v2 composite
actions/setup-python v2 composite

docker/Dockerfile docker

pytorch/pytorch ${PYTORCH}-cuda${CUDA}-cudnn${CUDNN}-devel build

docker/serve/Dockerfile docker

pytorch/pytorch ${PYTORCH}-cuda${CUDA}-cudnn${CUDNN}-devel build

requirements/build.txt pypi

numpy *
torch >=1.3

requirements/docs.txt pypi

docutils ==0.16.0
myst-parser *
sphinx ==4.0.2
sphinx_copybutton *
sphinx_markdown_tables *

requirements/mminstall.txt pypi

mmcv-full >=1.3.8
mmdet >=2.14.0
mmtrack >=0.6.0

requirements/optional.txt pypi

albumentations >=0.3.2
onnx *
onnxruntime *
pyrender *
requests *
smplx >=0.1.28
trimesh *

requirements/readthedocs.txt pypi

mmcv-full *
munkres *
regex *
scipy *
titlecase *
torch *
torchvision *
xtcocotools >=1.8

requirements/runtime.txt pypi

chumpy *
dataclasses *
json_tricks *
matplotlib *
munkres *
numpy *
opencv-python *
pillow *
scipy *
torchvision *
xtcocotools >=1.8

requirements/tests.txt pypi

coverage * test
flake8 * test
interrogate * test
isort ==4.3.21 test
pytest * test
pytest-runner * test
smplx >=0.1.28 test
xdoctest >=0.10.0 test
yapf * test

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science