Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (10.1%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: Tammytcl
- License: apache-2.0
- Language: Python
- Default Branch: main
- Size: 42.1 MB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
PSVT: Pyramid Shifted Window based Vision Transformer for Cardiac Image Segmentation
Abstract
Automating cardiac image analysis poses formidable challenges due to deformations during respiratory and cardiac cycles and the unique features of imaging modalities like CT and MRI. Transformer-based methods integrated with U-Net often miss essential local spatial details and incur high computational costs. To overcome these hurdles, we present the Pyramid Shifted-window-based Vision Transformer (PSVT), an innovative backbone for cardiac segmentation. Our model is meticulously designed to maintain a robust representation of both global and local features while operating on lower-dimensional inputs. By leveraging Continuous Position Bias in the Attention mechanism, we seamlessly integrate the attention module of Swin-Transformer-v2 with CNNs, yielding enhanced segmentation performance and computational efficiency. Additionally, we amplify receptive field understanding by incorporating Depth-Wise Convolution into the Feed-Forward module. Inspired by various adaptations of U-Net, we have restructured the Patch Merging and Patch Expanding modules, utilizing transposed convolution techniques to elegantly fuse multi-scale features into the classification head. Our experimental results compellingly demonstrate the superiority of PSVT, outperforming state-of-the-art cardiac segmentation models across three small-sample cardiac datasets: ACDC, MMWHS-CT, and LASC-2013.
We provide ckpts on huggingface https://huggingface.co/datasets/mytcl/pvst
Installation and Usage
- Install MMSegmentation following the official guide: https://mmsegmentation.readthedocs.io/zh-cn/latest/get_started.html, here we use the latest version of MMSegmentation and pytorch 2.1.2 with cuda12.1.
- Clone this repository and replace the configs and mmseg folder with the our folders.
- Train or evaluate the model with configs/unet/fcnpsvt256x25625k{dataset}.py following the official guide: https://mmsegmentation.readthedocs.io/zh-cn/latest/userguides/4traintest.html, where {dataset} can be acdc, mmwhsct, or lasc2013. If package is not installed, please install it with pip.
Owner
- Login: Tammytcl
- Kind: user
- Repositories: 1
- Profile: https://github.com/Tammytcl
Citation (CITATION.cff)
cff-version: 1.2.0 message: "If you use this software, please cite it as below." authors: - name: "MMSegmentation Contributors" title: "OpenMMLab Semantic Segmentation Toolbox and Benchmark" date-released: 2020-07-10 url: "https://github.com/open-mmlab/mmsegmentation" license: Apache-2.0
GitHub Events
Total
- Push event: 6
Last Year
- Push event: 6
Dependencies
- Brotli ==1.0.9
- Jinja2 ==3.1.4
- Markdown ==3.7
- MarkupSafe ==2.1.3
- PySocks ==1.7.1
- PyYAML ==6.0.1
- Pygments ==2.18.0
- Werkzeug ==3.0.4
- absl-py ==2.1.0
- addict ==2.4.0
- aliyun-python-sdk-core ==2.15.2
- aliyun-python-sdk-kms ==2.16.5
- cachetools ==5.5.0
- certifi ==2024.8.30
- cffi ==1.17.1
- charset-normalizer ==3.3.2
- click ==8.1.7
- colorama ==0.4.6
- contourpy ==1.1.1
- crcmod ==1.7
- cryptography ==43.0.1
- cycler ==0.12.1
- einops ==0.8.0
- filelock ==3.14.0
- fonttools ==4.53.1
- fsspec ==2024.9.0
- ftfy ==6.2.3
- future ==1.0.0
- fvcore ==0.1.5.post20221221
- gmpy2 ==2.1.2
- google-auth ==2.34.0
- google-auth-oauthlib ==0.4.6
- grpcio ==1.66.1
- huggingface-hub ==0.24.7
- idna ==3.7
- importlib_metadata ==8.5.0
- importlib_resources ==6.4.5
- iopath ==0.1.10
- jmespath ==0.10.0
- kiwisolver ==1.4.7
- markdown-it-py ==3.0.0
- matplotlib ==3.7.5
- mdurl ==0.1.2
- mkl-fft ==1.3.8
- mkl-random ==1.2.4
- mkl-service ==2.4.0
- mmcv ==2.1.0
- mmengine ==0.10.4
- mmsegmentation ==1.2.2
- model-index ==0.1.11
- mpmath ==1.3.0
- networkx ==3.1
- numpy ==1.24.3
- oauthlib ==3.2.2
- opencv-python ==4.10.0.84
- opendatalab ==0.0.10
- openmim ==0.3.9
- openxlab ==0.1.1
- ordered-set ==4.1.0
- oss2 ==2.17.0
- packaging ==24.1
- pandas ==2.0.3
- pillow ==10.4.0
- pip ==24.2
- platformdirs ==4.3.2
- portalocker ==2.10.1
- prettytable ==3.11.0
- protobuf ==5.28.1
- pyasn1 ==0.6.1
- pyasn1_modules ==0.4.1
- pycparser ==2.22
- pycryptodome ==3.20.0
- pyparsing ==3.1.4
- python-dateutil ==2.9.0.post0
- pytz ==2023.4
- regex ==2024.9.11
- requests ==2.28.2
- requests-oauthlib ==2.0.0
- rich ==13.4.2
- rsa ==4.9
- safetensors ==0.4.5
- scipy ==1.10.1
- setuptools ==60.2.0
- six ==1.16.0
- sympy ==1.13.2
- tabulate ==0.9.0
- tensorboard ==2.12.0
- tensorboard-data-server ==0.7.2
- tensorboard-plugin-wit ==1.8.1
- tensorboardX ==2.6.2.2
- termcolor ==2.4.0
- timm ==1.0.9
- tomli ==2.0.1
- torch ==2.1.2
- torchaudio ==2.1.2
- torchvision ==0.16.2
- tqdm ==4.65.2
- triton ==2.1.0
- typing_extensions ==4.11.0
- tzdata ==2024.1
- urllib3 ==1.26.20
- wcwidth ==0.2.13
- wheel ==0.43.0
- yacs ==0.1.8
- yapf ==0.40.2
- zipp ==3.20.1
- actions/checkout v3 composite
- actions/setup-python v4 composite
- pytorch/pytorch ${PYTORCH}-cuda${CUDA}-cudnn${CUDNN}-devel build
- pytorch/pytorch ${PYTORCH}-cuda${CUDA}-cudnn${CUDNN}-devel build
- pytorch/pytorch ${PYTORCH}-cuda${CUDA}-cudnn${CUDNN}-devel build
- albumentations >=0.3.2
- docutils ==0.16.0
- myst-parser *
- sphinx ==4.0.2
- sphinx_copybutton *
- sphinx_markdown_tables *
- urllib3 <2.0.0
- mmcv >=2.0.0rc4,<2.2.0
- mmengine >=0.5.0,<1.0.0
- ftfy *
- regex *
- cityscapesscripts *
- diffusers *
- einops ==0.3.0
- imageio ==2.9.0
- imageio-ffmpeg ==0.4.2
- invisible-watermark *
- kornia ==0.6
- nibabel *
- omegaconf ==2.1.1
- pudb ==2019.2
- pytorch-lightning ==1.4.2
- streamlit >=0.73.1
- test-tube >=0.7.5
- timm *
- torch-fidelity ==0.3.0
- torchmetrics ==0.6.0
- transformers ==4.19.2
- mmcv >=2.0.0rc1,<2.1.0
- mmengine >=0.4.0,<1.0.0
- prettytable *
- scipy *
- torch *
- torchvision *
- matplotlib *
- numpy *
- packaging *
- prettytable *
- scipy *
- codecov * test
- flake8 * test
- ftfy * test
- interrogate * test
- pytest * test
- regex * test
- xdoctest >=0.10.0 test
- yapf * test