seg-lstm

https://github.com/zhuqinfeng1999/seg-lstm

Science Score: 54.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓ CITATION.cff file: Found CITATION.cff file
✓ codemeta.json file: Found codemeta.json file
✓ .zenodo.json file: Found .zenodo.json file
○ DOI references
✓ Academic publication links: Links to arxiv.org
○ Academic email domains
○ Institutional organization owner
○ JOSS paper metadata
○ Scientific vocabulary similarity: Low similarity (9.6%) to scientific vocabulary

Last synced: 10 months ago

Repository

Basic Info

Host: GitHub
Owner: zhuqinfeng1999
License: apache-2.0
Language: Python
Default Branch: main
Size: 5.13 MB

Statistics

Stars: 14
Watchers: 1
Forks: 2
Open Issues: 2
Releases: 0

Created about 2 years ago · Last pushed almost 2 years ago

Metadata Files

Readme, Contributing, License, Code of conduct, Citation

Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images

This is the official code repository for "Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images". The paper is available on arXiv (arXiv:2406.14086).

Abstract

Recent advancements in autoregressive networks with linear complexity have driven significant research progress, demonstrating exceptional performance in large language models. A representative model is the Extended Long Short-Term Memory (xLSTM), which incorporates gating mechanisms and memory structures, performing comparably to Transformer architectures in long-sequence language tasks. Autoregressive networks such as xLSTM can utilize image serialization to extend their application to visual tasks such as classification and segmentation. Although existing studies have demonstrated Vision-LSTM’s impressive results in image classification, its performance in image semantic segmentation remains unverified. Our study represents the first attempt to evaluate the effectiveness of Vision-LSTM in the semantic segmentation of remotely sensed images. This evaluation is based on a specifically designed encoder-decoder architecture named Seg-LSTM, and comparisons with state-of-the-art segmentation networks. Our study found that Vision-LSTM's performance in semantic segmentation was limited and generally inferior to Vision-Transformers-based and Vision-Mamba-based models in most comparative tests. Future research directions for enhancing Vision-LSTM are recommended. The source code is available from https://github.com/zhuqinfeng1999/Seg-LSTM.

Installation

Requirements

Requirements: Ubuntu 20.04, CUDA 12.4

The experiments use the mmsegmentation framework, so set up an mmsegmentation environment first by following the instructions at https://github.com/open-mmlab/mmsegmentation.
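Once the environment is set up, it can help to confirm which versions of the core packages are actually installed. The following is a minimal sketch using only the standard library. The package list is an assumption about what a typical mmsegmentation setup contains, not a list taken from this repository's instructions.

```python
from importlib import metadata

# Assumed core packages of an mmsegmentation environment (illustrative list).
PACKAGES = ("torch", "mmengine", "mmcv", "mmsegmentation")


def installed_versions(names=PACKAGES):
    """Return {package name: installed version string, or None if missing}."""
    versions = {}
    for name in names:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


if __name__ == "__main__":
    for name, version in installed_versions().items():
        print(f"{name}: {version or 'NOT INSTALLED'}")
```

The reported versions can then be compared by hand against the ranges listed in requirements/mminstall.txt.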

LoveDA datasets

The LoveDA dataset is available at https://github.com/Junjue-Wang/LoveDA.
After downloading it, place the files under '/mmsegmentation/data/loveDA/' using the following layout:
/mmsegmentation/data/loveDA/
    ann_dir/
        train/
            *.png
        val/
            *.png
    img_dir/
        train/
            *.png
        val/
            *.png
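Before training, the layout above can be sanity-checked with a short script. This is a minimal sketch, not part of the repository: the function name `check_loveda_layout` is hypothetical, and it assumes that each image and its annotation share the same file name.

```python
from pathlib import Path


def check_loveda_layout(root):
    """Count PNGs per split and list images without a same-named annotation."""
    root = Path(root)
    report = {}
    for split in ("train", "val"):
        # A missing directory simply yields no files, so counts fall to zero.
        images = {p.name for p in (root / "img_dir" / split).glob("*.png")}
        labels = {p.name for p in (root / "ann_dir" / split).glob("*.png")}
        report[split] = {
            "images": len(images),
            "annotations": len(labels),
            "missing_annotations": sorted(images - labels),
        }
    return report


if __name__ == "__main__":
    for split, stats in check_loveda_layout("/mmsegmentation/data/loveDA").items():
        print(split, stats)
```

A count of zero for a split, or a non-empty `missing_annotations` list, suggests the files were unpacked into the wrong place.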

Model file and config file

The backbone model file zqf_seglstm.py is located in /mmsegmentation/mmseg/models/backbones/.
The config file zqfseglstm'decoder'.py, which combines the backbone with a decoder head ('decoder' stands for the name of the decoder head), is located in /mmsegmentation/configs/_base_/models/.
The training config files are located in /mmsegmentation/configs/zqf_seglstm/.

Training Seg-LSTM

bash tools/dist_train.sh 'configfile' 2 --work-dir /mmsegmentation/output/seglstm
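In the command above, the first argument to tools/dist_train.sh is the config file, the second is the number of GPUs, and the remaining arguments are forwarded to the training script. A minimal sketch of assembling the same call from Python follows. The helper name and the example config path are hypothetical.

```python
import shlex


def build_train_command(config, gpus, work_dir):
    """Build the dist_train.sh call: config file, GPU count, then extra args."""
    return [
        "bash", "tools/dist_train.sh",
        str(config),                  # training config file
        str(gpus),                    # number of GPUs to launch on
        "--work-dir", str(work_dir),  # where logs and checkpoints are written
    ]


# Hypothetical config path, used for illustration only.
cmd = build_train_command(
    "configs/zqf_seglstm/example_config.py", 2, "/mmsegmentation/output/seglstm"
)
print(shlex.join(cmd))
# To launch it from the mmsegmentation root:
#   import subprocess; subprocess.run(cmd, check=True, cwd="/mmsegmentation")
```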

Testing Seg-LSTM

bash tools/dist_test.sh 'configfile' /mmsegmentation/output/seglstm/iter_15000.pth 2 --out /mmsegmentation/visout/seglstm
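The test command takes the checkpoint path as its second argument. If the work directory holds several checkpoints, a small helper can pick the one with the highest iteration number. This is a sketch that assumes checkpoints follow the `iter_<N>.pth` naming seen in the command above. The function name is hypothetical.

```python
import re
from pathlib import Path

_ITER_CKPT = re.compile(r"iter_(\d+)\.pth$")


def latest_iter_checkpoint(work_dir):
    """Return the iter_<N>.pth path with the largest N, or None if none exist."""
    best, best_iter = None, -1
    for path in Path(work_dir).glob("iter_*.pth"):
        match = _ITER_CKPT.search(path.name)
        # Compare numerically: as strings, "iter_5000" sorts after "iter_15000".
        if match and int(match.group(1)) > best_iter:
            best, best_iter = path, int(match.group(1))
    return best


if __name__ == "__main__":
    print(latest_iter_checkpoint("/mmsegmentation/output/seglstm"))
```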

Citation

If you find this work useful in your research, please consider citing it:

```
@article{zhu2024seglstm,
  title={Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images},
  author={Qinfeng Zhu and Yuanzhi Cai and Lei Fan},
  year={2024},
  eprint={2406.14086},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}
```

License

This project is released under the Apache 2.0 license.

Acknowledgement

We thank the authors of the public datasets used in this work for making these valuable resources available to the research community. We also thank the authors of xLSTM, Vision-LSTM, and MMSegmentation for making their code publicly available.

Owner

Name: Qinfeng Zhu
Login: zhuqinfeng1999
Kind: user
Location: Suzhou, Jiangsu, China
Company: University of Liverpool

Website: https://zhuqinfeng1999.github.io/
Repositories: 4
Profile: https://github.com/zhuqinfeng1999

I am a Ph.D. student at Xi'an Jiaotong-Liverpool University and the University of Liverpool.

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
  - name: "MMSegmentation Contributors"
title: "OpenMMLab Semantic Segmentation Toolbox and Benchmark"
date-released: 2020-07-10
url: "https://github.com/open-mmlab/mmsegmentation"
license: Apache-2.0

GitHub Events

Total

Watch event: 3

Last Year

Watch event: 3

Dependencies

.github/workflows/deploy.yml actions

actions/checkout v3 composite
actions/setup-python v4 composite

.circleci/docker/Dockerfile docker

pytorch/pytorch ${PYTORCH}-cuda${CUDA}-cudnn${CUDNN}-devel build

docker/Dockerfile docker

pytorch/pytorch ${PYTORCH}-cuda${CUDA}-cudnn${CUDNN}-devel build

docker/serve/Dockerfile docker

pytorch/pytorch ${PYTORCH}-cuda${CUDA}-cudnn${CUDNN}-devel build

requirements/albu.txt pypi

albumentations >=0.3.2

requirements/docs.txt pypi

docutils ==0.16.0
myst-parser *
sphinx ==4.0.2
sphinx_copybutton *
sphinx_markdown_tables *
urllib3 <2.0.0

requirements/mminstall.txt pypi

mmcv >=2.0.0rc4,<2.2.0
mmengine >=0.5.0,<1.0.0

requirements/multimodal.txt pypi

ftfy *
regex *

requirements/optional.txt pypi

cityscapesscripts *
diffusers *
einops ==0.3.0
imageio ==2.9.0
imageio-ffmpeg ==0.4.2
invisible-watermark *
kornia ==0.6
nibabel *
omegaconf ==2.1.1
pudb ==2019.2
pytorch-lightning ==1.4.2
streamlit >=0.73.1
test-tube >=0.7.5
timm *
torch-fidelity ==0.3.0
torchmetrics ==0.6.0
transformers ==4.19.2

requirements/readthedocs.txt pypi

mmcv >=2.0.0rc1,<2.1.0
mmengine >=0.4.0,<1.0.0
prettytable *
scipy *
torch *
torchvision *

requirements/runtime.txt pypi

matplotlib *
numpy *
packaging *
prettytable *
scipy *

requirements/tests.txt pypi

codecov * test
flake8 * test
ftfy * test
interrogate * test
pytest * test
regex * test
xdoctest >=0.10.0 test
yapf * test

requirements.txt pypi

setup.py pypi
