aaai25_sgdn
[AAAI 25] Guided Real Image Dehazing using YCbCr Color Space
Science Score: 54.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
✓Academic publication links
Links to: arxiv.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (10.7%) to scientific vocabulary
Repository
[AAAI 25] Guided Real Image Dehazing using YCbCr Color Space
Basic Info
Statistics
- Stars: 38
- Watchers: 2
- Forks: 1
- Open Issues: 2
- Releases: 0
Metadata Files
README.md
Guided Real Image Dehazing using YCbCr Color Space (AAAI'2025)
Wenxuan Fang1 Junkai Fan1 Yu Zheng1 Jiangwei Weng1 Ying Tai2 Jun Li1 ✉️
1School of Computer Science and Engineering, Nanjing University of Science and Technology (NanJing)
2Intelligence science and technology, Nanjing University (SuZhou)
The Association for the Advancement of Artificial Intelligence (AAAI), February 25 – March 4, 2025 | Philadelphia, Pennsylvania, USA
Abstract
Image dehazing, particularly with learning-based methods, has gained significant attention due to its importance in real-world applications. However, relying solely on the RGB color space often fall short, frequently leaving residual haze. This arises from two main issues: the difficulty in obtaining clear textural features from hazy RGB images and the complexity of acquiring real haze/clean image pairs outside controlled environments like smoke-filled scenes. To address these issues, we first propose a novel Structure Guided Dehazing Network (SGDN) that leverages the superior structural properties of YCbCr features over RGB. SGDN comprises two key modules: Bi-Color Guidance Bridge (BGB) and Color Enhancement Module (CEM). BGB integrates a phase integration module and an interactive attention module, utilizing the rich texture features of the YCbCr space to guide the RGB space, thereby recovering clearer features in both frequency and spatial domains. To maintain tonal consistency, CEM further enhances the color perception of RGB features by aggregating YCbCr channel information. Furthermore, for effective supervised learning, we introduce the Real-World Well-Aligned Haze (RW$^2$AH) dataset, which includes a diverse range of scenes from various geographical regions and climate conditions. Experimental results demonstrate that our method surpasses existing state-of-the-art methods across multiple real-world smoke/haze datasets.
Overview
The overall pipeline of our SGDN. It includes the proposed Bi-Color Guidance Bridge (BGB) and Color Enhancement Module (CEM). BGB promotes RGB features to produce clearer textures through YCbCr color space in both frequency and spatial domain, while CEM significantly enhances the visual contrast of the images.
Real-World Well-Aligned Haze Dataset
The R2WD dataset can be downloaded from following link: Baidu Drive (sgdn)
To enable effective supervised learning, we collect a real-world haze dataset featuring multiple scenes and varying haze concentrations, named the Real-World Well-Aligned Haze (RW$^2$AH) dataset, with a total of 1758 image pairs. The RW$^2$AH dataset primarily records haze/clean images captured by stationary webcams from YouTube, with scenes including landscapes, vegetation, buildings and mountains.
Visual Comparisons
Real-world Smoke/Haze (click to expand)
Results
Real-world Smoke/Haze (click to expand)
Installation
:satisfied: Our SGDN is built in Pytorch=2.0.1, we train and test it on Ubuntu=20.04 environment (Python=3.8+, Cuda=11.6).
First, for tool initialization, please refer to BasicSR
Secondly, please follow these instructions:
conda create -n py38 python=3.8.16
conda activate py38
pip3 install torch torchvision torchaudio
pip3 install -r requirements.txt
Training and Test
Training our R2WD
CUDA_VISIBLE_DEVICES=xxxx python basicsr/train.py -opt options/train/SGDN/train_SGDN.yml --auto_resume
Training our R2WD
CUDA_VISIBLE_DEVICES=XXX python basicsr/test.py -opt options/test/SGDN/test_SGDN.yml --auto_resume
🎓 Citation
If you find the code helpful in your research or work, please cite the following paper(s).
bibtex
@inproceedings{fang2025guided,
title={Guided Real Image Dehazing using YCbCr Color Space},
author={Fang, Wenxuan and Fan, JunKai and Zheng, Yu and Weng, Jiangwei and Tai, Ying and Li, Jun},
booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
pages={xxxxx--xxxxx},
year={2025}
}
Contact
If you have any questions, please contact the email wenxuan_fang@njust.edu.cn
Acknowledgment: This code is based on the BasicSR toolbox.
Owner
- Name: wxFang
- Login: fiwy0527
- Kind: user
- Repositories: 1
- Profile: https://github.com/fiwy0527
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you use this project, please cite it as below."
title: "BasicSR: Open Source Image and Video Restoration Toolbox"
version: 1.3.5
date-released: 2022-02-16
url: "https://github.com/XPixelGroup/BasicSR"
license: Apache-2.0
authors:
- family-names: Wang
given-names: Xintao
- family-names: Xie
given-names: Liangbin
- family-names: Yu
given-names: Ke
- family-names: Chan
given-names: Kelvin C.K.
- family-names: Loy
given-names: Chen Change
- family-names: Dong
given-names: Chao
GitHub Events
Total
- Issues event: 5
- Watch event: 44
- Issue comment event: 11
- Push event: 8
- Fork event: 2
Last Year
- Issues event: 5
- Watch event: 44
- Issue comment event: 11
- Push event: 8
- Fork event: 2
Dependencies
- Pillow *
- addict *
- future *
- lmdb *
- numpy >=1.17
- opencv-python *
- pyyaml *
- requests *
- scikit-image *
- scipy *
- tb-nightly *
- torch >=1.7
- torchvision *
- tqdm *
- yapf *