https://github.com/cyberagentailab/webcolor

Official implementation of Generative Colorization of Structured Mobile Web Pages, WACV 2023.

https://github.com/cyberagentailab/webcolor

Science Score: 23.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 1 DOI reference(s) in README
  • Academic publication links
    Links to: arxiv.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (11.7%) to scientific vocabulary

Keywords

colorization dataset deep-learning generative-ai graphical-user-interface pretrained-models pytorch pytorch-lightning web-design
Last synced: 7 months ago · JSON representation

Repository

Official implementation of Generative Colorization of Structured Mobile Web Pages, WACV 2023.

Basic Info
Statistics
  • Stars: 17
  • Watchers: 5
  • Forks: 2
  • Open Issues: 1
  • Releases: 0
Topics
colorization dataset deep-learning generative-ai graphical-user-interface pretrained-models pytorch pytorch-lightning web-design
Created over 3 years ago · Last pushed over 2 years ago
Metadata Files
Readme License

README.md

Generative Colorization of Structured Mobile Web Pages

Official implementation of Generative Colorization of Structured Mobile Web Pages, WACV 2023.

ArXiv | Dataset | Pre-trained models

Setup

Development environment

  • Ubuntu 22.04, Python 3.10.9, Poetry 1.2.2
  • CUDA 11.6, cuDNN 8.7.0
  • PyTorch 1.12.1, PyTorch Lightning 1.8.6, Deep Graph Library 0.9.1
  • (For taking screenshots) Google Chrome 108.0.5359.124, ChromeDriver 108.0.5359.71
  • (For computing contrast violations) Lighthouse 9.6.8

Installation

bash git clone https://github.com/CyberAgentAILab/webcolor.git poetry install

Note that we cannot guarantee or support operation in other environments, such as Windows. If you wish to install PyTorch or DGL for other CUDA versions, please edit URLs in pyproject.toml. You can find the commands to install Chrome, ChromeDriver, and Lighthouse on Ubuntu here.

Data preparation

bash ./data/download.sh cache

For details on the dataset, please see this document.

Colorization demo

bash MODEL_NAME=CVAE # {CVAE,NAR,AR,Stats} BASE_URL=https://storage.googleapis.com/ailab-public/webcolor/checkpoints poetry run python demo.py --model $MODEL_NAME --ckpt_path ${BASE_URL}/${MODEL_NAME}.ckpt --upsampler_path ${BASE_URL}/Upsampler.ckpt --target random --out_path output/screenshot.png --num_save 3 --save_gt

The above command performs automatic colorization using pre-trained models and produces screenshots like the following.

|CVAE #1|CVAE #2|CVAE #3|Real| |:---:|:---:|:---:|:---:| |||||

Training

bash MODEL_NAME=CVAE # {CVAE,NAR,AR,Stats,Upsampler} poetry run python -m webcolor.main fit --model $MODEL_NAME --trainer.accelerator gpu --trainer.devices 1

Model hyperparameters can be listed with --model.help $MODEL_NAME.

Evaluation

```bash MODELNAME=CVAE # {CVAE,NAR,AR,Stats,Upsampler} CKPTPATH=https://storage.googleapis.com/ailab-public/webcolor/checkpoints/${MODEL_NAME}.ckpt # Evaluate the pre-trained model

CKPTPATH=lightninglogs/version_0/checkpoints/best.ckpt # Evaluate your own trained model

poetry run python -m webcolor.main test --model $MODELNAME --ckptpath $CKPTPATH --trainer.defaultroot_dir /tmp --trainer.accelerator gpu --trainer.devices 1 ```

The following command calculates Pixel-FCD and contrast violations and takes a long time to complete (about four hours with 24 workers in our environment).

```bash MODELNAME=CVAE # {CVAE,NAR,AR,Stats} CKPTPATH=https://storage.googleapis.com/ailab-public/webcolor/checkpoints/${MODEL_NAME}.ckpt

CKPTPATH=lightninglogs/version_0/checkpoints/best.ckpt

UPSAMPLER_PATH=https://storage.googleapis.com/ailab-public/webcolor/checkpoints/Upsampler.ckpt

UPSAMPLERPATH=lightninglogs/version_1/checkpoints/best.ckpt

poetry run python eval.py --numworkers 4 --model $MODELNAME --ckptpath $CKPTPATH --upsamplerpath $UPSAMPLERPATH ```

For details on the pre-trained models, please see this document.

Citation

bibtex @inproceedings{Kikuchi2023, title = {Generative Colorization of Structured Mobile Web Pages}, author = {Kotaro Kikuchi and Naoto Inoue and Mayu Otani and Edgar Simo-Serra and Kota Yamaguchi}, booktitle = {IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)}, year = {2023}, pages = {3639-3648}, doi = {10.1109/WACV56688.2023.00364} }

Licence

The code is licensed under Apache-2.0 and the dataset is licensed under CC BY-NC-SA 4.0.

Owner

  • Name: CyberAgent AI Lab
  • Login: CyberAgentAILab
  • Kind: organization
  • Location: Japan

GitHub Events

Total
  • Watch event: 1
Last Year
  • Watch event: 1

Dependencies

poetry.lock pypi
  • 115 dependencies
pyproject.toml pypi
  • cairosvg ^2.6.0
  • dgl-cu116 *
  • h5py ^3.7.0
  • python ^3.10
  • pytorch-lightning ^1.8.6
  • tensorboard ^2.11.0
  • torch *