https://github.com/cyberagentailab/tdc-typography-generation

This repository contains the code for https://arxiv.org/abs/2309.02099

Basic Info
  • Host: GitHub
  • Owner: CyberAgentAILab
  • License: apache-2.0
  • Language: Python
  • Default Branch: main
  • Size: 16.5 MB
Statistics
  • Stars: 5
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created over 2 years ago · Last pushed over 1 year ago

README.md

Paper: Towards Diverse and Consistent Typography Generation

Wataru Shimoda¹, Daichi Haraguchi², Seiichi Uchida², Kota Yamaguchi¹
¹CyberAgent.Inc, ²Kyushu University
Accepted to WACV 2024. [Publication] [Arxiv] [Project-page]

Introduction

This repository contains the code for "Towards Diverse and Consistent Typography Generation".

Requirements

We have verified reproducibility in the following environment:
- Ubuntu (>=20.04)
- Python 3 (>=3.8, <3.11)
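The Python range above can be checked before installing anything. The following is a minimal sketch, not part of the repository; the function name `is_supported_python` is an assumption made for illustration.

```python
import sys

# Supported range taken from the Requirements list: Python >=3.8, <3.11.
MIN_VERSION = (3, 8)
MAX_VERSION_EXCLUSIVE = (3, 11)


def is_supported_python(version_info=None):
    """Return True if (major, minor) falls inside the supported range.

    Hypothetical helper: it only compares version numbers and does not
    check the operating system.
    """
    if version_info is None:
        version_info = sys.version_info
    major_minor = tuple(version_info[:2])
    return MIN_VERSION <= major_minor < MAX_VERSION_EXCLUSIVE


if __name__ == "__main__":
    current = ".".join(str(part) for part in sys.version_info[:3])
    status = "supported" if is_supported_python() else "outside the tested range"
    print(f"Python {current} is {status}.")
```

Running the file directly prints whether the current interpreter falls within the tested range.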

Install

Clone this repository and navigate to the folder:

    git clone https://github.com/CyberAgentAILab/tdc-typography-generation.git
    cd tdc-typography-generation

We manage Python library dependencies with pyproject.toml.
Please install the dependencies via pip or Poetry using pyproject.toml.

If your setuptools version is >=61.0.0, the following command installs the dependencies via pip:

    pip install .

Alternatively, we recommend installing the dependencies using Poetry (see official docs):

    poetry install

Note that the commands in the rest of this guide omit the leading poetry run for simplicity.

Dataset

Our model is trained and tested on the Crello dataset, which is publicly available on Hugging Face.
The dataset can be downloaded through the Hugging Face Dataset API, but it does not contain high-resolution background images.

We provide background images via Google Drive (link, 3.6GB).
Please download the background images and place them in data/generate_bg_png/.

    tar zxvf generate_bg_png.tar.gz
    mv generate_bg_png data/
    rm generate_bg_png.tar.gz

We also provide font files for rendering designs and computing appropriate text sizes via Google Drive (link, 43MB).
Please download the font files and place them in data/font/.

    tar zxvf font.tar.gz
    mv font data/
    rm font.tar.gz
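After extracting both archives, it can be useful to confirm that the data directory has the expected layout. The following is a minimal sketch under the assumption that only the two directories named above need to exist and be non-empty; `missing_data_dirs` is a hypothetical helper, not part of the repository.

```python
from pathlib import Path

# Directories named in the Dataset section, relative to the data directory.
EXPECTED_SUBDIRS = ("generate_bg_png", "font")


def missing_data_dirs(datadir="data"):
    """Return the expected subdirectories that are absent or empty."""
    root = Path(datadir)
    missing = []
    for name in EXPECTED_SUBDIRS:
        path = root / name
        # A directory counts as missing if it does not exist or has no entries.
        if not path.is_dir() or not any(path.iterdir()):
            missing.append(name)
    return missing


if __name__ == "__main__":
    missing = missing_data_dirs("data")
    if missing:
        print("Missing or empty:", ", ".join(missing))
    else:
        print("Background images and font files are in place.")
```

The check is deliberately shallow: it does not validate individual image or font files.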

Usage

We provide the following scripts for experiments.

Preprocessing

We recommend adding features to the dataset in advance, which makes training and testing faster.
We provide a script for preprocessing:

    python -m typography_generation map_features --datadir data

This script extends the dataset via the map function.
The extended dataset is saved in data/map_featreus, and the --use_extended_dataset option controls whether the extended dataset is used.

Training

The following command trains a model. Training takes about half a day with the preprocessed dataset on an NVIDIA T4 machine.
Training details are controlled through configuration files in data/config/*.yaml.
The basic configurations are in src/typography_generation/config/*.py.

    python -m typography_generation train_evaluation \
        --configname bart \
        --jobdir ${OUTPUT_DIR} \
        --datadir data \
        --use_extended_dataset \
        --gpu

The outputs are in ${OUTPUT_DIR}.
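When launching several training runs, the command line can be assembled programmatically. The following is a minimal sketch that uses only the flags shown in this README; `build_train_command` is a hypothetical wrapper, and the repository itself may offer other options not covered here.

```python
import shlex


def build_train_command(jobdir, datadir="data", configname="bart",
                        use_extended_dataset=True, gpu=True):
    """Assemble the train_evaluation command as an argument list.

    Hypothetical helper: only the flags shown in this README are used.
    """
    cmd = [
        "python", "-m", "typography_generation", "train_evaluation",
        "--configname", configname,
        "--jobdir", str(jobdir),
        "--datadir", str(datadir),
    ]
    if use_extended_dataset:
        # Only meaningful after the preprocessing step has been run.
        cmd.append("--use_extended_dataset")
    if gpu:
        cmd.append("--gpu")
    return cmd


if __name__ == "__main__":
    # Print the command; pass the list to subprocess.run(cmd, check=True) to execute it.
    print(shlex.join(build_train_command("outputs/run1")))
```

Passing an argument list rather than a shell string avoids quoting problems when the output directory contains spaces.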

Sampling

The following command samples typographic attributes.
This command requires the --weight option, which specifies the path to the weights of a trained model.
The weight file produced by the above training command is saved at ${OUTPUT_DIR}/weight.pth.
Please set ${WEIGHT_FILE} to the path of a weight file.
A pretrained model reproduced by this repository is here.

    python -m typography_generation structure_preserved_sample \
        --configname bart \
        --jobdir ${OUTPUT_DIR} \
        --datadir data \
        --weight=${WEIGHT_FILE} \
        --use_extended_dataset \
        --gpu
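The relationship between the training output directory and the weight file can be captured in a small helper. The following is a minimal sketch, assuming the weight file sits at weight.pth inside the job directory as stated above; `resolve_weight_file` and `build_sample_command` are hypothetical names, not part of the repository.

```python
from pathlib import Path


def resolve_weight_file(jobdir, weight=None):
    """Return the explicit weight path, or jobdir/weight.pth if none is given."""
    if weight is not None:
        return Path(weight)
    return Path(jobdir) / "weight.pth"


def build_sample_command(jobdir, weight=None, datadir="data", configname="bart"):
    """Assemble the structure_preserved_sample command as an argument list.

    Hypothetical helper: only the flags shown in this README are used.
    """
    weight_path = resolve_weight_file(jobdir, weight)
    return [
        "python", "-m", "typography_generation", "structure_preserved_sample",
        "--configname", configname,
        "--jobdir", str(jobdir),
        "--datadir", str(datadir),
        f"--weight={weight_path}",
        "--use_extended_dataset",
        "--gpu",
    ]


if __name__ == "__main__":
    cmd = build_sample_command("outputs/run1")
    weight_path = resolve_weight_file("outputs/run1")
    if not weight_path.is_file():
        print(f"Weight file not found: {weight_path}")
    print(" ".join(cmd))
```

Supplying `weight` explicitly covers the case where a downloaded pretrained model is used instead of a locally trained one.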

Visualization

We provide notebooks for showing results.
- notebooks/score.ipynb shows the scores of the saved results in ${OUTPUT_DIR}.
- notebooks/vis.ipynb shows the generated graphic designs in ${OUTPUT_DIR}.

Reference

    @misc{shimoda_2024_tdctg,
        author    = {Shimoda, Wataru and Haraguchi, Daichi and Uchida, Seiichi and Yamaguchi, Kota},
        title     = {Towards Diverse and Consistent Typography Generation},
        publisher = {Winter Conference on Applications of Computer Vision (WACV)},
        year      = {2024},
    }

Contact

This repository is maintained by Wataru Shimoda (wataru_shimoda[at]cyberagent.co.jp).

Owner

  • Name: CyberAgent AI Lab
  • Login: CyberAgentAILab
  • Kind: organization
  • Location: Japan

GitHub Events

Total
  • Issues event: 1
  • Watch event: 3
  • Issue comment event: 2
  • Push event: 3
Last Year
  • Issues event: 1
  • Watch event: 3
  • Issue comment event: 2
  • Push event: 3