https://github.com/google-deepmind/cube

Code for CUBE extraction and cultural diversity metric introduced in "Beyond Aesthetics: Cultural Competence in Text-to-Image Models"

Science Score: 23.0%

This score indicates how likely this project is to be science-related based on various indicators:

○ CITATION.cff file
✓ codemeta.json file: Found codemeta.json file
○ .zenodo.json file
○ DOI references
✓ Academic publication links: Links to arxiv.org
○ Committers with academic emails
○ Institutional organization owner
○ JOSS paper metadata
○ Scientific vocabulary similarity: Low similarity (11.8%) to scientific vocabulary

Last synced: 8 months ago · JSON representation

Repository


Basic Info

Host: GitHub
Owner: google-deepmind
License: apache-2.0
Language: Python
Default Branch: main
Homepage:
Size: 6.24 MB

Statistics

Stars: 4
Watchers: 5
Forks: 0
Open Issues: 0
Releases: 0

Created over 1 year ago · Last pushed over 1 year ago

Metadata Files

Readme Contributing License

Code for "Beyond Aesthetics: Cultural Competence in Text-to-Image Models", published at NeurIPS 2024 (Track on Datasets and Benchmarks).

Current T2I benchmarks primarily focus on the faithfulness, aesthetics, and realism of generated images, overlooking the critical dimension of cultural competence. In this work, we introduce a framework to evaluate the cultural competence of T2I models along two crucial dimensions: cultural awareness and cultural diversity, and present a scalable approach that combines a structured knowledge base (KB) and large language models (LLMs) to build a large dataset of cultural artifacts to enable this evaluation. In particular, we apply this approach to build CUBE (CUltural BEnchmark for Text-to-Image models), a first-of-its-kind benchmark to evaluate the cultural competence of T2I models. CUBE covers cultural artifacts associated with 8 countries across different geo-cultural regions and along 3 concepts: cuisine, landmarks, and art. We also introduce cultural diversity as a novel T2I evaluation component, leveraging the quality-weighted Vendi score. In this repository, we release 1) the code for CUBE extraction from WikiData, 2) a notebook implementing the cultural diversity evaluation for T2I models, and 3) the CUBE dataset, containing CUBE-CSpace and the CUBE-1K prompts.

CUBE Extraction

Setup

Fetch the latest KB dump (rerunning these commands refreshes the local copy with the latest Wikidata dump)

python3.8 -m venv wikidata
source wikidata/bin/activate
pip3 install https://ringgaard.com/data/dist/sling-3.0.0-py3-none-linux_x86_64.whl
pip install urllib3
pip install absl-py
pip install pandas
pip install requests
sling fetch --dataset kb,mapping

Usage

Move to CUBE directory

cd cube_extraction

Partition KB into smaller parts for easy multiprocessing (this step needs to be done only once)

python3 partition_kb.py

Grant execute permission to the extraction script

chmod +x run_kb_extraction.sh

Extract cultural artifacts

```
sh run_kb_extraction.sh
```

Example usage with arguments:

```
sh run_kb_extraction.sh
Enter directory of KB partitions: /path/to/kb_nodes/ (Local cloudtop path)
Enter concept (e.g., cuisine, landmarks, art): cuisine
Enter number of hops: 3
Enter desired output directory: /path/to/output/ (Local cloudtop path)
Enter desired output file name: cuisine_artifacts.json
```
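The "number of hops" prompt suggests that extraction walks outward from a concept's root node in the KB for a bounded number of steps. The sketch below illustrates that idea as a breadth-first traversal over a toy in-memory graph. It is an assumption about the general technique, not the repository's code: `k_hop_descendants`, `TOY_KB`, and the edge layout are hypothetical, and the real scripts operate on SLING KB partitions.

```python
from collections import deque


def k_hop_descendants(children, roots, max_hops):
    """Return nodes reachable from `roots` within `max_hops` edges (BFS order).

    `children` maps a node to the nodes one hop below it (hypothetical layout).
    """
    seen = set(roots)
    frontier = deque((root, 0) for root in roots)
    found = []
    while frontier:
        node, depth = frontier.popleft()
        if depth == max_hops:
            continue  # do not expand beyond the hop budget
        for child in children.get(node, ()):
            if child not in seen:
                seen.add(child)
                found.append(child)
                frontier.append((child, depth + 1))
    return found


# Toy graph standing in for KB edges; entries are illustrative only.
TOY_KB = {
    "dish": ["noodle dish", "rice dish"],
    "noodle dish": ["ramen", "soba"],
    "rice dish": ["sushi"],
    "ramen": ["tonkotsu ramen"],
}

if __name__ == "__main__":
    print(k_hop_descendants(TOY_KB, ["dish"], 2))
```

A larger hop count reaches more specific artifacts at the cost of visiting more nodes, which is presumably why the KB is partitioned for multiprocessing first.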

Cultural Diversity

Setup

conda create -n cd python=3.10 && conda activate cd
pip install -r requirements.txt

Usage

Refer to the culturaldiversity.ipynb notebook (Colab: https://colab.research.google.com/github/google-deepmind/cube/blob/master/culturaldiversity.ipynb) for sample usage of the evaluation metric.
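For orientation, here is a minimal NumPy sketch of a quality-weighted Vendi score. It assumes the common formulation in which the Vendi score is the exponential of the Shannon entropy of the eigenvalues of K/n (K being an n×n similarity matrix with ones on the diagonal) and the quality-weighted variant multiplies it by the mean per-sample quality. The function names are hypothetical; the notebook and the `vendi_score` package listed in requirements.txt remain the reference implementation, and how similarity and quality are computed for CUBE is not reproduced here.

```python
import numpy as np


def vendi_score(similarity):
    """Exponential of the Shannon entropy of the eigenvalues of K / n."""
    k = np.asarray(similarity, dtype=float)
    n = k.shape[0]
    eigvals = np.linalg.eigvalsh(k / n)
    eigvals = eigvals[eigvals > 1e-12]  # drop numerical zeros (0 * log 0 := 0)
    entropy = -np.sum(eigvals * np.log(eigvals))
    return float(np.exp(entropy))


def quality_weighted_vendi_score(similarity, qualities):
    """Assumed form: mean quality times the Vendi score."""
    return float(np.mean(qualities)) * vendi_score(similarity)


if __name__ == "__main__":
    # Four mutually dissimilar samples, each with quality 0.5.
    print(quality_weighted_vendi_score(np.eye(4), [0.5, 0.5, 0.5, 0.5]))
```

Under this formulation, n identical samples score 1 and n mutually dissimilar samples score n, so the quality weight scales an "effective number of distinct items".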

Dataset

CUBE dataset contains cultural artifacts across 8 countries (Brazil, France, India, Italy, Japan, Nigeria, Turkey, and USA) and 3 domains (cuisine, landmarks, art).

The dataset is divided into two parts:

1. CUBE-CSpace: a collection of 300k cultural artifacts, intended to be used as grounding for diversity evaluation. For example,
   * Japanese Cuisine: Ramen, Soba, Sushi, Katsu sandwich
   * French Landmarks: Eiffel Tower, Mont Saint-Michel, Palace of Versailles
   * Indian Art/Clothing: Kurta, Lehenga Choli, Dhoti, Patola Saree

2. CUBE-1K: a curated set of 1000 prompts constructed from cultural artifacts in CUBE-CSpace, which enable evaluation of cultural awareness. For example,
   * Cuisine: A high resolution image of sushi from Japanese cuisine.
   * Landmarks: A panoramic view of Eiffel Tower in France.
   * Art: Clothing Image of a person in kurta from India.
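The example prompts follow a per-concept pattern, which can be sketched as simple string templates. This is only an illustration inferred from the examples shown: the template strings, `PROMPT_TEMPLATES`, and `build_prompt` are hypothetical, and the released CUBE-1K prompts should be used as distributed rather than regenerated this way.

```python
# Hypothetical templates inferred from the example prompts; not taken from the repository.
PROMPT_TEMPLATES = {
    "cuisine": "A high resolution image of {artifact} from {demonym} cuisine.",
    "landmarks": "A panoramic view of {artifact} in {country}.",
    "art": "Image of a person in {artifact} from {country}.",
}


def build_prompt(concept, artifact, country, demonym):
    """Fill the template for `concept`; unused fields are simply ignored."""
    template = PROMPT_TEMPLATES[concept]
    return template.format(artifact=artifact, country=country, demonym=demonym)


if __name__ == "__main__":
    print(build_prompt("cuisine", "sushi", "Japan", "Japanese"))
    print(build_prompt("landmarks", "Eiffel Tower", "France", "French"))
```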

Citation

If you find the code or dataset useful, please cite our paper:

@misc{kannen2024aestheticsculturalcompetencetexttoimage,
  title={Beyond Aesthetics: Cultural Competence in Text-to-Image Models},
  author={Nithish Kannen and Arif Ahmad and Marco Andreetto and Vinodkumar Prabhakaran and Utsav Prabhu and Adji Bousso Dieng and Pushpak Bhattacharyya and Shachi Dave},
  year={2024},
  eprint={2407.06863},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2407.06863},
}

License

Software

All software is licensed under the Apache License, Version 2.0 (Apache 2.0); you may not use this file except in compliance with the Apache 2.0 license. You may obtain a copy of the Apache 2.0 license at: https://www.apache.org/licenses/LICENSE-2.0

Dataset

We distribute the CUBE dataset under the CC BY-NC-SA 4.0 license.

All other materials are licensed under the Creative Commons Attribution 4.0 International License (CC-BY). You may obtain a copy of the CC-BY license at: https://creativecommons.org/licenses/by/4.0/legalcode

Unless required by applicable law or agreed to in writing, all software and materials distributed here under the Apache 2.0 or CC-BY licenses are distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the licenses for the specific language governing permissions and limitations under those licenses.

This is not an official Google product.

Contact

Please direct your questions to nitkan@google.com / shachi@google.com

Owner

Name: Google DeepMind
Login: google-deepmind
Kind: organization

Website: https://www.deepmind.com/
Repositories: 245
Profile: https://github.com/google-deepmind

GitHub Events

Total

Watch event: 4
Member event: 1
Push event: 9
Create event: 2

Last Year

Watch event: 4
Member event: 1
Push event: 9
Create event: 2

Committers

Last synced: about 1 year ago

All Time

Total Commits: 13
Total Committers: 4
Avg Commits per committer: 3.25
Development Distribution Score (DDS): 0.308

Past Year

Commits: 12
Committers: 3
Avg Commits per committer: 4.0
Development Distribution Score (DDS): 0.25

Top Committers

Name	Email	Commits
Nithish Kannnen	6****n	9
DeepMind	n**y@g**m	2
Nithish Kannen	n**n@g**m	1
DeepMind	n**y@d**m	1

Committer Domains (Top 20 + Academic)

google.com: 2
deepmind.com: 1

Issues and Pull Requests

Last synced: about 1 year ago

All Time

Total issues: 0
Total pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Total issue authors: 0
Total pull request authors: 0
Average comments per issue: 0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 0
Pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Issue authors: 0
Pull request authors: 0
Average comments per issue: 0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0


Dependencies

requirements.txt pypi

diffusers *
google-cloud-aiplatform *
google-generativeai *
torch *
transformers *
unittest2 *
vendi_score *
