cremmawiki-anonymizer

Image or Metadata anonymizer for the CREMMAWIKI corpus

https://github.com/alix-tz/cremmawiki-anonymizer

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (7.1%) to scientific vocabulary

Keywords

anonymization cremma cremmawiki htr-united opencv-python

Repository


Basic Info
  • Host: GitHub
  • Owner: alix-tz
  • License: unlicense
  • Language: Python
  • Default Branch: master
  • Homepage:
  • Size: 13 MB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 1
  • Releases: 1
Created over 3 years ago · Last pushed over 3 years ago
Metadata Files
Readme License Citation

README.md

CREMMAWIKI_Anonymizer (CWA)

CWA is an anonymizer for the CREMMAWIKI corpus.

Set up

Create a virtual environment and use pip to install the dependencies from PyPI (e.g. pip install -r requirements.txt).

Run

run_cia.py and run_mi6.py show how to use the CWA module with CremmawikiImageAnonymizer (CIA) and MetadataInSafeExposure (MI6), respectively.

You can clone this repository, replace the images in data with yours, set up an environment and execute run_cia.py.

To change the path to the source files, edit the value of:

  • path_to_data in run_cia.py
  • path_to_conversion_table, path_to_metadata and/or path_to_new_metadata in run_mi6.py

The new files (image or CSV) are created in the source folder. They are named so that my_source_img.png becomes my_source_img.out.png and my_source.csv becomes public_my_source.csv. CWA therefore never overwrites your original files; the only file it updates in place is the name conversion table (in MI6).
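The naming convention described above can be illustrated with a short sketch. It is not taken from the CWA code: the helper names `image_output_path` and `metadata_output_path` are hypothetical, and the sketch only reproduces the two renaming patterns given in this README.

```python
from pathlib import Path


def image_output_path(source):
    """Hypothetical helper: my_source_img.png -> my_source_img.out.png."""
    source = Path(source)
    # Insert ".out" between the file stem and its extension,
    # keeping the file in the same folder as the source.
    return source.with_name(f"{source.stem}.out{source.suffix}")


def metadata_output_path(source):
    """Hypothetical helper: my_source.csv -> public_my_source.csv."""
    source = Path(source)
    # Prefix the file name with "public_", again in the source folder.
    return source.with_name(f"public_{source.name}")


print(image_output_path("data/my_source_img.png"))
print(metadata_output_path("data/my_source.csv"))
```

Because both helpers build a new name next to the source file, the source itself is left untouched.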

CIA example

| source | output |
| :----: | :----: |
| image | image |

MI6 example

Before

  • metadata.csv (secret):

| col1 | col2 | writer_name | col3 |
| :--: | :--: | :---------: | :--: |
| bla | bli | James Bond | blo |
| blu | bly | Q | ble |

After

  • public_metadata.csv:

| col1 | col2 | writer_name | col3 |
| :--: | :--: | :---------: | :--: |
| bla | bli | AW0007 | blo |
| blu | bly | AW0001 | ble |

  • conversion_table.csv (secret):

    James Bond,AW0007
    Q,AW0001
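The MI6 example can be approximated with the standard library alone. The sketch below is an illustration, not the actual MetadataInSafeExposure implementation: the function `pseudonymize` is hypothetical, and the rule for assigning an ID to a name that is not yet in the conversion table (next free number after the highest existing one, zero-padded to four digits) is an assumption.

```python
import csv
import io


def pseudonymize(rows, table, column="writer_name", prefix="AW"):
    """Replace names in `column` with IDs from `table` (hypothetical helper).

    Names missing from `table` receive a new ID; this assignment rule is an
    assumption and is not documented in the README.
    """
    used = {int(identifier[len(prefix):]) for identifier in table.values()}
    next_id = max(used, default=0) + 1
    public_rows = []
    for row in rows:
        name = row[column]
        if name not in table:
            # The conversion table is extended in place with the new name.
            table[name] = f"{prefix}{next_id:04d}"
            next_id += 1
        # Copy the row so the secret rows are not modified.
        public_rows.append({**row, column: table[name]})
    return public_rows


# Same data as in the example above, held in memory for the demonstration.
secret_csv = "col1,col2,writer_name,col3\nbla,bli,James Bond,blo\nblu,bly,Q,ble\n"
table = {"James Bond": "AW0007", "Q": "AW0001"}

rows = list(csv.DictReader(io.StringIO(secret_csv)))
public_rows = pseudonymize(rows, table)
for row in public_rows:
    print(row)
```

In a real run, the rows would be read from the file at path_to_metadata, the public rows written to path_to_new_metadata, and the updated table saved back to path_to_conversion_table.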

Owner

  • Name: Alix Chagué
  • Login: alix-tz
  • Kind: user
  • Company: Inria

PhD student in Digital Humanities @ Université de Montréal and Inria.

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Chagué"
  given-names: "Alix"
  orcid: "https://orcid.org/0000-0002-0136-4434"
title: "CREMMAWIKI_Anonymizer"
version: 0.1.0
date-released: 2022-03-31
url: "https://github.com/alix-tz/CREMMAWIKI_Anonymizer"
