https://github.com/cdli-gh/data

This is a copy of the daily dump of catalogue and ATF data from the Cuneiform Digital Library Initiative (http://cdli.ucla.edu)

https://github.com/cdli-gh/data

Science Score: 10.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
    1 of 6 committers (16.7%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (9.0%) to scientific vocabulary

Keywords

atf catalogue cuneiform metadata
Last synced: 5 months ago · JSON representation

Repository

This is a copy of the daily dump of catalogue and ATF data from the Cuneiform Digital Library Initiative (http://cdli.ucla.edu)

Basic Info
Statistics
  • Stars: 56
  • Watchers: 7
  • Forks: 14
  • Open Issues: 7
  • Releases: 8
Topics
atf catalogue cuneiform metadata
Created about 9 years ago · Last pushed over 2 years ago
Metadata Files
Readme

README.md

CDLI Daily Bulk Data Dump

Last update was August 2022.

The repository contains a daily dump of all public catalogue and text data from the Cuneiform Digital Library Initiative.

Getting the data

Make sure you have the Git Large File Storage extentions (git-lfs) installed, see here for instructions. For installing under, say, Ubuntu, you can also use

$> curl -s https://packagecloud.io/install/repositories/github/git-lfs/script.deb.sh | sudo bash
$> sudo apt-get install git-lfs

Clone the repository

$> git clone https://github.com/cdli-gh/data

Retrieve Git LSF data:

$> cd data
$> git lfs fetch

Format

Text Data

The CDLI transliterations dump is offered in plain text UTF-8 ATF format. For more information about ATF, visit :

  http://oracc.museum.upenn.edu/doc/help/editinginatf/cdliatf/index.html (Scroll down for an example).

Catalogue data

The catalogue is offered in a UTF-8 comma separated format. Most fields are thoroughly explained here:

 https://cdli.ucla.edu/?q=cdli-search-information  

Our data schema is currently being remodeled, get in touch if you would like a sneak peak!

To view a sample of the catalogue, you can use the head command on a Unix machine using this syntax, while you are in the directory where the file is stored: head cdli_catalogue_1of2.csv With Windows Power Shell, try Get-Content *filename* -Head *n*

EPP cdli@ucla.edu

Owner

  • Name: CDLI
  • Login: cdli-gh
  • Kind: organization
  • Email: cdli@orinst.ox.ac.uk
  • Location: Los Angeles, Oxford, Berlin

GitHub Events

Total
  • Watch event: 3
  • Fork event: 1
Last Year
  • Watch event: 3
  • Fork event: 1

Committers

Last synced: 10 months ago

All Time
  • Total Commits: 2,222
  • Total Committers: 6
  • Avg Commits per committer: 370.333
  • Development Distribution Score (DDS): 0.005
Past Year
  • Commits: 1
  • Committers: 1
  • Avg Commits per committer: 1.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
aashithk e****n@g****m 2,211
Christian Chiarcos c****s@w****e 3
Émilie Pagé-Perron e****p@i****t 3
Lars Willighagen l****n@g****m 2
aashithk a****k@c****u 2
Gaurav Shukla g****8@g****m 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 8 months ago

All Time
  • Total issues: 63
  • Total pull requests: 4
  • Average time to close issues: about 2 months
  • Average time to close pull requests: about 8 hours
  • Total issue authors: 10
  • Total pull request authors: 4
  • Average comments per issue: 0.97
  • Average comments per pull request: 0.0
  • Merged pull requests: 2
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • rillian (49)
  • ACL90 (6)
  • morrisalp (1)
  • hohilwik (1)
  • soumyadip007 (1)
  • MrLogarithm (1)
  • epageperron (1)
  • kesinger (1)
  • shubhamdotjain (1)
  • larsgw (1)
Pull Request Authors
  • withgaurav (1)
  • larsgw (1)
  • Maherukh (1)
  • Lord-of-Codes (1)
Top Labels
Issue Labels
atf syntax (42) csv syntax (6)
Pull Request Labels

Dependencies

.github/workflows/release.yml actions
  • actions/checkout v2 composite
  • actions/create-release v1 composite