https://github.com/cdli-gh/data
This is a copy of the daily dump of catalogue and ATF data from the Cuneiform Digital Library Initiative (http://cdli.ucla.edu)
Science Score: 10.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
○codemeta.json file
-
○.zenodo.json file
-
○DOI references
-
○Academic publication links
-
✓Committers with academic emails
1 of 6 committers (16.7%) from academic institutions -
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (9.0%) to scientific vocabulary
Keywords
Repository
This is a copy of the daily dump of catalogue and ATF data from the Cuneiform Digital Library Initiative (http://cdli.ucla.edu)
Basic Info
- Host: GitHub
- Owner: cdli-gh
- Default Branch: master
- Homepage: http://cdli.ucla.edu/bulk_data
- Size: 5.76 GB
Statistics
- Stars: 56
- Watchers: 7
- Forks: 14
- Open Issues: 7
- Releases: 8
Topics
Metadata Files
README.md
CDLI Daily Bulk Data Dump
Last update was August 2022.
The repository contains a daily dump of all public catalogue and text data from the Cuneiform Digital Library Initiative.
Getting the data
Make sure you have the Git Large File Storage extentions (git-lfs) installed, see here for instructions. For installing under, say, Ubuntu, you can also use
$> curl -s https://packagecloud.io/install/repositories/github/git-lfs/script.deb.sh | sudo bash
$> sudo apt-get install git-lfs
Clone the repository
$> git clone https://github.com/cdli-gh/data
Retrieve Git LSF data:
$> cd data
$> git lfs fetch
Format
Text Data
The CDLI transliterations dump is offered in plain text UTF-8 ATF format. For more information about ATF, visit :
http://oracc.museum.upenn.edu/doc/help/editinginatf/cdliatf/index.html (Scroll down for an example).
Catalogue data
The catalogue is offered in a UTF-8 comma separated format. Most fields are thoroughly explained here:
https://cdli.ucla.edu/?q=cdli-search-information
Our data schema is currently being remodeled, get in touch if you would like a sneak peak!
To view a sample of the catalogue, you can use the head command on a Unix machine using this syntax, while you are in the directory where the file is stored:
head cdli_catalogue_1of2.csv
With Windows Power Shell, try
Get-Content *filename* -Head *n*
EPP cdli@ucla.edu
Owner
- Name: CDLI
- Login: cdli-gh
- Kind: organization
- Email: cdli@orinst.ox.ac.uk
- Location: Los Angeles, Oxford, Berlin
- Website: https://cdli.ucla.edu
- Repositories: 83
- Profile: https://github.com/cdli-gh
GitHub Events
Total
- Watch event: 3
- Fork event: 1
Last Year
- Watch event: 3
- Fork event: 1
Committers
Last synced: 10 months ago
Top Committers
| Name | Commits | |
|---|---|---|
| aashithk | e****n@g****m | 2,211 |
| Christian Chiarcos | c****s@w****e | 3 |
| Émilie Pagé-Perron | e****p@i****t | 3 |
| Lars Willighagen | l****n@g****m | 2 |
| aashithk | a****k@c****u | 2 |
| Gaurav Shukla | g****8@g****m | 1 |
Committer Domains (Top 20 + Academic)
Issues and Pull Requests
Last synced: 8 months ago
All Time
- Total issues: 63
- Total pull requests: 4
- Average time to close issues: about 2 months
- Average time to close pull requests: about 8 hours
- Total issue authors: 10
- Total pull request authors: 4
- Average comments per issue: 0.97
- Average comments per pull request: 0.0
- Merged pull requests: 2
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- rillian (49)
- ACL90 (6)
- morrisalp (1)
- hohilwik (1)
- soumyadip007 (1)
- MrLogarithm (1)
- epageperron (1)
- kesinger (1)
- shubhamdotjain (1)
- larsgw (1)
Pull Request Authors
- withgaurav (1)
- larsgw (1)
- Maherukh (1)
- Lord-of-Codes (1)
Top Labels
Issue Labels
Pull Request Labels
Dependencies
- actions/checkout v2 composite
- actions/create-release v1 composite