https://github.com/cdli-gh/cdli-cts

Canonical Text Services export of the Cuneiform Digital Library Initiative corpus.

https://github.com/cdli-gh/cdli-cts

Science Score: 10.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
    1 of 4 committers (25.0%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (12.7%) to scientific vocabulary

Keywords

akkadian capitains cuneiform python sumerian tei-xml

Keywords from Contributors

archival projection interactive generic sequences observability autograding hacking shellcodes modular
Last synced: 6 months ago · JSON representation

Repository

Canonical Text Services export of the Cuneiform Digital Library Initiative corpus.

Basic Info
  • Host: GitHub
  • Owner: cdli-gh
  • Language: Python
  • Default Branch: main
  • Homepage: https://cdli.ucla.edu/
  • Size: 2.14 MB
Statistics
  • Stars: 4
  • Watchers: 2
  • Forks: 2
  • Open Issues: 2
  • Releases: 0
Topics
akkadian capitains cuneiform python sumerian tei-xml
Created over 6 years ago · Last pushed over 2 years ago
Metadata Files
Readme

README.md

Canonical Cuneiform Texts

Currently experimental and incomplete!

This is a collection of cuneiform tablets in Canonical Text Services format, from the database of the Cuneiform Digital Library Initiative. This format is used by a number of viewing an analysis tools for learning and scholarship. It is hoped that this collection will make the texts more accessible.

Adding files

Cuneiform transcriptions are normally maintained in the ATF format. To add new texts or update the collection run the transcriptions through the atf2cts tool from the atf2tei package.

For example, to convert a atf file containing one or more tablet transcriptions and add it to the repository:

pip install pipenv # if necessar. git clone https://github.com/cdli-gh/atf2tei cd atf2tei pipenv install pipenv run python atf2cts.py /path/to/your/transcription.atf mv data/* ../data/

A simple git status should then show the added (or changed) files.

Updating

There is also a script in the update directory which reads the entire CDLI bulk data export and converts a subset of the records, based on a list of CDLI id numbers or particular catalogue field entries.

git clone --depth https://github.com/cdli-gh/data cdli-data pipenv install pyoracc PYTHONPATH=$PWD/atf2tei update/cdli2cts.py -d cdli-data -o .

The data repository is quite large. Passing the --depth option to git downloads only the most recent changes, reducing the size to several hundred MB.

Until atf2cts is properly packaged, it also needs to be checked out with git and the location passed through the PYTHONPATH environment variable. See the section above about added files for how to do this.

A docker container configuration is included which can be used to set up automatic updates. See update for details.

Owner

  • Name: CDLI
  • Login: cdli-gh
  • Kind: organization
  • Email: cdli@orinst.ox.ac.uk
  • Location: Los Angeles, Oxford, Berlin

GitHub Events

Total
Last Year

Committers

Last synced: 8 months ago

All Time
  • Total Commits: 39
  • Total Committers: 4
  • Avg Commits per committer: 9.75
  • Development Distribution Score (DDS): 0.282
Past Year
  • Commits: 2
  • Committers: 1
  • Avg Commits per committer: 2.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Ralph Giles g****s@t****t 28
dependabot[bot] 4****] 7
CDLI CTS Update n****y@c****u 3
Ralph Giles g****s@d****n 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 8 months ago

All Time
  • Total issues: 1
  • Total pull requests: 12
  • Average time to close issues: N/A
  • Average time to close pull requests: about 2 months
  • Total issue authors: 1
  • Total pull request authors: 2
  • Average comments per issue: 0.0
  • Average comments per pull request: 0.33
  • Merged pull requests: 8
  • Bot issues: 0
  • Bot pull requests: 11
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • rillian (1)
Pull Request Authors
  • dependabot[bot] (11)
  • rillian (1)
Top Labels
Issue Labels
Pull Request Labels
dependencies (11)

Dependencies

Pipfile pypi
  • hooktest >=1.2
Pipfile.lock pypi
  • ansicolors ==1.0.2
  • cachetools ==5.2.0
  • certifi ==2022.6.15
  • charset-normalizer ==2.1.0
  • decorator ==5.1.1
  • frozendict ==2.3.2
  • future ==0.18.2
  • hooktest ==1.3.1
  • idna ==3.3
  • importlib-metadata ==4.12.0
  • isodate ==0.6.1
  • jingtrang ==0.1.1
  • linkheader ==0.4.3
  • lxml ==4.9.1
  • mycapytain ==3.0.2
  • prettytable ==0.7.2
  • pyld ==2.0.3
  • pyparsing ==3.0.9
  • rdflib ==6.1.1
  • rdflib-jsonld ==0.6.2
  • requests ==2.28.1
  • setuptools ==63.1.0
  • six ==1.16.0
  • typing ==3.7.4.3
  • typing-extensions ==4.3.0
  • urllib3 ==1.26.9
  • validators ==0.20.0
  • zipp ==3.8.0
update/Dockerfile docker
  • python 3.7-alpine build