https://github.com/cdli-gh/cdli-cts
Canonical Text Services export of the Cuneiform Digital Library Initiative corpus.
Science Score: 10.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ○ CITATION.cff file
- ○ codemeta.json file
- ○ .zenodo.json file
- ○ DOI references
- ○ Academic publication links
- ✓ Committers with academic emails: 1 of 4 committers (25.0%) from academic institutions
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: low similarity (12.7%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: cdli-gh
- Language: Python
- Default Branch: main
- Homepage: https://cdli.ucla.edu/
- Size: 2.14 MB
Statistics
- Stars: 4
- Watchers: 2
- Forks: 2
- Open Issues: 2
- Releases: 0
Metadata Files
README.md
Canonical Cuneiform Texts
Currently experimental and incomplete!
This is a collection of cuneiform tablets in Canonical Text Services format, from the database of the Cuneiform Digital Library Initiative. This format is used by a number of viewing and analysis tools for learning and scholarship. It is hoped that this collection will make the texts more accessible.
- Converted by atf2tei from cdliatf_unblocked.atf in the data repository.
- Layout following the CapiTainS Guidelines.
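To illustrate what a CapiTainS-style layout means in practice, here is a minimal sketch that lists the text files in a checkout. It assumes the usual CapiTainS convention of data/<textgroup>/<work>/<textgroup>.<work>.<version>.xml with __cts__.xml metadata files alongside; whether every file in this repository follows that pattern exactly has not been verified, and the function name list_cts_texts is hypothetical.

```python
from pathlib import Path


def list_cts_texts(root):
    """Yield (textgroup, work, version, path) for each text file under root/data.

    Assumption: files are named <textgroup>.<work>.<version>.xml and sit two
    directory levels below data/, as the CapiTainS guidelines describe.
    """
    for path in sorted(Path(root, "data").glob("*/*/*.xml")):
        if path.name == "__cts__.xml":
            continue  # metadata file, not a text
        parts = path.stem.split(".")
        if len(parts) != 3:
            continue  # does not match the assumed naming pattern; skip it
        textgroup, work, version = parts
        yield textgroup, work, version, path


if __name__ == "__main__":
    for textgroup, work, version, path in list_cts_texts("."):
        print(textgroup, work, version, path)
```

Run from the top of a checkout, this prints one line per text. Files that do not match the assumed pattern are skipped silently rather than reported.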
Adding files
Cuneiform transcriptions are normally maintained in the
ATF format.
To add new texts or update the collection, run the transcriptions through
the atf2cts tool from the atf2tei
package.
For example, to convert an ATF file containing one or more tablet transcriptions and add it to the repository:
pip install pipenv # if necessary
git clone https://github.com/cdli-gh/atf2tei
cd atf2tei
pipenv install
pipenv run python atf2cts.py /path/to/your/transcription.atf
mv data/* ../data/
A simple git status should then show the added (or changed) files.
Updating
There is also a script in the update directory which reads the entire CDLI bulk data export and converts a subset of the records, based on a list of CDLI id numbers or particular catalogue field entries.
git clone --depth 1 https://github.com/cdli-gh/data cdli-data
pipenv install pyoracc
PYTHONPATH=$PWD/atf2tei update/cdli2cts.py -d cdli-data -o .
The data repository is quite large. Passing the --depth option
to git downloads only the most recent changes, reducing the size
to several hundred MB.
Until atf2cts is properly packaged, it also needs to be checked
out with git and its location passed through the PYTHONPATH
environment variable. See the "Adding files" section above
for how to do this.
A docker container configuration is included which can be used to set up automatic updates. See update for details.
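The idea of converting only a subset of records, selected by CDLI id, can be pictured with a short sketch. This is not the code in update/cdli2cts.py; it only assumes that each record in a bulk ATF file begins with a header line of the form `&P000001 = designation`, and the function names split_records and select_records are hypothetical.

```python
import re

# Assumption: every record starts with a header line such as
# "&P000001 = Some designation".
HEADER = re.compile(r"^&(P\d+)\b")


def split_records(atf_text):
    """Split bulk ATF text into a dict mapping CDLI id to record text."""
    records = {}
    current_id = None
    lines = []
    for line in atf_text.splitlines():
        match = HEADER.match(line)
        if match:
            # A new header closes the previous record, if any.
            if current_id is not None:
                records[current_id] = "\n".join(lines)
            current_id = match.group(1)
            lines = []
        if current_id is not None:
            lines.append(line)
    if current_id is not None:
        records[current_id] = "\n".join(lines)
    return records


def select_records(atf_text, wanted_ids):
    """Keep only the records whose CDLI id is in wanted_ids."""
    wanted = set(wanted_ids)
    return {k: v for k, v in split_records(atf_text).items() if k in wanted}
```

Any text before the first header line is ignored, and if an id appears more than once the last record wins. Filtering on catalogue fields would need the catalogue data as well, which this sketch does not cover.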
Owner
- Name: CDLI
- Login: cdli-gh
- Kind: organization
- Email: cdli@orinst.ox.ac.uk
- Location: Los Angeles, Oxford, Berlin
- Website: https://cdli.ucla.edu
- Repositories: 83
- Profile: https://github.com/cdli-gh
Committers
Last synced: 8 months ago
Top Committers
| Name | Email | Commits |
|---|---|---|
| Ralph Giles | g****s@t****t | 28 |
| dependabot[bot] | 4****] | 7 |
| CDLI CTS Update | n****y@c****u | 3 |
| Ralph Giles | g****s@d****n | 1 |
Issues and Pull Requests
Last synced: 8 months ago
All Time
- Total issues: 1
- Total pull requests: 12
- Average time to close issues: N/A
- Average time to close pull requests: about 2 months
- Total issue authors: 1
- Total pull request authors: 2
- Average comments per issue: 0.0
- Average comments per pull request: 0.33
- Merged pull requests: 8
- Bot issues: 0
- Bot pull requests: 11
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- rillian (1)
Pull Request Authors
- dependabot[bot] (11)
- rillian (1)
Dependencies
- hooktest >=1.2
- ansicolors ==1.0.2
- cachetools ==5.2.0
- certifi ==2022.6.15
- charset-normalizer ==2.1.0
- decorator ==5.1.1
- frozendict ==2.3.2
- future ==0.18.2
- hooktest ==1.3.1
- idna ==3.3
- importlib-metadata ==4.12.0
- isodate ==0.6.1
- jingtrang ==0.1.1
- linkheader ==0.4.3
- lxml ==4.9.1
- mycapytain ==3.0.2
- prettytable ==0.7.2
- pyld ==2.0.3
- pyparsing ==3.0.9
- rdflib ==6.1.1
- rdflib-jsonld ==0.6.2
- requests ==2.28.1
- setuptools ==63.1.0
- six ==1.16.0
- typing ==3.7.4.3
- typing-extensions ==4.3.0
- urllib3 ==1.26.9
- validators ==0.20.0
- zipp ==3.8.0
- python 3.7-alpine build