https://github.com/acdh-oeaw/arche-assets
Set of static assets used (mainly) for ARCHE data preprocessing or ARCHE information pages:
Science Score: 36.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
✓Committers with academic emails
2 of 5 committers (40.0%) from academic institutions -
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (9.2%) to scientific vocabulary
Keywords
Keywords from Contributors
Repository
Set of static assets used (mainly) for ARCHE data preprocessing or ARCHE information pages:
Basic Info
Statistics
- Stars: 0
- Watchers: 6
- Forks: 0
- Open Issues: 0
- Releases: 52
Topics
Metadata Files
README.md
Arche Assets
Set of static assets used (mainly) for ARCHE data preprocessing or ARCHE information pages:
* URI normalization rules used within the ACDH-CH.\
(stored in AcdhArcheAssets/uriNormRules.json)
* Description of input data formats accepted by ARCHE.\
(stored in AcdhArcheAssets/formats.json)
The repository provides also Python 3 and PHP bindings for accessing those assets.
Installation & usage
Python
- Install using pip3:
bash pip3 install acdh-arche-assets - Use with ```Python from AcdhArcheAssets.urinormrules import getrules, getnormalizeduri, getnormid print(f"{getrules()}")
wrong_id = "http://sws.geonames.org/1232324343/linz.html"
goodid = getnormalizeduri(wrongid) print(good_id) # "https://sws.geonames.org/1232324343/"
# extract ID from URL normid = getnormid("http://sws.geonames.org/1232324343/linz.html") print(normid) # "1232324343"
from AcdhArcheAssets.fileformats import getformats, getbymtype, getbyextension
formats = getformats() matchingmapping = getbymtype('image/png') matchingmapping = getby_extension('png')
```
PHP
- Install using using composer:
bash composer require acdh-oeaw/arche-assets - Usage with ```php require_once 'vendor/autoload.php';
printr(acdhOeaw\UriNormRules::getRules()); printr(acdhOeaw\UriNormRules::getRules(['viaf', 'gnd']));
printr(acdhOeaw\ArcheFileFormats::getAll(); printr(acdhOeaw\ArcheFileFormats::getByMime('application/json'); print_r(acdhOeaw\ArcheFileFormats::getByExtension('application/json'); ```
Description of assets
URI normalization rules
Each rule consists of five properties:
name: a rule namematch: a regular expression matching a given URI namespacereplace: a regular expression replace expression normalizing an URI in a given namespaceresolve: a regular expression replace expression transforming an URI in a given namespace to an URL fetching an RDF dataformat: a RDF serialization format to be requested while resolving the URL produced using theresolvefield
Formats
A curated and growing list of file extensions. For each file extension mappings to the respective ARCHE Resource Type Category (stored in acdh:hasCategory) and Media Type (MIME type) (stored in acdh:hasFormat) are given. The indicated Media Type should only be used as a fallback; it is best practice to rely on automated Media Type detection based on file signatures.
Further information is provided as well.
- fileExtension: File extension to be mapped.
- name: Name(s) the format is known
- archeCategory: The corresponding URI of the ARCHE Resource Type Category Vocabulary
- dataType: A broad category to group formats in; mainly intended for visualisation purposes.
- pronomID: ID(s) assigned by PRONOM
- mimeType: Official Media Type(s) (formerly known as MIME types) registered at IANA.
- informalMimeType: Other MIME types kown for the format
- magicNumber: A constant numerical or text value used to identify a file format, e.g. Wikipedia list of file signatures
- ianaTemplate: Link to template at IANA
- reference: Link(s) to format specifications referenced by IANA and others
- longTerm: Indicates if a format is suitable for long-term preservation.\
Possible values and their meaning
- yes - long-term format
- no - not suitable, another format should be used
- restricted - can be used for long-term preservation in some cases (see comment)
- unsure - status remains to be evaluated
- archeDocs: Link to a place with more information for the format.
- comment: Any other noteworthy information not stated elsewhere.
Developement (Python)
install needed developement packages pip install requirements_dev.txt
linting, tests and testcoverage
- to run the test:
tox - check coverage and create report:
coverage run setup.py testandcoverage html - check linting
flake8
Owner
- Name: Austrian Centre for Digital Humanities & Cultural Heritage
- Login: acdh-oeaw
- Kind: organization
- Email: acdh@oeaw.ac.at
- Location: Vienna, Austria
- Website: https://www.oeaw.ac.at/acdh
- Repositories: 476
- Profile: https://github.com/acdh-oeaw
GitHub Events
Total
- Create event: 7
- Issues event: 3
- Release event: 5
- Delete event: 1
- Issue comment event: 6
- Push event: 12
- Pull request event: 2
Last Year
- Create event: 7
- Issues event: 3
- Release event: 5
- Delete event: 1
- Issue comment event: 6
- Push event: 12
- Pull request event: 2
Committers
Last synced: almost 3 years ago
All Time
- Total Commits: 64
- Total Committers: 5
- Avg Commits per committer: 12.8
- Development Distribution Score (DDS): 0.297
Top Committers
| Name | Commits | |
|---|---|---|
| Mateusz Żółtak | z****k@z****g | 45 |
| csae8092 | p****r@o****t | 15 |
| Norbert Czirjak | n****k@o****t | 2 |
| bellerophons-pegasus | b****s@y****e | 1 |
| Kiki Czeitschner | 5****r@u****m | 1 |
Committer Domains (Top 20 + Academic)
Issues and Pull Requests
Last synced: 7 months ago
All Time
- Total issues: 9
- Total pull requests: 8
- Average time to close issues: 1 day
- Average time to close pull requests: 5 months
- Total issue authors: 2
- Total pull request authors: 3
- Average comments per issue: 1.67
- Average comments per pull request: 1.13
- Merged pull requests: 5
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 4
- Pull requests: 3
- Average time to close issues: 1 day
- Average time to close pull requests: about 8 hours
- Issue authors: 2
- Pull request authors: 1
- Average comments per issue: 2.75
- Average comments per pull request: 1.67
- Merged pull requests: 3
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- csae8092 (7)
- zozlak (2)
Pull Request Authors
- csae8092 (8)
- ghost (2)
Top Labels
Issue Labels
Pull Request Labels
Packages
- Total packages: 3
-
Total downloads:
- pypi 5,008 last-month
- packagist 6,713 total
-
Total dependent packages: 11
(may contain duplicates) -
Total dependent repositories: 6
(may contain duplicates) - Total versions: 149
- Total maintainers: 2
pypi.org: acdh-arche-assets
A set of static assets used (mainly) for ARCHE data preprocessing
- Homepage: https://github.com/acdh-oeaw/arche-assets
- Documentation: https://acdh-arche-assets.readthedocs.io/
- License: MIT License
-
Latest release: 3.25.0
published 7 months ago
Rankings
Maintainers (1)
packagist.org: acdh-oeaw/arche-assets
Set of static assets used (mainly) for ARCHE data preprocessing
- Homepage: https://github.com/acdh-oeaw/arche-assets
- License: MIT
-
Latest release: 3.25.0
published 7 months ago
Rankings
Maintainers (1)
Funding
packagist.org: acdh-oeaw/uri-norm-rules
Set of static assets used (mainly) for ARCHE data preprocessing
- Homepage: https://github.com/acdh-oeaw/arche-assets
- License: MIT
-
Latest release: 3.25.0
published 7 months ago
Rankings
Maintainers (1)
Funding
Dependencies
- phpunit/phpunit ^9.5 development
- php >= 7.0
- coverage ==5.5 development
- flake8 ==3.9.0 development
- tox ==3.23.0 development
- actions/cache v2 composite
- actions/checkout v2 composite
- actions/checkout v2 composite
- actions/setup-python v1 composite
- py-actions/flake8 v1 composite
- actions/checkout v2 composite
- actions/setup-python v2 composite
- actions/checkout v3 composite
- actions/setup-python v4 composite
- codecov/codecov-action v3 composite