https://github.com/acdh-oeaw/arche-assets

Set of static assets used (mainly) for ARCHE data preprocessing or ARCHE information pages.

Science Score: 36.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
    2 of 5 committers (40.0%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (9.2%) to scientific vocabulary

Keywords

arche

Keywords from Contributors

ontology
Last synced: 6 months ago

Repository

Set of static assets used (mainly) for ARCHE data preprocessing or ARCHE information pages.

Basic Info
  • Host: GitHub
  • Owner: acdh-oeaw
  • License: mit
  • Language: PHP
  • Default Branch: master
  • Homepage:
  • Size: 290 KB
Statistics
  • Stars: 0
  • Watchers: 6
  • Forks: 0
  • Open Issues: 0
  • Releases: 52
Topics
arche
Created over 5 years ago · Last pushed 8 months ago
Metadata Files
Readme License

README.md

Arche Assets

Set of static assets used (mainly) for ARCHE data preprocessing or ARCHE information pages:

  • URI normalization rules used within the ACDH-CH (stored in AcdhArcheAssets/uriNormRules.json)
  • Description of input data formats accepted by ARCHE (stored in AcdhArcheAssets/formats.json)

The repository also provides Python 3 and PHP bindings for accessing those assets.

Installation & usage

Python

  • Install using pip3: `pip3 install acdh-arche-assets`
  • Use with:

```python
from AcdhArcheAssets.uri_norm_rules import get_rules, get_normalized_uri, get_norm_id
from AcdhArcheAssets.file_formats import get_formats, get_by_mtype, get_by_extension

# print all URI normalization rules
print(f"{get_rules()}")

# normalize a URI
wrong_id = "http://sws.geonames.org/1232324343/linz.html"
good_id = get_normalized_uri(wrong_id)
print(good_id)  # "https://sws.geonames.org/1232324343/"

# extract the ID from a URL
norm_id = get_norm_id("http://sws.geonames.org/1232324343/linz.html")
print(norm_id)  # "1232324343"

# look up file format descriptions
formats = get_formats()
matching_mapping = get_by_mtype('image/png')
matching_mapping = get_by_extension('png')
```

PHP

  • Install using composer: `composer require acdh-oeaw/arche-assets`
  • Use with:

```php
require_once 'vendor/autoload.php';

// URI normalization rules
print_r(acdhOeaw\UriNormRules::getRules());
print_r(acdhOeaw\UriNormRules::getRules(['viaf', 'gnd']));

// file format descriptions
print_r(acdhOeaw\ArcheFileFormats::getAll());
print_r(acdhOeaw\ArcheFileFormats::getByMime('application/json'));
print_r(acdhOeaw\ArcheFileFormats::getByExtension('json'));
```

Description of assets

URI normalization rules

Each rule consists of five properties:

  • name: a rule name
  • match: a regular expression matching a given URI namespace
  • replace: a regular expression replacement string normalizing a URI in a given namespace
  • resolve: a regular expression replacement string transforming a URI in a given namespace into a URL from which RDF data can be fetched
  • format: an RDF serialization format to be requested when resolving the URL produced using the resolve field
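
To illustrate how the five properties fit together, here is a minimal Python sketch; it is not the package's implementation. The rule shown is hypothetical: its regular expressions were written for this example and are not copied from uriNormRules.json. The helper names `find_rule`, `normalize` and `resolve_request` are likewise assumptions, as is the use of `\1`-style backreferences in the replacement strings.

```python
import re
from typing import Optional, Tuple

# Hypothetical rule in the five-property shape described above.
# The patterns are illustrative, not the ones shipped in uriNormRules.json.
RULES = [
    {
        "name": "geonames",
        "match": r"^https?://(?:www\.|sws\.)?geonames\.org/([0-9]+)(?:/.*)?$",
        "replace": r"https://sws.geonames.org/\1/",
        "resolve": r"https://sws.geonames.org/\1/about.rdf",
        "format": "application/rdf+xml",
    }
]


def find_rule(uri: str, rules=RULES) -> Optional[dict]:
    """Return the first rule whose 'match' pattern matches the URI."""
    for rule in rules:
        if re.search(rule["match"], uri):
            return rule
    return None


def normalize(uri: str, rules=RULES) -> str:
    """Apply 'replace' of the matching rule; leave unknown URIs untouched."""
    rule = find_rule(uri, rules)
    if rule is None:
        return uri
    return re.sub(rule["match"], rule["replace"], uri, count=1)


def resolve_request(uri: str, rules=RULES) -> Optional[Tuple[str, str]]:
    """Return (URL to fetch, RDF format to request) or None if no rule matches."""
    rule = find_rule(uri, rules)
    if rule is None:
        return None
    url = re.sub(rule["match"], rule["resolve"], uri, count=1)
    return url, rule["format"]


print(normalize("http://sws.geonames.org/1232324343/linz.html"))
```

With this hypothetical rule the call prints `https://sws.geonames.org/1232324343/`, the same normalized form as in the usage example above. URIs outside any known namespace are returned unchanged.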

Formats

A curated and growing list of file extensions. For each file extension, mappings to the respective ARCHE Resource Type Category (stored in acdh:hasCategory) and Media Type (MIME type) (stored in acdh:hasFormat) are given. The indicated Media Type should only be used as a fallback; it is best practice to rely on automated Media Type detection based on file signatures.

Further information is provided as well.

  • fileExtension: File extension to be mapped.
  • name: Name(s) the format is known by
  • archeCategory: The corresponding URI of the ARCHE Resource Type Category Vocabulary
  • dataType: A broad category to group formats in; mainly intended for visualisation purposes.
  • pronomID: ID(s) assigned by PRONOM
  • mimeType: Official Media Type(s) (formerly known as MIME types) registered at IANA.
  • informalMimeType: Other MIME types known for the format
  • magicNumber: A constant numerical or text value used to identify a file format; see e.g. the Wikipedia list of file signatures
  • ianaTemplate: Link to template at IANA
  • reference: Link(s) to format specifications referenced by IANA and others
  • longTerm: Indicates if a format is suitable for long-term preservation. Possible values and their meaning:
    • yes - long-term format
    • no - not suitable, another format should be used
    • restricted - can be used for long-term preservation in some cases (see comment)
    • unsure - status remains to be evaluated
  • archeDocs: Link to a place with more information for the format.
  • comment: Any other noteworthy information not stated elsewhere.
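
The recommendation above (detect the Media Type from the file signature first and use the extension mapping only as a fallback) can be sketched in Python as follows. This is an illustration under assumptions, not the package's code. The two entries use a simplified shape with single string values and illustrative `longTerm` values that are not taken from formats.json, the `magicNumber` is assumed to be a space-separated hex string, and the function names `sniff_mime` and `mime_for` are hypothetical.

```python
from pathlib import Path
from typing import Optional

# Simplified, hypothetical entries using a subset of the fields listed above.
FORMATS = [
    {
        "fileExtension": "png",
        "mimeType": "image/png",
        "magicNumber": "89 50 4E 47 0D 0A 1A 0A",  # PNG file signature
        "longTerm": "yes",  # illustrative value
    },
    {
        "fileExtension": "json",
        "mimeType": "application/json",
        "magicNumber": "",  # plain-text format, no signature
        "longTerm": "yes",  # illustrative value
    },
]


def sniff_mime(data: bytes, formats=FORMATS) -> Optional[str]:
    """Detect the media type from the leading bytes (file signature)."""
    for entry in formats:
        signature = bytes.fromhex(entry["magicNumber"])
        if signature and data.startswith(signature):
            return entry["mimeType"]
    return None


def mime_for(filename: str, data: bytes, formats=FORMATS) -> Optional[str]:
    """Prefer signature-based detection; fall back to the extension mapping."""
    detected = sniff_mime(data, formats)
    if detected is not None:
        return detected
    extension = Path(filename).suffix.lstrip(".").lower()
    for entry in formats:
        if entry["fileExtension"] == extension:
            return entry["mimeType"]
    return None


# A PNG file with a misleading extension is still recognised by its signature.
print(mime_for("picture.dat", b"\x89PNG\r\n\x1a\n" + b"\x00" * 8))
```

In practice a dedicated detection tool would take the place of `sniff_mime`; the point of the sketch is only the order of the two lookups.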

Development (Python)

Install the required development packages: `pip install -r requirements_dev.txt`

Linting, tests and test coverage

  • run the tests: `tox`
  • check coverage and create a report: `coverage run setup.py test`, then `coverage html`
  • check linting: `flake8`

Owner

  • Name: Austrian Centre for Digital Humanities & Cultural Heritage
  • Login: acdh-oeaw
  • Kind: organization
  • Email: acdh@oeaw.ac.at
  • Location: Vienna, Austria

GitHub Events

Total
  • Create event: 7
  • Issues event: 3
  • Release event: 5
  • Delete event: 1
  • Issue comment event: 6
  • Push event: 12
  • Pull request event: 2
Last Year
  • Create event: 7
  • Issues event: 3
  • Release event: 5
  • Delete event: 1
  • Issue comment event: 6
  • Push event: 12
  • Pull request event: 2

Committers

Last synced: almost 3 years ago

All Time
  • Total Commits: 64
  • Total Committers: 5
  • Avg Commits per committer: 12.8
  • Development Distribution Score (DDS): 0.297
Top Committers
Name Email Commits
Mateusz Żółtak z****k@z****g 45
csae8092 p****r@o****t 15
Norbert Czirjak n****k@o****t 2
bellerophons-pegasus b****s@y****e 1
Kiki Czeitschner 5****r@u****m 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 7 months ago

All Time
  • Total issues: 9
  • Total pull requests: 8
  • Average time to close issues: 1 day
  • Average time to close pull requests: 5 months
  • Total issue authors: 2
  • Total pull request authors: 3
  • Average comments per issue: 1.67
  • Average comments per pull request: 1.13
  • Merged pull requests: 5
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 4
  • Pull requests: 3
  • Average time to close issues: 1 day
  • Average time to close pull requests: about 8 hours
  • Issue authors: 2
  • Pull request authors: 1
  • Average comments per issue: 2.75
  • Average comments per pull request: 1.67
  • Merged pull requests: 3
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • csae8092 (7)
  • zozlak (2)
Pull Request Authors
  • csae8092 (8)
  • ghost (2)
Top Labels
Issue Labels
enhancement (2) bug (1)
Pull Request Labels

Packages

  • Total packages: 3
  • Total downloads:
    • pypi 5,008 last-month
    • packagist 6,713 total
  • Total dependent packages: 11
    (may contain duplicates)
  • Total dependent repositories: 6
    (may contain duplicates)
  • Total versions: 149
  • Total maintainers: 2
pypi.org: acdh-arche-assets

A set of static assets used (mainly) for ARCHE data preprocessing

  • Versions: 43
  • Dependent Packages: 2
  • Dependent Repositories: 4
  • Downloads: 5,008 Last month
Rankings
Dependent packages count: 4.7%
Dependent repos count: 7.5%
Downloads: 8.6%
Average: 16.5%
Forks count: 22.6%
Stargazers count: 38.8%
Maintainers (1)
Last synced: 6 months ago
packagist.org: acdh-oeaw/arche-assets

Set of static assets used (mainly) for ARCHE data preprocessing

  • Versions: 53
  • Dependent Packages: 8
  • Dependent Repositories: 2
  • Downloads: 6,056 Total
Rankings
Dependent packages count: 2.4%
Forks count: 16.4%
Dependent repos count: 18.5%
Average: 18.8%
Downloads: 19.9%
Stargazers count: 37.1%
Maintainers (1)
Funding
Last synced: 6 months ago
packagist.org: acdh-oeaw/uri-norm-rules

Set of static assets used (mainly) for ARCHE data preprocessing

  • Versions: 53
  • Dependent Packages: 1
  • Dependent Repositories: 0
  • Downloads: 657 Total
Rankings
Dependent packages count: 9.7%
Forks count: 24.6%
Average: 25.3%
Downloads: 26.1%
Stargazers count: 32.6%
Dependent repos count: 33.5%
Maintainers (1)
Funding
Last synced: 6 months ago

Dependencies

composer.json packagist
  • phpunit/phpunit ^9.5 development
  • php >= 7.0
requirements_dev.txt pypi
  • coverage ==5.5 development
  • flake8 ==3.9.0 development
  • tox ==3.23.0 development
.github/workflows/php.yml actions
  • actions/cache v2 composite
  • actions/checkout v2 composite
.github/workflows/pylint.yml actions
  • actions/checkout v2 composite
  • actions/setup-python v1 composite
  • py-actions/flake8 v1 composite
.github/workflows/pypi.yml actions
  • actions/checkout v2 composite
  • actions/setup-python v2 composite
.github/workflows/pytest.yml actions
  • actions/checkout v3 composite
  • actions/setup-python v4 composite
  • codecov/codecov-action v3 composite
setup.py pypi