Recent Releases of https://github.com/acdh-oeaw/arche-metadata-crawler

https://github.com/acdh-oeaw/arche-metadata-crawler - Bugfixes

EntityListWorksheet::loadEntities(): include also the last entity

- PHP
Published by zozlak 8 months ago

https://github.com/acdh-oeaw/arche-metadata-crawler - Bugfixes

- PHP
Published by zozlak 8 months ago

https://github.com/acdh-oeaw/arche-metadata-crawler - Error log fixes

- PHP
Published by zozlak 8 months ago

https://github.com/acdh-oeaw/arche-metadata-crawler - MetadataChecker - switch to doorkeeper tests

So far the tests were implemented locally. Now the doorkeeper tests are reused which should assure better compatibility with errors reported upon an actual ingestion

- PHP
Published by zozlak about 1 year ago

https://github.com/acdh-oeaw/arche-metadata-crawler - arche-crawl-meta - add --errorLog parameter

If the --errorLog is passed, an CSV error log is created which is easier to inspect than the full text log.

- PHP
Published by zozlak about 1 year ago

https://github.com/acdh-oeaw/arche-metadata-crawler - MetadataChecker tuning

Relax a check of unresolvable object URIs existing as a subject in the dataset

- PHP
Published by zozlak over 1 year ago

https://github.com/acdh-oeaw/arche-metadata-crawler - Bugfixes

Harden for dates-in-spreadsheet corner cases

- PHP
Published by zozlak over 1 year ago

https://github.com/acdh-oeaw/arche-metadata-crawler - Bugfixes

Avoid multiplying license triples in the output if the license triple object was changed by the MetadataCrawler::mapVocabularies().

- PHP
Published by zozlak over 1 year ago

https://github.com/acdh-oeaw/arche-metadata-crawler - Bugfixes

- PHP
Published by zozlak over 1 year ago

https://github.com/acdh-oeaw/arche-metadata-crawler - named entities reading tuning

Allow an entity to have multiple labels.

- PHP
Published by zozlak over 1 year ago

https://github.com/acdh-oeaw/arche-metadata-crawler - Bugfixes

Fixed the bug leading to existence of <any subject> acdh:hasIdentifier owl:Thing/acdh:Resource/acdh:Collection triples.

- PHP
Published by zozlak over 1 year ago

https://github.com/acdh-oeaw/arche-metadata-crawler - Horizontal templates generation tuning

  • allow multiple values for properties with maximum cardinality of 1 with lang-tagged values
  • allow of to 20 values for properties with unbounded max cardinality
  • use ACDH-blue everywhere

- PHP
Published by zozlak over 1 year ago

https://github.com/acdh-oeaw/arche-metadata-crawler - Tuning

MetadataCrawler: avoid single subject metadata split in the output file

- PHP
Published by zozlak almost 2 years ago

https://github.com/acdh-oeaw/arche-metadata-crawler - tuning

  • Checking progress bar added
  • Computed spreadsheet values are always used (allows using formulas in metadata spreadsheets)
  • Vocabulary values are mapped to URIs also for horizontal metadata files
  • After parsing all metadata inputs non-http object values are mapped against the named entities database

- PHP
Published by zozlak almost 2 years ago

https://github.com/acdh-oeaw/arche-metadata-crawler - tuning

  • MetadataChecker:
    • use in-memory UriNormCache to speed up named entities resolution checks
    • do not reports errors if a missing property has a default value
  • MetadataRdf: for each subject in the dataset being read add a <sbj> <id> <sbj> triple

- PHP
Published by zozlak almost 2 years ago

https://github.com/acdh-oeaw/arche-metadata-crawler - tuning

  • arche-crawl-meta and arche-check-meta exit with code 2 if check errors were spotted
  • multiple RDF types per resource is reported as an error
  • order of triples if preserved while graph removal

- PHP
Published by zozlak almost 2 years ago

https://github.com/acdh-oeaw/arche-metadata-crawler - arche-crawl-meta tuning

  • skip graph before performing checks (and generating output) to avoid strange corner cases
  • add --noCheck option to the bin/arche-crawl-meta
  • include entities data collected from the RDF files in the output (previously only data from named entities file were included)

- PHP
Published by zozlak almost 2 years ago

https://github.com/acdh-oeaw/arche-metadata-crawler - Docker image tuning

- PHP
Published by zozlak almost 2 years ago

https://github.com/acdh-oeaw/arche-metadata-crawler - Bugfixes

- PHP
Published by zozlak almost 2 years ago

https://github.com/acdh-oeaw/arche-metadata-crawler - named entities checking

- PHP
Published by zozlak almost 2 years ago

https://github.com/acdh-oeaw/arche-metadata-crawler - Bugfixes

- PHP
Published by zozlak almost 2 years ago

https://github.com/acdh-oeaw/arche-metadata-crawler - Tuning

TemplateCreator::processValidation(): do not enforce any format on object property values

- PHP
Published by zozlak about 2 years ago

https://github.com/acdh-oeaw/arche-metadata-crawler - 0.2.0

Using arche-lib-ingest 4 and arche-lib-schema 7

- PHP
Published by zozlak about 2 years ago