Recent Releases of dataverse-metadata-crawler
dataverse-metadata-crawler - v0.1.5
What's Changed
1. Bug fixes
- Fixed subject columns (`CM_Subject_*`) wrongly being `False` in spreadsheet #20
- Fixed `RequestAccess` and `CM_ProdAbbrev` spelling in spreadsheet #20
Full Changelog: https://github.com/scholarsportal/dataverse-metadata-crawler/compare/v0.1.4...v0.1.5
- Python
Published by kenlhlui 11 months ago
dataverse-metadata-crawler - v0.1.4
1. Feature updates
- Added a count of the deaccessioned/draft datasets crawled to the log.
- Added an end-of-crawling message (✅ Crawling process completed successfully.)
2. Bug fixes
- Removed deaccessioned/draft dataset metadata from `failed_metadata_uris_yyyymmdd-HHMMSS.json`. These metadata records will now only be shown in `pid_dict_dd_yyyymmdd-HHMMSS.json`.
- Removed non-created JSON file output listed in the log.
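The `yyyymmdd-HHMMSS` suffix in the file names above is a timestamp pattern. As a minimal sketch of how such a name could be built (the helper `timestamped_name` is hypothetical and not taken from the tool's source):

```python
from datetime import datetime


def timestamped_name(prefix: str, when: datetime, ext: str = "json") -> str:
    """Build a name like 'prefix_yyyymmdd-HHMMSS.json' (hypothetical helper)."""
    # %Y%m%d-%H%M%S renders the yyyymmdd-HHMMSS pattern used in the file names.
    return f"{prefix}_{when.strftime('%Y%m%d-%H%M%S')}.{ext}"


print(timestamped_name("failed_metadata_uris", datetime(2025, 1, 31, 9, 5, 7)))
# -> failed_metadata_uris_20250131-090507.json
```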
Full Changelog: https://github.com/scholarsportal/dataverse-metadata-crawler/compare/v0.1.3...v0.1.4
- Python
Published by kenlhlui 12 months ago
dataverse-metadata-crawler - v0.1.3
1. Feature updates
- Renamed `example.ipynb` to `colud_cli.ipynb` to better represent the use of the notebook.
- Updated `colud_cli.ipynb` to support interactive `BASE_URL` and `API_KEY` input, for creating the `.env` file
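Creating the `.env` file from the two inputs might look roughly like the sketch below. Only the variable names `BASE_URL` and `API_KEY` come from the release notes; the helper `write_env` and the quoted `KEY="value"` layout are assumptions, and the notebook's actual code may differ.

```python
from pathlib import Path


def write_env(base_url: str, api_key: str, path: str = ".env") -> str:
    """Write BASE_URL and API_KEY to a .env-style file and return its text.

    Hypothetical helper; the quoting style is an assumption.
    """
    text = f'BASE_URL="{base_url.strip()}"\nAPI_KEY="{api_key.strip()}"\n'
    Path(path).write_text(text, encoding="utf-8")
    return text


# Interactive use in a notebook cell (left commented so the sketch does not block):
# from getpass import getpass
# write_env(input("BASE_URL: "), getpass("API_KEY: "))
```

Reading the key with `getpass` instead of `input` keeps it from being echoed into the notebook output.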
2. Others
- Updated the `poetry-export_dependencies.yml` (GitHub workflow file) to update the `requirements.txt` and `poetry.lock` files in a CI/CD manner.
Full Changelog: https://github.com/scholarsportal/dataverse-metadata-crawler/compare/v0.1.2...v0.1.3
- Python
Published by kenlhlui about 1 year ago
dataverse-metadata-crawler - v0.1.2
1. Feature updates
- Added `example.ipynb` for launching the tool from a notebook; no Git or Python install required.
- Updated connection checking. If the `API_KEY` entered by the user is invalid, the tool now falls back to an unauthenticated connection for crawling.
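The fallback described above can be sketched as follows. This is an illustration under stated assumptions, not the tool's code: `build_headers` and the `key_is_valid` callback are hypothetical, and in practice the validity check would be a test request against the Dataverse API. The `X-Dataverse-key` header is the one the Dataverse API uses for API tokens.

```python
from typing import Callable, Dict, Optional


def build_headers(
    api_key: Optional[str], key_is_valid: Callable[[str], bool]
) -> Dict[str, str]:
    """Return request headers, leaving out the API key when it fails validation."""
    base = {"Accept": "application/json"}
    if api_key and key_is_valid(api_key):
        # Authenticated crawl: send the token in the X-Dataverse-key header.
        return {**base, "X-Dataverse-key": api_key}
    # Invalid or missing key: crawl unauthenticated instead of aborting.
    return base


# Stand-in validator for illustration; a real one would call the API.
headers = build_headers("bad-key", key_is_valid=lambda key: key == "good-key")
print(headers)  # {'Accept': 'application/json'}
```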
2. Others
- Moved the definition of the headers used for GET requests to `MetaDataCrawler`.
Full Changelog: https://github.com/scholarsportal/dataverse-metadata-crawler/compare/v0.1.1...v0.1.2
- Python
Published by kenlhlui about 1 year ago
dataverse-metadata-crawler - v0.1.1
1. Schema changes
- The key for `ds_metadata` in the dataset will now use dataset IDs (unique identifiers for each dataset version in the Dataverse system). Example:
```
# Old version
"doi:10.5072/FK2/DUGFC4": { # datasetPersistentId
    "status": "OK",
    "data": {
        "id": 850,
        "datasetId": 2663,
        "datasetPersistentId": "doi:10.5072/FK2/DUGFC4",
        ...

# New version
{
    "2663": { # datasetId
        "status": "OK",
        "data": {
            "id": 850,
            "datasetId": 2663,
            "datasetPersistentId": "doi:10.5072/FK2/DUGFC4",
            ...
```
- `ds_metadata_yyyymmdd-HHMMSS.json` now contains `data`, `path_info` and `permission_info` at the second level.
```
{
    ...
    "status": "OK",
    "data": {
        ...
    },
    "pathinfo": {
        ...
    },
    "permissioninfo": {
        ...
    },
```
- Changes to the following fields in `path_info` for consistency with the new schema:
  - `collectionalias` -> `CollectionAlias`
  - `collectionid` -> `CollectionID`
  - `pid` -> `datasetPersistentId`
  - `dsid` -> `datasetId`
  - `pathids` -> `path_ids`
```
# Old version
...
"pathinfo": {
    "collectionalias": "toronto",
    "collectionid": 22,
    "pid": "doi:10.5072/FK2/DUGFC4",
    "dsid": 2663,
    "path": "/Nick Field Dataverse",
    "path_ids": [
        2641
    ]
}

# New version
...
"path_info": {
    "CollectionAlias": "toronto",
    "CollectionID": 22,
    "datasetPersistentId": "doi:10.5072/FK2/DUGFC4",
    "datasetId": 2663,
    "path": "/Nick Field Dataverse",
    "pathIds": [
        2641
    ]
}
```
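For anyone holding output files in the older layout, the re-keying and the `path_info` renames listed above could be applied with a small script. This is a sketch only: the function names are hypothetical, the release does not say that the tool ships a migration, and the rename table simply copies the mapping from the list above.

```python
from typing import Any, Dict

# Field renames in path_info, copied from the mapping in the release notes.
PATH_INFO_RENAMES = {
    "collectionalias": "CollectionAlias",
    "collectionid": "CollectionID",
    "pid": "datasetPersistentId",
    "dsid": "datasetId",
    "pathids": "path_ids",
}


def rekey_by_dataset_id(old: Dict[str, Dict[str, Any]]) -> Dict[str, Dict[str, Any]]:
    """Re-key records from datasetPersistentId to str(datasetId)."""
    return {str(rec["data"]["datasetId"]): rec for rec in old.values()}


def rename_path_info(path_info: Dict[str, Any]) -> Dict[str, Any]:
    """Apply the renames; keys without a mapping pass through unchanged."""
    return {PATH_INFO_RENAMES.get(key, key): value for key, value in path_info.items()}


old = {
    "doi:10.5072/FK2/DUGFC4": {
        "status": "OK",
        "data": {
            "id": 850,
            "datasetId": 2663,
            "datasetPersistentId": "doi:10.5072/FK2/DUGFC4",
        },
    }
}
print(list(rekey_by_dataset_id(old)))  # ['2663']
print(rename_path_info({"collectionalias": "toronto", "dsid": 2663}))
# {'CollectionAlias': 'toronto', 'datasetId': 2663}
```

The new keys are strings because JSON object keys are always strings, which matches the `"2663"` key in the example above.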
2. Feature updates
- Combined the representation (`-d`) and permission (`-p`) metadata into `ds_metadata_yyyymmdd-HHMMSS.json` as a single JSON file.
- Added counts of the following dataset permission roles (`DS_Collab`, `DS_Admin`, `DS_Contrib`, `DS_ContribPlus`, `DS_Curator`, `DS_FileDown`, `DS_Member`) to the spreadsheet output
  - Only available if `-p` is enabled
3. Bug Fixes
- Corrected spelling mistakes in the README file.
- Restored missing fields for representation metadata in the spreadsheet:
  - `TermsOfUse`
  - `CM_AuthorAff`
  - `CM_TimeEnd`
  - `CM_CollectionStart`
  - `CM_CollectionEnd`
- Fixed handling of `-f` responses with `None` objects.
- Python
Published by kenlhlui about 1 year ago
dataverse-metadata-crawler - v0.1.0
- Initial release
Full Changelog: https://github.com/scholarsportal/dataverse-metadata-crawler/commits/v0.1.0
- Python
Published by kenlhlui about 1 year ago