flatjacs-3.0

A flattened CSV dataset of JACS 3.0 subject areas, principals subjects and detailed four-digit subject codes

https://github.com/sdruskat/flatjacs-3.0

Science Score: 77.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 3 DOI reference(s) in README
  • Academic publication links
    Links to: zenodo.org
  • Committers with academic emails
    1 of 1 committers (100.0%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (5.7%) to scientific vocabulary
Last synced: 10 months ago · JSON representation ·

Repository

A flattened CSV dataset of JACS 3.0 subject areas, principals subjects and detailed four-digit subject codes

Basic Info
  • Host: GitHub
  • Owner: sdruskat
  • License: other
  • Default Branch: main
  • Size: 184 KB
Statistics
  • Stars: 0
  • Watchers: 2
  • Forks: 0
  • Open Issues: 0
  • Releases: 3
Created about 4 years ago · Last pushed about 4 years ago
Metadata Files
Readme License Citation

README.md

flatJACS 3.0

DOI

A dataset representing JACS 3.0 subject areas, principal subjects, and detailed subject codes in a unified dataset, in CSV and JSON.

Data files

The dataset is contained in single files in CSV and JSON: flatJACS_3.0.csv, flatJACS_3.0.json.

Data set documentation

CSV

| Column name | Description | | -- | -- | |area_code_per_principal_codes_website|The JACS 3.0 subject area code as listed on the principal subject codes website| |JACS_3.0_subject_area|The JACS 3.0 subject area| |principal_code|The JACS 3.0 principal subject code as listed on the principal subject codes website| |JACS_3.0_principal_subject|The JACS 3.0 principal subject| |area_code_per_detailed_codes_website|The JACS 3.0 subject area code as listed on the detailed subject codes website| |area_per_detailed_codes_website|The JACS 3.0 subject area as listed on the detailed subject codes website| |detailed_code|The JACS 3.0 detailed four digit code| |JACS_3.0_detailed_subject|The JACS 3.0 detailed subject| |flatJACS_3.0_detailed_subject|The flatJACS 3.0 detailed subject|

NOTE: The flatJACS 3.0 detailed subject is an internationalized description of the JACS 3.0 detailed subject. In this version of the dataset, the two columns differ in one value only: - Q160 is British Sign Language in JACS 3.0, and Sign Languages in flatJACS 3.0, to encompass all sign languages, rather than just the British Sign Language. This differentiation is necessary to make flatJACS 3.0 applicable to subject categorization outside of the United Kingdom (for which the JACS 3.0 codes have originally been created).

JSON

The JSON file has the following structure:

```json

{ "areas": [ { "code": , "name": , "subareas": [ { "code": , "name": , "principal-subjects": [ { "code": , "name": , "detailed-codes": [ { "code": , "JACS-code": , "flatJACS-code": }, ... ] }, ... ] }, ... ] }, ... ] } ```

  • "areas" describe areas as per the principal subject codes website.
  • "subareas" describe subareas. These refer to areas as per the detailed subject codes website. Often, these are 1:1 equivalents to "areas".
  • "principal-subjects" describe principal subjects as per the principal subject codes website|
  • "detailed-codes" describe detailed subjects, and provide a code as well as the original JACS as well as the adapted flatJACS name for the subject.

Attribution

The data sources for this dataset are listed below. The JACS 3.0 coding frame by HESA is published under a Creative Commons Attribution 4.0 International license.

| Data source | Description | | -- | -- | | Principal subject codes website | Provides the JACS 3.0 principal subject codes. | | Detailed subject codes website | Provides the JACS 3.0 detailed (four digit) subject codes. | | Detailed subject codes CSV file | Provides the JACS 3.0 detailed (four digit) subject codes in CSV format. |

Dataset creation

The flatJACS 3.0 dataset has been created by manually splitting codes and names for subject areas, principal subjects and detailed codes, and mapping them into a spreadsheet. The spreadsheet has then been saved as a ,-separated CSV file encoded in UTF-8.

Disclaimer

This dataset has been created to the best of the knowledge and capabilities of the contributors. The contributors and maintainers of the project cannot make any guarantees as to the correctness and accuracy of the dataset. Use this dataset at your own risk. We welcome feedback to improve the dataset. See the Contributing section for details.

License

CC BY 4.0 badge

This dataset is distributed using the Creative Commons Attribution 4.0 International (CC BY 4.0) license. Please provide attribution.

The JACS 3.0 coding frame by HESA, that this dataset remixes, is published under a Creative Commons Attribution 4.0 International license.

Maintainers

This dataset is maintained by Stephan Druskat (@sdruskat).

Contributing

To provide feedback, report issues and suggest improvement, please create a new GitHub issue in this repository.

Owner

  • Name: Stephan Druskat
  • Login: sdruskat
  • Kind: user
  • Location: Berlin
  • Company: German Aerospace Center (DLR)

Software Engineering PhD candidate @DLR-SC, Research Software Engineer (https://hexatomic.github.io)

Citation (CITATION.cff)

# This CITATION.cff file was generated with cffinit.
# Visit https://bit.ly/cffinit to generate yours today!

cff-version: 1.2.0
title: flatJACS 3.0
message: Please cite this dataset using these metadata.
type: dataset
authors:
  - given-names: Stephan
    family-names: Druskat
    email: stephan.druskat@dlr.de
    affiliation: >-
      German Aerospace Center (DLR),
      Humboldt-Universität zu Berlin
    orcid: 'https://orcid.org/0000-0003-4925-7248'
identifiers:
  - type: doi
    value: 10.5281/zenodo.6554291
    description: The DOI resolving to version 1.1.0 of flatJACS 3.0
repository-code: 'https://github.com/sdruskat/flatJACS-3.0'
abstract: >-
  A CSV and JSON dataset representing JACS 3.0 subject areas,
  principal subjects, and detailed subject codes in a
  unified dataset.
keywords:
  - research domains
  - taxonomy
  - JSON-LD
  - JACS
  - JACS 3.0
license: CC-BY-4.0
version: '1.1.0'
date-released: '2022-05-16'

GitHub Events

Total
Last Year

Committers

Last synced: about 1 year ago

All Time
  • Total Commits: 26
  • Total Committers: 1
  • Avg Commits per committer: 26.0
  • Development Distribution Score (DDS): 0.0
Past Year
  • Commits: 0
  • Committers: 0
  • Avg Commits per committer: 0.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Stephan Druskat s****t@d****e 26
Committer Domains (Top 20 + Academic)
dlr.de: 1

Issues and Pull Requests

Last synced: about 1 year ago

All Time
  • Total issues: 0
  • Total pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Total issue authors: 0
  • Total pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels