csdmpy

The Core Scientific Dataset Model (CSDM): A versatile and light-weight file-format for scientific datasets.

https://github.com/deepanshs/csdmpy

Science Score: 39.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 4 DOI reference(s) in README
  • Academic publication links
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (7.3%) to scientific vocabulary

Keywords

core csdf csdm datasets model python scientific

Keywords from Contributors

interactive serializer packaging network-simulation shellcodes hacking autograding observability genomics embedded
Last synced: 6 months ago · JSON representation

Repository

The Core Scientific Dataset Model (CSDM): A versatile and light-weight file-format for scientific datasets.

Basic Info
  • Host: GitHub
  • Owner: deepanshs
  • License: bsd-3-clause
  • Language: Python
  • Default Branch: master
  • Homepage: https://csdmpy.readthedocs.io
  • Size: 31 MB
Statistics
  • Stars: 17
  • Watchers: 2
  • Forks: 4
  • Open Issues: 5
  • Releases: 21
Topics
core csdf csdm datasets model python scientific
Created over 7 years ago · Last pushed 9 months ago
Metadata Files
Readme Changelog License

readme.md

The csdmpy project

| | | | ------------ | ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | | Deployment | PyPI version PyPI - Python Version | | Build Status | Github workflow Documentation Status | | License | License | | Metrics | codecov GitHub issues | | Citation | DOI |

The csdmpy package is a Python support for the core scientific dataset (CSD) model file exchange-format. The package is based on the core scientific dataset (CSD) model which is designed as a building block in the development of a more sophisticated portable scientific dataset file standard. The CSD model is capable of handling a wide variety of scientific datasets both within and across disciplinary fields.

The main objective of this python package is to facilitate an easy import and export of the CSD model serialized JSON files for Python users. The package utilizes Numpy library and, therefore, offers the end users versatility to process or visualize the imported datasets with any third party package(s) compatible with Numpy.

For further reading, refer to the documentation.

See example gallery

View

The core scientific dataset (CSD) model

The core scientific dataset (CSD) model is a light-weight, portable, versatile, and standalone data model capable of handling a variety of scientific datasets. The model only encapsulates data values and the minimum metadata, to accurately represent a p-component dependent variable, discretely sampled at M unique points in a d-dimensional coordinate space. The model is not intended to encapsulate any information on how the data might be acquired, processed, or visualized.


Use cases

The data model is versatile in allowing many use cases for most spectroscopy, diffraction, and imaging techniques.

Data Model

The model supports multi-component datasets associated with continuous physical quantities that are discretely sampled in a multi-dimensional space associated with other carefully controlled quantities, for e.g., a mass as a function of temperature, a current as a function of voltage and time, a signal voltage as a function of magnetic field gradient strength, a color image with a red, green, and blue (RGB) light intensity components as a function of two independent spatial dimensions, or the six components of the symmetric second-rank diffusion tensor MRI as a function of three independent spatial dimensions. Additionally, the model supports multiple dependent variables sharing the same d-dimensional coordinate space. For instance, the simultaneous measurement of current and voltage as a function of time. Another example would be the simultaneous acquisition of air temperature, pressure, wind velocity, and solar-flux as a function of Earth’s latitude and longitude coordinates. We refer to these dependent variables as correlated-datasets.

Example py "csdm": { "version": "1.0", # A list of Linear, Monotonic, or Labeled dimensions of the multi-dimensional space. "dimensions": [{ "type": "linear", "count": 1608, "increment": "0.08333333333 yr", "coordinates_offset": "1880.0416666667 yr", }], # A list of dependent variables sampling the multi-dimensional space. "dependent_variables": [{ "type": "internal", "unit": "mm", "numeric_type": "float32", "quantity_type": "scalar", "component_labels": ["GMSL"], "components": [ ["-183.0, -171.125, ..., 59.6875, 58.5"] ] }] }

Installing csdmpy package

$ pip install csdmpy

How to cite

Please cite the following when used in publication.

  1. Srivastava D.J., Vosegaard T., Massiot D., Grandinetti P.J. (2020) Core Scientific Dataset Model: A lightweight and portable model and file format for multi-dimensional scientific data. PLOS ONE 15(1): e0225953.

Check out the media coverage.

Owner

  • Name: Deepansh Srivastava
  • Login: deepanshs
  • Kind: user
  • Location: New Haven, CT
  • Company: Hyperfine

Senior Scientist at Hyperfine

GitHub Events

Total
  • Release event: 1
  • Watch event: 1
  • Delete event: 14
  • Issue comment event: 9
  • Push event: 7
  • Pull request review event: 7
  • Pull request event: 27
  • Create event: 15
Last Year
  • Release event: 1
  • Watch event: 1
  • Delete event: 14
  • Issue comment event: 9
  • Push event: 7
  • Pull request review event: 7
  • Pull request event: 27
  • Create event: 15

Committers

Last synced: about 2 years ago

All Time
  • Total Commits: 314
  • Total Committers: 4
  • Avg Commits per committer: 78.5
  • Development Distribution Score (DDS): 0.213
Past Year
  • Commits: 48
  • Committers: 3
  • Avg Commits per committer: 16.0
  • Development Distribution Score (DDS): 0.146
Top Committers
Name Email Commits
Deepansh Srivastava d****2@g****m 247
deepanshs 2****s 41
dependabot[bot] 4****] 24
Matthew Giammar 4****r 2

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 13
  • Total pull requests: 151
  • Average time to close issues: 25 days
  • Average time to close pull requests: 16 days
  • Total issue authors: 5
  • Total pull request authors: 2
  • Average comments per issue: 0.38
  • Average comments per pull request: 0.98
  • Merged pull requests: 112
  • Bot issues: 0
  • Bot pull requests: 93
Past Year
  • Issues: 0
  • Pull requests: 32
  • Average time to close issues: N/A
  • Average time to close pull requests: 11 days
  • Issue authors: 0
  • Pull request authors: 2
  • Average comments per issue: 0
  • Average comments per pull request: 0.53
  • Merged pull requests: 15
  • Bot issues: 0
  • Bot pull requests: 30
Top Authors
Issue Authors
  • mgiammar (5)
  • mccarthy677 (5)
  • deepanshs (1)
  • sgbaird (1)
  • pjgrandinetti (1)
Pull Request Authors
  • dependabot[bot] (93)
  • deepanshs (58)
Top Labels
Issue Labels
bug (2) documentation (1)
Pull Request Labels
dependencies (93) github_actions (76) python (17) bug (4) management (1) enhancement (1) documentation (1)

Packages

  • Total packages: 1
  • Total downloads:
    • pypi 466 last-month
  • Total dependent packages: 1
  • Total dependent repositories: 6
  • Total versions: 21
  • Total maintainers: 1
pypi.org: csdmpy

A python module for the core scientific dataset model.

  • Versions: 21
  • Dependent Packages: 1
  • Dependent Repositories: 6
  • Downloads: 466 Last month
Rankings
Dependent repos count: 6.1%
Dependent packages count: 7.3%
Downloads: 8.2%
Average: 10.7%
Stargazers count: 14.9%
Forks count: 16.9%
Maintainers (1)
Last synced: 7 months ago

Dependencies

requirements-dev.txt pypi
  • astropy >=3.0
  • codecov *
  • coverage >=4.5.3
  • matplotlib >=3.0
  • numexpr >=2.7.0
  • numpy >=1.17
  • pillow >=7.1.2
  • pre-commit >=1.16.1
  • pytest >=4.6.3
  • pytest-cov *
  • pytest-runner >=5.0
  • requests >=2.21.0
  • setuptools >=27.3
  • sphinx >=2.0
  • sphinx-gallery >=0.6.2
  • sphinx_copybutton >=0.3.0
  • sphinxjp.themes.basicstrap *
requirements.txt pypi
  • astropy >=3.0
  • matplotlib >=3.0
  • numexpr >=2.7.0
  • numpy >=1.17
  • requests >=2.21.0
  • setuptools >=27.3
.github/workflows/codeql-analysis.yml actions
  • actions/checkout v3 composite
  • github/codeql-action/analyze v2 composite
  • github/codeql-action/autobuild v2 composite
  • github/codeql-action/init v2 composite
.github/workflows/continuous-integration-pip.yml actions
  • actions/checkout v3 composite
  • actions/setup-python v4 composite
  • codecov/codecov-action v3.1.1 composite
pyproject.toml pypi
setup.py pypi
docs/requirements.txt pypi
  • astropy >=5.1
  • breathe >=4.34.0
  • matplotlib >=3.3.3
  • numexpr ==2.8.4
  • numpy >=1.20
  • pillow >=7.1.2
  • recommonmark *
  • requests >=2.21.0
  • setuptools >=27.3
  • sphinx <5.0.0
  • sphinx-gallery >0.6.2
  • sphinx-version-warning *
  • sphinx_copybutton >=0.3.0
  • sphinx_tabs >3.0
  • sphinxjp.themes.basicstrap *