Science Capsule - Capturing the Data Life Cycle

Science Capsule - Capturing the Data Life Cycle - Published in JOSS (2021)

https://bitbucket.org/sciencecapsule/sciencecapsule

Science Score: 89.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 1 DOI reference(s) in JOSS metadata
  • Academic publication links
  • Committers with academic emails
    7 of 7 committers (100.0%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
    Published in Journal of Open Source Software
Last synced: 10 months ago · JSON representation

Repository

Core ScienceCapsule repository

Basic Info
  • Host: Bitbucket.org
  • Owner: sciencecapsule
  • Language: python
  • Default Branch: master
  • Size: 6.64 MB
Statistics
  • Stars:
  • Forks:
  • Open Issues:
  • Releases: 0
Created almost 7 years ago

https://bitbucket.org/sciencecapsule/sciencecapsule/blob/master/

# [![Science Capsule](docs/sc_logo.png)](http://sciencecapsule.lbl.gov)

### About

[Science Capsule](http://sciencecapsule.lbl.gov) is a framework to understand, reuse and reproduce a scientific workflow. It helps scientists capture the data and knowledge about the life cycle of their scientific experiments or workflows, including the processes, datasets and other relevant artifacts. Science Capsule supports workflows running on different OS platforms and on containers.

Science Capsule is motivated by workflows combining tasks and data from the [Advanced Light Source (ALS)](https://als.lbl.gov/) and [The Materials Project (MP)](https://materialsproject.org/), but the design is applicable to a number of different scientific domains and workflows. Science Capsule
works with existing workflow tools and scripts, and unlike  existing frameworks that require researchers to either modify or instrument their workflows, can be used out-of-the-box for their research.

### Features

* Automatic capture of workflow events with minimal human intervention
* Ability to collect events from multiple event sources
* Capability to create, save, share, and extend workflows using containers
* Allows for easy migration and execution of workflows across multiple platforms
* A simple interface to annotate workflow executions with user notes and images

### Supported platforms

*   Linux
*   Mac OS X
*   Windows

In addition, Science Capsule is designed to work on:

* HPC systems (e.g., the [Cori supercomputer](https://docs.nersc.gov/systems/cori/) at NERSC)
* Docker containers
* HPC containers (e.g., [Shifter](https://github.com/NERSC/shifter))

### Getting started

You can follow the links below to install Science Capsule:

* on [bare-metal](docs/README-baremetal.md) 
* with [containers](docs/README-docker.md)

Once installed, you can:

* run the [test suite](docs/testing.md)
* follow the guide for running an [example workflow](docs/example.md)
* learn more about the [available commands](docs/commands.md)

As the workflow events are captured in Science Capsule, you can [follow these steps to view the timeline](docs/webui.md) on a web browser.

### License

[BSD-3-Clause-LBNL](https://bitbucket.org/sciencecapsule/sciencecapsule/src/master/LICENSE)

JOSS Publication

Science Capsule - Capturing the Data Life Cycle
Published
June 17, 2021
Volume 6, Issue 62, Page 2484
Authors
Devarshi Ghoshal ORCID
Lawrence Berkeley National Lab
Ludovico Bianchi ORCID
Lawrence Berkeley National Lab
Abdelilah Essiari
Lawrence Berkeley National Lab
Michael Beach
Lawrence Berkeley National Lab, University of Washington
Drew Paine ORCID
Lawrence Berkeley National Lab
Lavanya Ramakrishnan ORCID
Lawrence Berkeley National Lab
Editor
Amy Roberts ORCID
Tags
reproducibility scientific workflows data management high performance computing big data analytics

Committers

Last synced: 11 months ago

All Time
  • Total Commits: 456
  • Total Committers: 7
  • Avg Commits per committer: 65.143
  • Development Distribution Score (DDS): 0.226
Past Year
  • Commits: 0
  • Committers: 0
  • Avg Commits per committer: 0.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Ludovico Bianchi l****i@l****v 353
dghoshal-lbl d****l@l****v 65
abessiari a****i@l****v 28
Michael Beach m****h@l****v 7
Daniel Katz d****z@i****g 1
Drew Paine p****d@l****v 1
Lavanya Ramakrishnan l****n@l****v 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 10 months ago


Dependencies

ui/backend/package-lock.json npm
  • 131 dependencies
ui/backend/package.json npm
  • body-parser ^1.19.0
  • express ^4.17.1
  • mongodb ^3.3.3
  • pkg.json ^2.0.1
jenkins/requirements-dev.txt pypi
  • gitpython * development
  • mongomock * development
  • pandas ==0.25. development
  • psutil <5.6 development
  • psutil * development
  • pymodm * development
  • pymongo * development
  • pytest * development
  • pytest-cov * development
  • pytest-runner * development
  • python-dateutil * development
  • pyyaml * development
  • straight.plugin <=1.5. development
  • supervisor * development
  • supervisor-win ==4.1.1 development
  • tabulate * development
  • watchdog * development
setup.py pypi
  • FIXME *
  • IMPORTANT *
  • changes *
  • for *
  • gitpython *
  • pandas ==1.2.
  • pinning *
  • psutil *
  • psutil <5.6
  • pymodm *
  • pymongo *
  • pyyaml *
  • see *
  • supervisor *
  • supervisor-win >=4.3.0
  • supervisor-win *
  • tabulate *
  • watchdog <0.10.4