https://github.com/alexslemonade/openscpca-nf

A workflow for running OpenScPCA analysis modules

https://github.com/alexslemonade/openscpca-nf

Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (13.8%) to scientific vocabulary
Last synced: 10 months ago · JSON representation

Repository

A workflow for running OpenScPCA analysis modules

Basic Info
  • Host: GitHub
  • Owner: AlexsLemonade
  • License: bsd-3-clause
  • Language: R
  • Default Branch: main
  • Size: 720 KB
Statistics
  • Stars: 1
  • Watchers: 6
  • Forks: 0
  • Open Issues: 14
  • Releases: 7
Created over 2 years ago · Last pushed 10 months ago
Metadata Files
Readme Changelog License Code of conduct

README.md

OpenScPCA-nf

A workflow for running analysis modules ported from the OpenScPCA-analysis repository to Nextflow.

See https://github.com/AlexsLemonade/OpenScPCA-admin/blob/main/technical-docs/nextflow-workflow-specifications.md for initial implementation plans for this workflow.

Running the workflow

The workflow is currently set up to run best via AWS batch, but some testing may work locally. You will need to have appropriate AWS credentials set up to run the workflow on AWS and access the data files.

Running the workflow manually

Alternatively, you can run the workflow locally. The following base command will run the main workflow, assuming all AWS permissions are set up correctly:

bash nextflow run AlexsLemonade/OpenScPCA-nf -profile batch

For many use cases you may want to use the --results_bucket argument to avoid writing to the default output bucket (the default is a staging bucket, but may not be accessible to all users). Note that despite the name, this can be a local directory as well as an S3 bucket. For an S3 bucket, the format should be s3://bucket-name/path/to/results.

bash nextflow run AlexsLemonade/OpenScPCA-nf -profile batch --results_bucket {OUTDIR}

Profiles

To run the workflow with simulated data, you can add the simulated profile. As with the main workflow, you will want to specify an output directory for the simulated results with the --results_bucket argument.

bash nextflow run AlexsLemonade/OpenScPCA-nf -profile batch,simulated --results_bucket {SIM_RESULTS_DIR}

Entry points

The workflow also has a couple of entry points other than the main workflow, for testing and creating simulated data.

To run a test version of the workflow to check permissions and infrastructure setup:

bash nextflow run AlexsLemonade/OpenScPCA-nf -profile batch -entry test

To run the workflow that creates simulated SCE objects for the OpenScPCA project, you can use the following command, which specifies running the workflow with the simulate entry point. Note that you will need to specify the directory for the simulation output using the --sim_pubdir argument, as the default output bucket is not writeable except by a few specific roles:

bash nextflow run AlexsLemonade/OpenScPCA-nf -profile batch -entry simulate --sim_pubdir {SIMDIR}

Stub runs

All of the above commands will run the complete workflow processes. To test the general logic of the workflow without running the full workflow you can use a stub run by including the -stub argument and -profile stub.

bash nextflow run AlexsLemonade/OpenScPCA-nf -stub -profile stub

This version of the workflow is run for every pull request to the main branch.

Repository setup

This repository uses pre-commit to enforce code style and formatting. To install the pre-commit hooks described in .pre-commit-config.yaml, run the following command in the repository root:

bash pre-commit install

Workflow releases

This repository employs two different release strategies; one for tracking the workflow code itself, and one used to track the underlying data being used for workflow runs.

Code releases

When the code of the workflow changes significantly, we use GitHub releases to create a tag using semantic versioning (e.g. v1.0.0). The creation of a release on GitHub will trigger a workflow run using the current data release, as specified in nextflow.config by the release_prefix parameter. This run will replace the data in the results bucket for the current data release with the updated results from the workflow run.

Data releases

When a new OpenScPCA data release is created, the release_prefix parameter will be updated, and a Git tag will be added with the format YYYY-MM-DD (e.g. 2024-07-08), matching the value of release_prefix. This will trigger a workflow run for the purposes of populating the results bucket, but these tags should not be used as releases on GitHub.

Owner

  • Name: Alex's Lemonade Stand Foundation
  • Login: AlexsLemonade
  • Kind: organization

Childhood Cancer Data Lab of ALSF

GitHub Events

Total
  • Create event: 31
  • Release event: 3
  • Issues event: 54
  • Delete event: 32
  • Issue comment event: 79
  • Push event: 153
  • Pull request review comment event: 82
  • Pull request review event: 89
  • Pull request event: 70
Last Year
  • Create event: 31
  • Release event: 3
  • Issues event: 54
  • Delete event: 32
  • Issue comment event: 79
  • Push event: 153
  • Pull request review comment event: 82
  • Pull request review event: 89
  • Pull request event: 70

Issues and Pull Requests

Last synced: 10 months ago

All Time
  • Total issues: 65
  • Total pull requests: 88
  • Average time to close issues: 12 days
  • Average time to close pull requests: 2 days
  • Total issue authors: 3
  • Total pull request authors: 6
  • Average comments per issue: 1.08
  • Average comments per pull request: 0.5
  • Merged pull requests: 63
  • Bot issues: 0
  • Bot pull requests: 4
Past Year
  • Issues: 34
  • Pull requests: 38
  • Average time to close issues: 5 days
  • Average time to close pull requests: 4 days
  • Issue authors: 3
  • Pull request authors: 4
  • Average comments per issue: 0.68
  • Average comments per pull request: 0.84
  • Merged pull requests: 24
  • Bot issues: 0
  • Bot pull requests: 3
Top Authors
Issue Authors
  • jashapiro (36)
  • sjspielman (16)
  • allyhawkins (13)
Pull Request Authors
  • jashapiro (56)
  • allyhawkins (14)
  • sjspielman (9)
  • pre-commit-ci[bot] (4)
  • keyrus-rob-jackson (4)
  • jaclyn-taroni (1)
Top Labels
Issue Labels
Pull Request Labels

Dependencies

.github/workflows/run-batch.yml actions
  • actions/checkout v2 composite
  • aws-actions/configure-aws-credentials v1 composite
.github/workflows/nextflow-checks.yml actions
  • actions/checkout v4 composite
  • actions/upload-artifact v4 composite
  • nf-core/setup-nextflow v2.0.0 composite
environment.yml pypi