https://github.com/broadinstitute/submissions

https://github.com/broadinstitute/submissions

Science Score: 54.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: ncbi.nlm.nih.gov
  • Committers with academic emails
    1 of 4 committers (25.0%) from academic institutions
  • Institutional organization owner
    Organization broadinstitute has institutional domain (www.broadinstitute.org)
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (11.6%) to scientific vocabulary
Last synced: 7 months ago · JSON representation

Repository

Basic Info
  • Host: GitHub
  • Owner: broadinstitute
  • Language: Python
  • Default Branch: main
  • Size: 13.4 MB
Statistics
  • Stars: 1
  • Watchers: 5
  • Forks: 0
  • Open Issues: 2
  • Releases: 0
Created over 3 years ago · Last pushed 9 months ago
Metadata Files
Readme

README.md

Submissions

This repository contains workflows to submit and verify data with two different government repositories: GDC and dbGaP

Submitting Data to GDC

There are two steps to submitting data to the GDC: submission and validation. * First, use the transferToGDC workflow to transfer your files. The README is located at the bottom of the page at that link, which describes what each input to the workflow is. * Note that read-group level metadata can either be provided in the Terra metadata tables (this should be used for older samples run via Zamboni), OR it can be provided in a JSON file (the read_group_metadata_json, which should be used for newer samples run via DRAGEN). * After samples have been submitted, their validation status can be checked by using the validateGDCStatus workflow. This workflow will check each sample's validation status within the GDC and update the Terra metadata tables with the validation status.

Submitting Data to dbGaP

There are two steps to submitting data to dbGapP: submission and validation. * First, use the transferToDbgap workflow to transfer your files. The README is located at the bottom of the page at that link, which described what each input to the workflow is. * Note that read-group level metadata can either be provided in the Terra metadata tables (this should be used for older samples run via Zamboni), OR it can be provided in a JSON file (the read_group_metadata_json, which should be used for newer samples run via DRAGEN). * After samples have been submitted, their validation status can be checked by using the validateDbGapStatus workflow. This workflow will check each sample's validation status within dbGaP and update the Terra metadata tables with the validation status.

Deploying Changes in this Repository

When making changes to the .wdl files

Nothing is required if only the .wdl files are changed. Once your branch is merged to main, dockstore will automatically get updated with the most recent changes. In your Terra workspace, you can always verify what code is running by looking at the source code (in Terra on GCP, this can be found in the "SCRIPT" tab when you're navigated to your workflow configuration page).

When making changes to the Python files

If you've made a change to your Python file, most likely you'll need to recreate and push the image using the V2 Dockerfile since this is the one that contains all the Python code. You'll need to build, tag and push the docker image to this repository. Note that even though this repository is public, you'll need to be added as a collaborator in order to successfully push changes to it.

Building the Docker image - how to find which image to re-build

If you've updated any of the Python code, the docker image(s) will have to be rebuilt and pushed to DockerHub. First track down where in which .wdl file that Python code is called. Now in that .wdl, find the Docker image that's defined in the runtime attributes. This should correspond to one of the Docker files that are located within a subdirectory of Docker. Once you've found the Dockerfile you'll need to re-create, you can use the following commands to build and push the docker images (note, you don't have to necessarily build all three images, but these are the commands to use in case you do): ```commandline docker build -t schaluvadi/horsefish:submissionAspera -f Docker/Aspera/Dockerfile . --platform="linux/amd64"

docker build -t schaluvadi/horsefish:submissionV2 -f Docker/V2/Dockerfile . --platform="linux/amd64"

docker build -t schaluvadi/horsefish:submissionV1 -f Docker/V1/Dockerfile . --platform="linux/amd64" `` You'll need to add the--platform="linux/amd64in case your default platform is different on your machine. Once you've successfully created the Docker image, you can rundocker imagesand you should see a newly created image. If you're like to verify anything, you can open the image in an interactive shell. First rundocker imagesand copy theIMAGE IDof your new image. Next rundocker run -it {IMAGE_ID}. This opens an interactive shell where you can run regular unix commands such ascd,grep,vim`, etc.

Pushing your new Docker image

Once you're recreated your image and verified that your changes have propagated locally, you'll need to push your new image version to this public repository. You can do so by running any of the following commands (depending on which image you have built and need to push): commandLine docker push schaluvadi/horsefish:submissionAspera docker push schaluvadi/horsefish:submissionV2 docker push schaluvadi/horsefish:submissionV1

SSH Key Creation and Usage Guide

This guide provides instructions for creating an SSH key pair and utilizing it to establish secure connections with dbGaP.

SSH Key Generation

To generate an SSH key pair, follow these steps: 1. Open a terminal or command prompt. 2. Use the following command: ssh-keygen -t rsa -m PEM -f ./private.openssh This command will generate two files in your current directory: - private.openssh: Your private key. - private.openssh.pub: Your public key.

Linking Your Public Key

Once you have generated your SSH key pair, follow these steps to link your public key: 1. Send your public SSH key (private.openssh.pub) to richard.lapoint@nih.gov.

Uploading Your Private Key

After linking your public key, you can upload your private key to the designated workspace. Updating key in Terra Note: Keep your private key secure and do not share it with anyone.

Support

For any inquiries or assistance, please contact Nareh Sahakian at sahakian@broadinstitute.org.

Disclaimer

Ensure you follow your organization's security policies and guidelines when managing SSH keys and accessing workspaces.

Owner

  • Name: Broad Institute
  • Login: broadinstitute
  • Kind: organization
  • Location: Cambridge, MA

Broad Institute of MIT and Harvard

GitHub Events

Total
  • Delete event: 40
  • Issue comment event: 1
  • Push event: 64
  • Pull request review event: 4
  • Pull request event: 40
  • Create event: 14
Last Year
  • Delete event: 40
  • Issue comment event: 1
  • Push event: 64
  • Pull request review event: 4
  • Pull request event: 40
  • Create event: 14

Committers

Last synced: 10 months ago

All Time
  • Total Commits: 466
  • Total Committers: 4
  • Avg Commits per committer: 116.5
  • Development Distribution Score (DDS): 0.118
Past Year
  • Commits: 20
  • Committers: 1
  • Avg Commits per committer: 20.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
phendriksen100 1****0 411
Nareh Sahakian 4****n 46
Sushma Chaluvadi s****a@b****g 8
Austin Pardo a****o@g****m 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 7 months ago

All Time
  • Total issues: 0
  • Total pull requests: 69
  • Average time to close issues: N/A
  • Average time to close pull requests: 26 days
  • Total issue authors: 0
  • Total pull request authors: 4
  • Average comments per issue: 0
  • Average comments per pull request: 0.01
  • Merged pull requests: 63
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 31
  • Average time to close issues: N/A
  • Average time to close pull requests: 1 day
  • Issue authors: 0
  • Pull request authors: 1
  • Average comments per issue: 0
  • Average comments per pull request: 0.03
  • Merged pull requests: 27
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
  • sahakiann (54)
  • phendriksen100 (36)
  • schaluva (5)
  • snovod (2)
Top Labels
Issue Labels
Pull Request Labels