Science Score: 57.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 10 DOI reference(s) in README -
○Academic publication links
-
○Committers with academic emails
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (10.1%) to scientific vocabulary
Keywords
Repository
A pipeline for processing calling cards data
Basic Info
- Host: GitHub
- Owner: nf-core
- License: mit
- Language: Nextflow
- Default Branch: master
- Homepage: https://nf-co.re/callingcards
- Size: 38.6 MB
Statistics
- Stars: 5
- Watchers: 170
- Forks: 4
- Open Issues: 5
- Releases: 1
Topics
Metadata Files
README.md
Introduction
nf-core/callingcards is a bioinformatics pipeline that can be used to process raw Calling Cards data obtained from either mammals (human, mouse) or yeast. It takes a samplesheet, which describes the sample names, paths to the fastq files, and paths to the barcode details json files, a parameter which identifies the
organism type (either mammals or yeast) and either the name of the reference genome on igenomes or paths to the fasta/gff. It then parses the reads, counts the
number of calling cards insertions, and provides some QC metrics.

- Prepare the Genome
- Prepare the Reads
- Read QC (
FastQC) - Split reads
- Yeast: demultiplex by barcode (
callingCardsTools) - Mammals: split for parallel processing (
seqkit) - Mammals: extract barcode to fastq read ID(
UMItools)
- Yeast: demultiplex by barcode (
- Optionally trim reads (
Trimmomatic)])
- Read QC (
- Align
- Count Hops (
callingCardsTools) - Present QC data (
MultiQC)
Usage
:::note
If you are new to Nextflow and nf-core, please refer to this page on how
to set-up Nextflow. Make sure to test your setup
with -profile test before running the workflow on actual data.
:::
First, prepare a samplesheet with your input data according to your organism.
A Yeast samplesheet will look like:
yeast_samplesheet.csv:
csv
sample,fastq_1,fastq_2,barcode_details
run_6177,run_6177_sample_R1.fastq.gz,run_6177_sample_R2.fastq.gz,run_6177_barcode_details.json
Each row represents a multiplexed fastq file where the barcode_details.json file describes the barcodes which correspond to each transcription factor in the library.
A mammals (human, mouse) samplesheet will look like:
mammals_samplesheet.csv:
csv
sample,fastq_1,fastq_2,barcode_details
midbrain_rep3,midbrain_rep3_R1.fastq.gz,,barcode_details.json
Note that currently, the mammals workflow expects only R1 reads.
Now, you can run the pipeline using:
bash
nextflow run nf-core/callingcards \
-profile <docker/singularity/.../institute>,<default_yeast/default_mammals> \
--input samplesheet.csv \
--genome <igenomes_name (only required for mammals)> \
--outdir <OUTDIR>
Note that the default_yeast and default_mammals profiles are provided for convenience. You should check the parameters which these set to ensure that they are what you need for your data.
:::warning
Please provide pipeline parameters via the CLI or Nextflow -params-file option. Custom config files including those
provided by the -c Nextflow option can be used to provide any configuration except for parameters;
see docs.
:::
For more details and further functionality, please refer to the usage documentation and the parameter documentation.
Pipeline output
To see the results of an example test run with a full size dataset refer to the results tab on the nf-core website pipeline page. For more details about the output files and reports, please refer to the output documentation.
Credits
nf-core/callingcards is implemented in nextflow by Chase Mateusiak. It was adapted from scripts written by:
We thank the following people for their extensive assistance in the development of this pipeline:
Contributions and Support
If you would like to contribute to this pipeline, please see the contributing guidelines.
For further information or help, don't hesitate to get in touch on the Slack #callingcards channel (you can join with this invite).
Citations
An extensive list of references for the tools used by the pipeline can be found in the CITATIONS.md file.
You can cite the nf-core publication as follows:
The nf-core framework for community-curated bioinformatics pipelines.
Philip Ewels, Alexander Peltzer, Sven Fillinger, Harshil Patel, Johannes Alneberg, Andreas Wilm, Maxime Ulysse Garcia, Paolo Di Tommaso & Sven Nahnsen.
Nat Biotechnol. 2020 Feb 13. doi: 10.1038/s41587-020-0439-x.
Owner
- Name: nf-core
- Login: nf-core
- Kind: organization
- Email: core@nf-co.re
- Website: http://nf-co.re
- Twitter: nf_core
- Repositories: 84
- Profile: https://github.com/nf-core
A community effort to collect a curated set of analysis pipelines built using Nextflow.
Citation (CITATIONS.md)
# nf-core/callingcards: Citations ## [nf-core](https://pubmed.ncbi.nlm.nih.gov/32055031/) > Ewels PA, Peltzer A, Fillinger S, Patel H, Alneberg J, Wilm A, Garcia MU, Di Tommaso P, Nahnsen S. The nf-core framework for community-curated bioinformatics pipelines. Nat Biotechnol. 2020 Mar;38(3):276-278. doi: 10.1038/s41587-020-0439-x. PubMed PMID: 32055031. ## [Nextflow](https://pubmed.ncbi.nlm.nih.gov/28398311/) > Di Tommaso P, Chatzou M, Floden EW, Barja PP, Palumbo E, Notredame C. Nextflow enables reproducible computational workflows. Nat Biotechnol. 2017 Apr 11;35(4):316-319. doi: 10.1038/nbt.3820. PubMed PMID: 28398311. ## Pipeline tools - [bedtools](https://doi.org/10.1093/bioinformatics/btq033) > Aaron R. Quinlan, Ira M. Hall, BEDTools: a flexible suite of utilities for comparing genomic features, Bioinformatics, Volume 26, Issue 6, March 2010, Pages 841–842, https://doi.org/10.1093/bioinformatics/btq033 - [bowtie](https://genomebiology.biomedcentral.com/articles/10.1186/gb-2009-10-3-r25) > Langmead B, Trapnell C, Pop M, Salzberg SL. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 10:R25. - [bowtie2](https://www.nature.com/articles/nmeth.1923) > Langmead B, Salzberg S. Fast gapped-read alignment with Bowtie 2. Nature Methods. 2012, 9:357-359. - [bwa](https://pubmed.ncbi.nlm.nih.gov/19451168/) > Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009 Jul 15;25(14):1754-60. doi: 10.1093/bioinformatics/btp324. Epub 2009 May 18. PMID: 19451168; PMCID: PMC2705234. - [bwamem2](https://doi.org/10.1109/IPDPS.2019.00041) > Vasimuddin Md, Sanchit Misra, Heng Li, Srinivas Aluru. Efficient Architecture-Aware Acceleration of BWA-MEM for Multicore Systems. IEEE Parallel and Distributed Processing Symposium (IPDPS), 2019. 10.1109/IPDPS.2019.00041 - [callingCardsTools](https://zenodo.org/records/10042067) > Chase Mateusiak. (2023). cmatKhan/callingCardsTools: v1.2.0 (v1.2.0). Zenodo. https://doi.org/10.5281/zenodo.10042067 - [FastQC](https://www.bioinformatics.babraham.ac.uk/projects/fastqc/) > Andrews, S. (2010). FastQC: A Quality Control Tool for High Throughput Sequence Data [Online]. - [MultiQC](https://pubmed.ncbi.nlm.nih.gov/27312411/) > Ewels P, Magnusson M, Lundin S, Käller M. MultiQC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics. 2016 Oct 1;32(19):3047-8. doi: 10.1093/bioinformatics/btw354. Epub 2016 Jun 16. PubMed PMID: 27312411; PubMed Central PMCID: PMC5039924. - [nf-core/rnaseq:GTF2BED](https://github.com/nf-core/rnaseq/blob/master/bin/gtf2bed) > Harshil Patel, Phil Ewels, Alexander Peltzer, Olga Botvinnik, Gregor Sturm, Denis Moreno, Pranathi Vemuri, Maxime U Garcia, silviamorins, Lorena Pantano, Mahesh Binzer-Panchal, nf-core bot, Robert Syme, Matthias Zepper, Gavin Kelly, Friederike Hanssen, James A. Fellows Yates, Chris Cheshire, rfenouil, … Paolo Di Tommaso. (2023). nf-core/rnaseq: nf-core/rnaseq v3.12.0 - Osmium Octopus (3.12.0). Zenodo. https://doi.org/10.5281/zenodo.7998767 - [picard](http://broadinstitute.github.io/picard/) > “Picard Toolkit.” 2019. Broad Institute, GitHub Repository. https://broadinstitute.github.io/picard/; Broad Institute. - [rseqc](https://rseqc.sourceforge.net/) > Wang L, Wang S, Li W. RSeQC: quality control of RNA-seq experiments. Bioinformatics. 2012 Aug 15;28(16):2184-5. doi: 10.1093/bioinformatics/bts356. Epub 2012 Jun 27. PMID: 22743226. - [samtools](https://doi.org/10.1093/gigascience/giab008) > Twelve years of SAMtools and BCFtools Petr Danecek, James K Bonfield, Jennifer Liddle, John Marshall, Valeriu Ohan, Martin O Pollard, Andrew Whitwham, Thomas Keane, Shane A McCarthy, Robert M Davies, Heng Li GigaScience, Volume 10, Issue 2, February 2021, giab008, https://doi.org/10.1093/gigascience/giab008 - [seqkit](https://doi.org/10.1371/journal.pone.0163962) > Shen W, Le S, Li Y, Hu F (2016) SeqKit: A Cross-Platform and Ultrafast Toolkit for FASTA/Q File Manipulation. PLoS ONE 11(10): e0163962. https://doi.org/10.1371/journal.pone.0163962 - [trimmomatic](http://www.usadellab.org/cms/?page=trimmomatic) > Bolger, A. M., Lohse, M., & Usadel, B. (2014). Trimmomatic: A flexible trimmer for Illumina Sequence Data. Bioinformatics, btu170. - [UMITools](https://pubmed.ncbi.nlm.nih.gov/28100584/) > Smith T, Heger A, Sudbery I. UMI-tools: modeling sequencing errors in Unique Molecular Identifiers to improve quantification accuracy. Genome Res. 2017 Mar;27(3):491-499. doi: 10.1101/gr.209601.116. Epub 2017 Jan 18. PMID: 28100584; PMCID: PMC5340976. ## Software packaging/containerisation tools - [Anaconda](https://anaconda.com) > Anaconda Software Distribution. Computer software. Vers. 2-2.4.0. Anaconda, Nov. 2016. Web. - [Bioconda](https://pubmed.ncbi.nlm.nih.gov/29967506/) > Grüning B, Dale R, Sjödin A, Chapman BA, Rowe J, Tomkins-Tinch CH, Valieris R, Köster J; Bioconda Team. Bioconda: sustainable and comprehensive software distribution for the life sciences. Nat Methods. 2018 Jul;15(7):475-476. doi: 10.1038/s41592-018-0046-7. PubMed PMID: 29967506. - [BioContainers](https://pubmed.ncbi.nlm.nih.gov/28379341/) > da Veiga Leprevost F, Grüning B, Aflitos SA, Röst HL, Uszkoreit J, Barsnes H, Vaudel M, Moreno P, Gatto L, Weber J, Bai M, Jimenez RC, Sachsenberg T, Pfeuffer J, Alvarez RV, Griss J, Nesvizhskii AI, Perez-Riverol Y. BioContainers: an open-source and community-driven framework for software standardization. Bioinformatics. 2017 Aug 15;33(16):2580-2582. doi: 10.1093/bioinformatics/btx192. PubMed PMID: 28379341; PubMed Central PMCID: PMC5870671. - [Docker](https://dl.acm.org/doi/10.5555/2600239.2600241) > Merkel, D. (2014). Docker: lightweight linux containers for consistent development and deployment. Linux Journal, 2014(239), 2. doi: 10.5555/2600239.2600241. - [Singularity](https://pubmed.ncbi.nlm.nih.gov/28494014/) > Kurtzer GM, Sochat V, Bauer MW. Singularity: Scientific containers for mobility of compute. PLoS One. 2017 May 11;12(5):e0177459. doi: 10.1371/journal.pone.0177459. eCollection 2017. PubMed PMID: 28494014; PubMed Central PMCID: PMC5426675.
GitHub Events
Total
- Issues event: 2
- Watch event: 2
- Delete event: 13
- Issue comment event: 10
- Push event: 12
- Pull request review event: 1
- Pull request event: 12
- Create event: 7
Last Year
- Issues event: 2
- Watch event: 2
- Delete event: 13
- Issue comment event: 10
- Push event: 12
- Pull request review event: 1
- Pull request event: 12
- Create event: 7
Committers
Last synced: over 2 years ago
Top Committers
| Name | Commits | |
|---|---|---|
| chase mateusiak | c****k@g****m | 37 |
Issues and Pull Requests
Last synced: 6 months ago
All Time
- Total issues: 9
- Total pull requests: 12
- Average time to close issues: 3 months
- Average time to close pull requests: 2 months
- Total issue authors: 3
- Total pull request authors: 3
- Average comments per issue: 1.56
- Average comments per pull request: 0.75
- Merged pull requests: 4
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 1
- Pull requests: 4
- Average time to close issues: about 2 months
- Average time to close pull requests: about 2 months
- Issue authors: 1
- Pull request authors: 1
- Average comments per issue: 2.0
- Average comments per pull request: 1.0
- Merged pull requests: 1
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- cmatKhan (8)
- allenyen (3)
- jen-reeve (1)
Pull Request Authors
- nf-core-bot (17)
- cmatKhan (5)
- adamrtalbot (1)
Top Labels
Issue Labels
Pull Request Labels
Dependencies
- actions/upload-artifact v3 composite
- nf-core/tower-action v3 composite
- actions/upload-artifact v3 composite
- nf-core/tower-action v3 composite
- mshick/add-pr-comment v1 composite
- actions/checkout v3 composite
- nf-core/setup-nextflow v1 composite
- actions/checkout v3 composite
- actions/setup-node v3 composite
- actions/checkout v3 composite
- actions/setup-node v3 composite
- actions/setup-python v4 composite
- actions/upload-artifact v3 composite
- mshick/add-pr-comment v1 composite
- nf-core/setup-nextflow v1 composite
- psf/black stable composite
- dawidd6/action-download-artifact v2 composite
- marocchino/sticky-pull-request-comment v2 composite
- actions/stale v7 composite
- actions/setup-python 82c7e631bb3cdc910f68e0081d67478d79c6982d composite
- eWaterCycle/setup-singularity 931d4e31109e875b13309ae1d07c70ca8fbc8537 composite
- jlumbroso/free-disk-space 54081f138730dfa15788a46383842cd2f914a1be composite
- nf-core/setup-nextflow v2 composite
- actions/setup-python 82c7e631bb3cdc910f68e0081d67478d79c6982d composite
- rzr/fediverse-action master composite
- zentered/bluesky-post-action 80dbe0a7697de18c15ad22f4619919ceb5ccf597 composite
- coreutils 8.25.*
- coreutils 8.25.*
- perl 5.26.2.*