https://github.com/cnr-ibba/nf-chip-alignment
Align chip probes to a reference genome
Science Score: 23.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
○DOI references
-
✓Academic publication links
Links to: ncbi.nlm.nih.gov -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (10.5%) to scientific vocabulary
Keywords
Repository
Align chip probes to a reference genome
Basic Info
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 2
- Releases: 0
Topics
Metadata Files
README.md
Nextflow chip alignment
This nextflow pipeline is an attempt to align probes from a chip-manifest file on a reference genomes using blast. The reason of doing this is to define the strand orientation of the probe on the reference sequence, in order to be able to convert a genotype from illumina top to forward (or ALT/REF) coordinates. This pipeline was developed as a component of the SMARTER project (WP4 in particular) and will be the first step towards a new SNPchiMp release.
Requisites
To run this nextflow pipeline, you require both a genome reference sequence (better
to download it from NCBI ftp) and a manifest csv file from an illumina/affymetrix
chip. Next, you you will need nextflow installed and at least one of this
different executors: conda, singularity and docker. You can choose to clone
this repository if you plan to change this pipeline according your needs:
bash
git clone https://github.com/cnr-ibba/nf-chip-alignment.git
The other way to running this pipeline is described in pipeline sharing nextflow manual.
Testing pipeline
You can test your local configuration by calling the test profile with one of
the three supported executors. For example, to test this pipeline with singularity:
bash
nextflow run cnr-ibba/nf-chip-alignment -resume -profile test,singularity
Running pipeline
You can call this pipeline with you local data using the --genome and --manifest
options, for example:
bash
nextflow run cnr-ibba/nf-chip-alignment -resume -profile singularity --manifest <manifest.csv> --genome <reference genome>
gzipped input files are supported.
About alignments
Alignments are made using megablast task, which is more stringent and faster
then the default blastn.
Alignments are filtered relying percentage aligned and percentage
identity. A probe is considered unmapped if all alignments are filtered,
if there are more than one alignment after filtering or if the SNP doesn't
match to the reference genome. The last condition however is not correlated in
a issue in alignment, however it could be difficult to join this particular SNP
with other data or refer it to a reference SNP. For such reason SNP is discarded
even if there's a perfect match of the probe
Owner
- Name: CNR-IBBA
- Login: cnr-ibba
- Kind: organization
- Location: Milan
- Website: https://www.ibba.cnr.it/
- Repositories: 25
- Profile: https://github.com/cnr-ibba
Bioinformatic Group @ CNR-IBBA
GitHub Events
Total
- Issues event: 1
Last Year
- Issues event: 1