Science Score: 67.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 4 DOI reference(s) in README -
✓Academic publication links
Links to: zenodo.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (10.4%) to scientific vocabulary
Keywords
Repository
Illumina short read assembly data pipeline for phage
Basic Info
- Host: GitHub
- Owner: JoshuaIszatt
- Language: Python
- Default Branch: master
- Homepage: https://pypi.org/project/phanta/
- Size: 133 KB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 1
- Releases: 2
Topics
Metadata Files
README.md
Phanta
Configurable short read assembly pipeline for phages:
Figure 1: Rough phage assembly pipeline, dotted line indicates where processing begins.
Install dependencies
Create the environment
sh conda env create --file phanta.ymlActivate the environment:
sh conda activate phantaInstall phanta pipeline:
sh pip install phantaOptional: Set up the checkv database
sh checkv download_database /path/to/checkv-db
sh
export CHECKVDB=/path/to/checkv-db
Usage
Open a python terminal and enter:
py
import phanta
dir(phanta)
Detect reads:
This will return a list of Reads class objects that can be passed individually or as a batch to assembly pipelines
py
import phanta
reads = phanta.detect_reads('path_to_input_directory/')
Single phage assembly (Uses the default configurations below if no config parameter is specified):
py
import phanta
reads = phanta.detect_reads('path_to_input_directory/')
phanta.assembly_pipeline(reads[0], 'output_directory/')
Batch phage assembly (Using your own edited config file):
py
import phanta
phanta.batch_assembly_pipeline('path_to_input_directory/', 'output_directory/', config_file='/path_to_config.json')
Reads QC only:
py
import phanta
phanta.batch_assembly_pipeline('path_to_input_directory/', 'output_directory/', qc_only=True)
Citation:
There is a CITATION file in this repository (.CFF), however Phanta may also be cited like so:
Iszatt, Joshua James. (2024). PHANTA short read assembly pipeline (Version v0.3) [Computer software]. Zenodo. https://doi.org/10.5281/zenodo.13981011
Dependencies:
- python>=3
- checkv==1.0.3
- biopython==1.83
- bbmap==39.06
- pandas==2.2.1
- matplotlib==3.8.4
- spades==3.15.5
- fastqc==0.12.1
- multiqc==1.22.1
- pilon==1.24
- packaging
- deprecated
Configuration
These are the defaults, amend as required and pass to the pipeline functions
json
{
"system": {
"RAM": 24000,
"threads": 8
},
"input": {
"interleaved_ext": "_interleaved.fastq",
"r1_ext": "_R1.fastq.gz",
"r2_ext": "_R2.fastq.gz"
},
"reads": {
"read_length": 150,
"trim_length": 10,
"minimum_length": 100,
"read_quality": 30,
"error_correction": true,
"target_coverage": 200
},
"assembly": {
"kmers": "55,77,99,127"
},
"extraction": {
"mincov": 60,
"minlen": 4000,
"mincomplete":90
}
}
Owner
- Name: Joshua Iszatt
- Login: JoshuaIszatt
- Kind: user
- Location: Perth
- Twitter: JoshuaIszatt
- Repositories: 1
- Profile: https://github.com/JoshuaIszatt
PhD candidate. Interested in microbial genomics and protein structures. https://www.researchgate.net/profile/Joshua-Iszatt
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Iszatt"
given-names: "Joshua James"
orcid: "https://orcid.org/0000-0002-6394-5058"
title: "PHANTA short read assembly pipeline"
version: "v0.3"
identifiers:
- type: doi
value: 10.5281/zenodo.13981011
url: "https://github.com/JoshuaIszatt/phanta"
date-released: 2024-06-25
GitHub Events
Total
- Release event: 1
- Delete event: 1
- Member event: 1
- Push event: 4
- Create event: 9
Last Year
- Release event: 1
- Delete event: 1
- Member event: 1
- Push event: 4
- Create event: 9