https://github.com/fernphy/ftol
Pipeline to generate the Fern Tree of Life
Science Score: 36.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
✓DOI references
Found 4 DOI reference(s) in README -
✓Academic publication links
Links to: zenodo.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (10.3%) to scientific vocabulary
Repository
Pipeline to generate the Fern Tree of Life
Basic Info
- Host: GitHub
- Owner: fernphy
- License: mit
- Language: R
- Default Branch: main
- Homepage: https://fernphy.github.io/
- Size: 9.39 MB
Statistics
- Stars: 4
- Watchers: 1
- Forks: 1
- Open Issues: 16
- Releases: 3
Metadata Files
README.md
FTOL
Code repository to generate an automatically updated Fern Tree of Life.
Please see the accompanying paper: - Nitta JH, Schuettpelz E, Ramírez-Barahona S, Iwasaki W. An open and continuously updated fern tree of life (FTOL) https://doi.org/10.1101/2022.03.31.486640
All code is in R, and workflow is controlled with the targets package.
Docker
A docker image is available to run the code.
Docker tags match the version of FTOL; e.g., the image with tag 1.0.1 is the image used to generate FTOL v1.0.1.
Setup
Run the setup.R script to create the folder structure needed to store external files and download most of the data files automatically.
source("R/setup.R")
Alternatively, you can manually create the following folder hierarchy yourself:
_targets
└── user
├── data_raw
│ ├── ref_aln
│ └── restez
│ └── sql_db
├── intermediates
│ ├── blast_sanger
│ ├── iqtree
│ │ ├── plastome
│ │ ├── sanger
│ │ ├── sanger_1
│ │ ├── sanger_10
│ │ ├── sanger_2
│ │ ├── sanger_3
│ │ ├── sanger_4
│ │ ├── sanger_5
│ │ ├── sanger_6
│ │ ├── sanger_7
│ │ ├── sanger_8
│ │ ├── sanger_9
│ │ └── sanger_fast
│ ├── ref_seqs
│ └── treepl
│ ├── con
│ ├── ml
│ └── ts
└── results
Another folder called ftol_data (to store data files generated by this workflow that will be made available via the ftolr R package) also needs to be created in the project root. This folder is itself a repo that can be cloned from https://github.com/fernphy/ftol_data.
Data
If setup.R was run successfully, it will have already downloaded and unzipped the input data files from FigShare.
Alternatively, you can do so manually following these instructions:
- Download
ref_aln.tar.gz(reference alignments) andrestez_sql_db.tar.gz(local GenBank database) from FigShare (https://doi.org/10.6084/m9.figshare.19474316) - Unzip
ref_aln.tar.gzand put theref_alnfolder in_targets/user/data_raw/ - Unzip
restez_sql_db.tar.gzand put thebatandsql_logsfolders in_targets/user/data_raw/restez/sql_db.
Running the code
Data preparation scripts
The script to generate the local GenBank database (
restez_sql_db.tar.gz) is setup_gb.R.The script (targets workflow) to generate the reference FASTA files for extracting target gene regions (
ref_aln.tar.gz) is preprefseqs_plan.R
The data that result from these scripts have been made available on FigShare as described above, so these generally shouldn't need to be run.
Main workflow
- _targets.R defines the main workflow to generate FTOL. This can be run with
targets::tar_make().
Note that this code was designed to be run on a multi-core machine, so the number of cores specified (e.g., here) may need to be changed.
The complete workflow takes 1-2 weeks to complete, with phylogenetic analysis taking up by far most of the time.
Running with Docker
Launch a container in the background:
docker run \
--rm \
-dt \
-v ${PWD}:/wd \
-w /wd \
-e USERID=$(id -u) \
-e GROUPID=$(id -g) \
-v $HOME/.gitconfig:/home/user/.gitconfig:ro \
-v $HOME/.ssh:/home/user/.ssh:ro \
joelnitta/ftol:latest bash
License
- code: MIT
Owner
- Name: fernphy
- Login: fernphy
- Kind: organization
- Website: https://fernphy.github.io/
- Repositories: 8
- Profile: https://github.com/fernphy
FTOL Working Group
GitHub Events
Total
- Push event: 5
Last Year
- Push event: 5
Dependencies
- rocker/rstudio 4.2.0 build
- joelnitta/ftol latest
- docker/build-push-action v4 composite
- docker/login-action v2 composite
- docker/setup-buildx-action v2 composite
- docker/setup-qemu-action v2 composite