https://github.com/fernphy/ftol

Pipeline to generate the Fern Tree of Life

https://github.com/fernphy/ftol

Science Score: 36.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 4 DOI reference(s) in README
  • Academic publication links
    Links to: zenodo.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (10.3%) to scientific vocabulary
Last synced: 6 months ago · JSON representation

Repository

Pipeline to generate the Fern Tree of Life

Basic Info
Statistics
  • Stars: 4
  • Watchers: 1
  • Forks: 1
  • Open Issues: 16
  • Releases: 3
Created about 4 years ago · Last pushed about 1 year ago
Metadata Files
Readme License

README.md

FTOL

DOI

Code repository to generate an automatically updated Fern Tree of Life.

Please see the accompanying paper: - Nitta JH, Schuettpelz E, Ramírez-Barahona S, Iwasaki W. An open and continuously updated fern tree of life (FTOL) https://doi.org/10.1101/2022.03.31.486640

All code is in R, and workflow is controlled with the targets package.

Docker

A docker image is available to run the code.

Docker tags match the version of FTOL; e.g., the image with tag 1.0.1 is the image used to generate FTOL v1.0.1.

Setup

Run the setup.R script to create the folder structure needed to store external files and download most of the data files automatically.

source("R/setup.R")

Alternatively, you can manually create the following folder hierarchy yourself:

_targets └── user ├── data_raw │ ├── ref_aln │ └── restez │ └── sql_db ├── intermediates │ ├── blast_sanger │ ├── iqtree │ │ ├── plastome │ │ ├── sanger │ │ ├── sanger_1 │ │ ├── sanger_10 │ │ ├── sanger_2 │ │ ├── sanger_3 │ │ ├── sanger_4 │ │ ├── sanger_5 │ │ ├── sanger_6 │ │ ├── sanger_7 │ │ ├── sanger_8 │ │ ├── sanger_9 │ │ └── sanger_fast │ ├── ref_seqs │ └── treepl │ ├── con │ ├── ml │ └── ts └── results

Another folder called ftol_data (to store data files generated by this workflow that will be made available via the ftolr R package) also needs to be created in the project root. This folder is itself a repo that can be cloned from https://github.com/fernphy/ftol_data.

Data

If setup.R was run successfully, it will have already downloaded and unzipped the input data files from FigShare.

Alternatively, you can do so manually following these instructions:

  1. Download ref_aln.tar.gz (reference alignments) and restez_sql_db.tar.gz (local GenBank database) from FigShare (https://doi.org/10.6084/m9.figshare.19474316)
  2. Unzip ref_aln.tar.gz and put the ref_aln folder in _targets/user/data_raw/
  3. Unzip restez_sql_db.tar.gz and put the bat and sql_logs folders in _targets/user/data_raw/restez/sql_db.

Running the code

Data preparation scripts

  • The script to generate the local GenBank database (restez_sql_db.tar.gz) is setup_gb.R.

  • The script (targets workflow) to generate the reference FASTA files for extracting target gene regions (ref_aln.tar.gz) is preprefseqs_plan.R

The data that result from these scripts have been made available on FigShare as described above, so these generally shouldn't need to be run.

Main workflow

  • _targets.R defines the main workflow to generate FTOL. This can be run with targets::tar_make().

Note that this code was designed to be run on a multi-core machine, so the number of cores specified (e.g., here) may need to be changed.

The complete workflow takes 1-2 weeks to complete, with phylogenetic analysis taking up by far most of the time.

Running with Docker

Launch a container in the background:

docker run \ --rm \ -dt \ -v ${PWD}:/wd \ -w /wd \ -e USERID=$(id -u) \ -e GROUPID=$(id -g) \ -v $HOME/.gitconfig:/home/user/.gitconfig:ro \ -v $HOME/.ssh:/home/user/.ssh:ro \ joelnitta/ftol:latest bash

License

Owner

  • Name: fernphy
  • Login: fernphy
  • Kind: organization

FTOL Working Group

GitHub Events

Total
  • Push event: 5
Last Year
  • Push event: 5

Dependencies

Dockerfile docker
  • rocker/rstudio 4.2.0 build
docker-compose.yml docker
  • joelnitta/ftol latest
.github/workflows/docker-publish.yaml actions
  • docker/build-push-action v4 composite
  • docker/login-action v2 composite
  • docker/setup-buildx-action v2 composite
  • docker/setup-qemu-action v2 composite