https://github.com/carolindahms/assembly-with-omni-c

Assemblinng a reference genome with PacBio and Omni-C reads

https://github.com/carolindahms/assembly-with-omni-c

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (5.2%) to scientific vocabulary
Last synced: 10 months ago · JSON representation

Repository

Assemblinng a reference genome with PacBio and Omni-C reads

Basic Info
  • Host: GitHub
  • Owner: carolindahms
  • Default Branch: main
  • Size: 12.7 KB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created over 1 year ago · Last pushed over 1 year ago
Metadata Files
Readme

README.md

Genome assembly with PacBio and Omni-C reads

  1. Pre-process reads
    • Convert Pacbio Revio bam files to fastq with pbtk
    • Asses read-length distribution in R plots
    • Remove adaptors with bbtools and HiFiAdapterFilt
  2. Count kmers with meryl
  3. Assess genome with Genomescope2.0
  4. Assemble with hifiasm in solo mode
    • run with different -t -s and purging (-l) settings eg -t 15 or 20; -s 0.75, 0.55 or 0.35, -l 1 for light purging
    • run with optimum K (-k)
  5. Evaluate hifiasm results
    • compare multiple assemblies with different filters using bbtools script statswrapper.sh
    • BUSCO with different databases for both primary and alternate haplotigs using different databases (e.g. vertebrata and actinopterygii)
    • assembly stats with gfastats
    • Merqury plots and stats based on kmers
  6. Purge haplotigs using Genomescope stats and minimap2 + evaluate final contig assembly with gfastats

Scaffolding

  1. Preprocess
  2. Map reads separately with bwa mem following steps from Dovetail tutorial
  3. Pairtools to generate bam file + QC
    • Record valid ligation events with pairtools
    • Sort pairsam
    • Mark duplicates
  4. Scaffolding with YaHS + QC + remove contaminants
  5. Omni-C contact maps with JuicerTools

Owner

  • Login: carolindahms
  • Kind: user

GitHub Events

Total
  • Push event: 19
  • Create event: 2
Last Year
  • Push event: 19
  • Create event: 2