hichipper

A preprocessing and QC pipeline for HiChIP data

https://github.com/aryeelab/hichipper

Science Score: 33.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 2 DOI reference(s) in README
  • Academic publication links
    Links to: biorxiv.org, nature.com
  • Committers with academic emails
    2 of 5 committers (40.0%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (15.0%) to scientific vocabulary

Keywords

bioinformatics chromatin-interaction epigenetics hichip
Last synced: 6 months ago · JSON representation

Repository

A preprocessing and QC pipeline for HiChIP data

Basic Info
  • Host: GitHub
  • Owner: aryeelab
  • License: mit
  • Language: HTML
  • Default Branch: master
  • Homepage:
  • Size: 224 MB
Statistics
  • Stars: 40
  • Watchers: 6
  • Forks: 14
  • Open Issues: 49
  • Releases: 0
Topics
bioinformatics chromatin-interaction epigenetics hichip
Created over 9 years ago · Last pushed over 3 years ago
Metadata Files
Readme License

README.md




Build Status PyPI version Documentation Status License: MIT DOI Join the chat at https://gitter.im/aryeelab/hichipper

This package is maintained by Caleb Lareau in the Aryee Lab. Source code is made freely available here and a packaged install version is provided through PyPi.

About

The hichipper package implements our data processing and quality control pipeline for HiChIP data. This package takes output from a HiC-Pro run and a sample manifest file (.yaml) that coordinates optional high-quality peaks (identified through ChIP-Seq) and restriction fragment locations (see folder here) as input and produces output that can be used to 1) determine library quality, 2) identify and characterize DNA loops and 3) interactively visualize loops. Loops are assigned strength and confidence metrics that can be used to evaluate samples individually or for differential analysis in downstream tools like diffloop.

We have used the library QC metrics with as few as 1 million reads, enabling library quality to be assessed through shallow (and cheap) sequencing before performing a full depth sequencing run.

A graphical overview showing how hichipper integrates with other tools in the analysis of raw HiChIP data is shown in the overview figure below. Detailed descriptions of the different branches of input, output, and configuration for hichipper are discussed in the documentation.

big1 big2

Installation/Documentation/FAQ/.etc

Check out the hichipper documentation for detailed installation instructions, dependency configuration, and other information regarding the tool and its implementation.

Workflow Overview

A simple graphical guide to processing HiChIP data is shown below. The role of hichipper is to import aligned read files from (e.g. HiC-Pro) as well as location of restriction fragment files (available here) coordinated through a .yaml configuration file and produce user-friendly output.

In particular, hichipper allows users to pre-supply their own set of gold-standard peaks (e.g. from ChIP-Seq) or call peaks directly from HiChIP data using a novel background detection algorithm. In either case, interactions and chromatin loops can be called using a restriction fragment-aware approach that substantially increases read density in loops.

hichipper_overview

Quality control reports

In the qcReports folder, we collect the .html QC report files associated with text annotations from the experiments performed in the original HiChIP manuscript as well as other reports generated by anonymous collaborators that demonstrate libraries that did not prepare well, likely due to poor in situ ligation. To determine the quality of a new HiChIP library, we recommend comparing the vital statistics and interactive tables and figures between existing libraries.

Citation

If you use hichipper for your work, please cite:

Lareau, C.A. and Aryee, M.J. (2018) "hichipper: A preprocessing pipeline for assessing library quality and DNA loops from HiChIP data." bioRxiv doi: https://doi.org/10.1101/192302

Questions/comments/feedback

are always welcomed. The easiest way for us to have correspondence (if appropriate/interesting for the public) is through raising a new issue or our hichipper chat room on gitter. Otherwise, email Caleb with any private concerns.

hichipper logo made with autowide font with these blue and red color definitions.

Owner

  • Name: aryeelab
  • Login: aryeelab
  • Kind: organization

GitHub Events

Total
  • Issues event: 1
  • Watch event: 6
  • Pull request event: 1
  • Fork event: 2
Last Year
  • Issues event: 1
  • Watch event: 6
  • Pull request event: 1
  • Fork event: 2

Committers

Last synced: over 2 years ago

All Time
  • Total Commits: 202
  • Total Committers: 5
  • Avg Commits per committer: 40.4
  • Development Distribution Score (DDS): 0.074
Past Year
  • Commits: 0
  • Committers: 0
  • Avg Commits per committer: 0.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
caleblareau c****u@g****u 187
Martin Aryee m****e@g****m 11
Afif Elghraoui a****i@n****v 2
areyesq89 a****s@g****m 1
The Gitter Badger b****r@g****m 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 91
  • Total pull requests: 8
  • Average time to close issues: 3 months
  • Average time to close pull requests: 3 days
  • Total issue authors: 55
  • Total pull request authors: 4
  • Average comments per issue: 2.31
  • Average comments per pull request: 1.0
  • Merged pull requests: 5
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 2
  • Pull requests: 2
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 2
  • Pull request authors: 1
  • Average comments per issue: 0.0
  • Average comments per pull request: 0.0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • caleblareau (15)
  • vidaletal (6)
  • werhoog (5)
  • AnaAzambuja (4)
  • skytguuu (3)
  • dongxiaota (2)
  • afrendeiro (2)
  • martinaryee (2)
  • lijiacd985 (2)
  • armenabnousi (2)
  • aperreault (2)
  • gouthamatla (2)
  • jlpulice (2)
  • jing-wan (1)
  • jamesdalg (1)
Pull Request Authors
  • caleblareau (4)
  • 5Aki1 (2)
  • gitter-badger (1)
  • 0xaf1f (1)
Top Labels
Issue Labels
enhancement (6) bug (2)
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads:
    • pypi 33 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 1
  • Total versions: 24
  • Total maintainers: 1
pypi.org: hichipper

Processing HiChIP data into loops.

  • Versions: 24
  • Dependent Packages: 0
  • Dependent Repositories: 1
  • Downloads: 33 Last month
Rankings
Dependent packages count: 10.0%
Forks count: 10.2%
Stargazers count: 11.1%
Average: 13.9%
Downloads: 16.7%
Dependent repos count: 21.7%
Maintainers (1)
Last synced: 6 months ago