cooltools

The tools for your .cool's

https://github.com/open2c/cooltools

Science Score: 59.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 6 DOI reference(s) in README
  • Academic publication links
    Links to: zenodo.org
  • Committers with academic emails
    6 of 25 committers (24.0%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (16.9%) to scientific vocabulary
Last synced: 6 months ago

Repository

Basic Info
  • Host: GitHub
  • Owner: open2c
  • License: MIT
  • Language: Jupyter Notebook
  • Default Branch: master
  • Homepage:
  • Size: 17.2 MB
Statistics
  • Stars: 154
  • Watchers: 11
  • Forks: 52
  • Open Issues: 64
  • Releases: 19
Created about 9 years ago · Last pushed 8 months ago
Metadata Files
Readme Changelog Contributing License Zenodo

README.md

cooltools: enabling high-resolution Hi-C analysis in Python

tools for your .cools

Chromosome conformation capture technologies reveal the incredible complexity of genome folding. A growing number of labs and multiple consortia, including the 4D Nucleome, the International Nucleome Consortium, and ENCODE, are generating higher-resolution datasets to probe genome architecture across cell states, types, and organisms. Larger datasets increase the challenges at each step of computational analysis, from storage to memory to researchers’ time. The recently introduced cooler format readily handles storage of high-resolution datasets via a sparse data model.
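To make the idea of a sparse data model concrete, here is a toy sketch in plain Python. It illustrates the general principle only (keep just the nonzero upper-triangle entries of a symmetric contact matrix as rows of bin1, bin2, count) and is not a description of cooler's actual on-disk layout. The function names `to_sparse_pixels` and `to_dense` and the example matrix are hypothetical.

```python
def to_sparse_pixels(matrix):
    """Return (bin1, bin2, count) rows for the nonzero upper-triangle entries."""
    pixels = []
    n_bins = len(matrix)
    for i in range(n_bins):
        # A contact matrix is symmetric, so only j >= i needs to be stored.
        for j in range(i, n_bins):
            if matrix[i][j] != 0:
                pixels.append((i, j, matrix[i][j]))
    return pixels


def to_dense(pixels, n_bins):
    """Rebuild the full symmetric matrix from the sparse rows."""
    dense = [[0] * n_bins for _ in range(n_bins)]
    for i, j, count in pixels:
        dense[i][j] = count
        dense[j][i] = count
    return dense


# Made-up 4-bin contact matrix, for illustration only.
contacts = [
    [5, 2, 0, 0],
    [2, 7, 1, 0],
    [0, 1, 4, 0],
    [0, 0, 0, 3],
]
pixels = to_sparse_pixels(contacts)
```

Here 6 stored rows stand in for 16 matrix cells. At high resolution most bin pairs have no contacts, so the saving grows with matrix size.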

cooltools leverages this format to enable flexible and reproducible analysis of high-resolution data. cooltools provides a suite of computational tools with a paired Python API and command line access, which facilitates workflows either on high-performance computing clusters or via custom analysis notebooks. As part of the Open2C ecosystem, cooltools also provides detailed introductions to key concepts in Hi-C data analysis with interactive notebook documentation. For more information, see the preprint: https://doi.org/10.1101/2022.10.31.514564.

Requirements

The following are required before installing cooltools:

  • Python 3.7+
  • numpy
  • cython
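A quick way to confirm these prerequisites before installing is a small preflight script. The following is a minimal sketch, not part of cooltools; the helper names `python_is_supported` and `missing_prerequisites` are hypothetical, and it assumes the cython package is importable as the module `Cython`.

```python
import importlib.util
import sys

MIN_PYTHON = (3, 7)
# Import names of the prerequisites; cython is assumed importable as "Cython".
PREREQUISITES = ("numpy", "Cython")


def python_is_supported(version_info=None, minimum=MIN_PYTHON):
    """Return True if the interpreter version is at least `minimum`."""
    if version_info is None:
        version_info = sys.version_info
    return tuple(version_info[:2]) >= minimum


def missing_prerequisites(modules=PREREQUISITES, find_spec=importlib.util.find_spec):
    """Return the names of modules that cannot be located for import."""
    return [name for name in modules if find_spec(name) is None]


if __name__ == "__main__":
    if not python_is_supported():
        print("Python 3.7 or newer is required.")
    for name in missing_prerequisites():
        print(f"Missing prerequisite: {name}")
```

The `find_spec` parameter exists only so the check can be exercised without changing the environment; in normal use the defaults are enough.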

Installation

$ pip install cooltools

or install the latest version directly from GitHub:

$ pip install https://github.com/open2c/cooltools/archive/refs/heads/master.zip

See the requirements.txt file for information on compatible dependencies, especially for cooler and bioframe.

Documentation and Tutorials

Documentation can be found here: https://cooltools.readthedocs.io/en/latest/.

Cooltools offers a number of tutorials using the Open2C code ecosystem. For users who are new to Hi-C analysis, we recommend going through example notebooks in the following order:

For users interested in running analyses from the command line:

  • Command line interface: how to use the cooltools CLI.

Note that these notebooks currently focus on mammalian interphase Hi-C analysis, but are readily extendible to other organisms and cellular contexts. To clone notebooks for interactive analysis, visit https://github.com/open2c/open2c_examples. Docs for cooltools are built directly from these notebooks.

Contributing

Cooltools welcomes contributions. The guiding principles for tools are that they should (i) be as simple as possible, (ii) be as interpretable as possible, and (iii) not involve visualization. The following applies to contributing new functionality to cooltools.

New functionality should:

  • clearly define the problem
  • discuss alternative solutions
  • provide a separate example (as a gist, notebook, etc.) explaining its use cases on multiple datasets
  • be compatible with the latest versions of cooler and cooltools (e.g. it should run on any cooler generated by the latest version of cooler)

New functionality should either:

  • generalize or extend an existing tool without impairing user experience, and be submitted as a pull request to the relevant tool, or
  • extract a distinct feature of genome organization, and be submitted as a pull request to the sandbox

Vignettes, which use existing tools in new ways, should be submitted as pull requests to open2c_vignettes as distinct Jupyter notebooks, rather than to the cooltools sandbox. The bar for contributions to this repository is minimal. We recommend that each vignette include package version information and raise an error for other versions. Where it makes sense, the example data available for download through cooltools can be used to give others an easy way to try out the analysis. Otherwise, the source of the data can be specified so that others can obtain it.
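One way to follow the version recommendation is a guard cell at the top of the notebook. This is a minimal sketch under stated assumptions, not an established Open2C convention: the helper names `installed_version` and `require_versions` are hypothetical, the versions to pin are whatever the vignette author actually used, and `importlib.metadata` requires Python 3.8 or newer.

```python
from importlib import metadata


def installed_version(package):
    """Return the installed version string, or None if the package is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def require_versions(expected, get_version=installed_version):
    """Raise RuntimeError unless every package matches its expected version."""
    mismatches = []
    for package, wanted in expected.items():
        found = get_version(package)
        if found != wanted:
            mismatches.append(
                f"{package}: expected {wanted}, found {found or 'not installed'}"
            )
    if mismatches:
        raise RuntimeError(
            "This vignette was written for other package versions:\n  "
            + "\n  ".join(mismatches)
        )


# Example call for the first notebook cell (fill in the versions actually used):
# require_versions({"cooltools": "<version used>", "cooler": "<version used>"})
```

Failing early with a message that lists every mismatch tells readers which environment to recreate before they run the rest of the analysis.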

Practical aspects for contributing can be found in the guide here.

Citing cooltools

Open2C, Nezar Abdennur, Sameer Abraham, Geoffrey Fudenberg, Ilya M. Flyamer, Aleksandra A. Galitsyna, Anton Goloborodko, Maxim Imakaev, Betul A. Oksuz, and Sergey V. Venev*. “Cooltools: Enabling High-Resolution Hi-C Analysis in Python.” bioRxiv, November 1, 2022. https://doi.org/10.1101/2022.10.31.514564.

Owner

  • Name: Open Chromosome Collective
  • Login: open2c
  • Kind: organization
  • Email: open.chromosome.collective@gmail.com

GitHub Events

Total
  • Issues event: 24
  • Watch event: 19
  • Issue comment event: 75
  • Push event: 2
  • Pull request review comment event: 1
  • Pull request review event: 2
  • Pull request event: 2
  • Fork event: 1
Last Year
  • Issues event: 24
  • Watch event: 19
  • Issue comment event: 75
  • Push event: 2
  • Pull request review comment event: 1
  • Pull request review event: 2
  • Pull request event: 2
  • Fork event: 1

Committers

Last synced: 10 months ago

All Time
  • Total Commits: 866
  • Total Committers: 25
  • Avg Commits per committer: 34.64
  • Development Distribution Score (DDS): 0.688
Past Year
  • Commits: 9
  • Committers: 4
  • Avg Commits per committer: 2.25
  • Development Distribution Score (DDS): 0.556
Top Committers
Name Email Commits
sergpolly s****y@g****m 270
Nezar n****r@g****m 171
Anton Goloborodko g****n@g****m 127
Phlya f****r@g****m 120
gfudenberg g****g@g****m 72
Aleksandra Galitsyna a****a@g****m 45
Johannes Nuebler j****r@m****u 15
mimakaev m****v@g****m 10
Yao Xiao 3****x 6
Thomas Reimonn t****n@u****u 5
George Spracklin g****n@g****m 4
Geoff Fudenberg g****g@L****l 4
Elias Friman 4****n 3
betulakgol b****8@g****m 2
Sameer Abraham s****0@g****m 2
Eitan Hemed 3****d 1
Greg Schuette 8****e 1
Peter Kerpedjiev p****v@g****m 1
Tobias Sargeant t****s@g****m 1
Trevor Manz t****z@g****m 1
Deepti Kannan k****i@y****m 1
sergey venev v****s@v****u 1
Maksim Imakaev m****s@z****u 1
Maksim Imakaev m****s@w****u 1
ehildebr e****d@u****u 1

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 129
  • Total pull requests: 82
  • Average time to close issues: 3 months
  • Average time to close pull requests: 30 days
  • Total issue authors: 66
  • Total pull request authors: 17
  • Average comments per issue: 3.48
  • Average comments per pull request: 2.04
  • Merged pull requests: 62
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 18
  • Pull requests: 3
  • Average time to close issues: 2 months
  • Average time to close pull requests: 6 days
  • Issue authors: 14
  • Pull request authors: 2
  • Average comments per issue: 2.33
  • Average comments per pull request: 3.33
  • Merged pull requests: 2
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • gfudenberg (19)
  • sergpolly (9)
  • Phlya (6)
  • BenxiaHu (5)
  • ratheraarif (4)
  • agalitsyna (4)
  • dmalzl (4)
  • jiangshan529 (4)
  • Snehal1894 (3)
  • efriman (3)
  • ddepierre (3)
  • danobaji (3)
  • bskubi (2)
  • Jiangyuan-Liu (2)
  • thomas-reimonn (2)
Pull Request Authors
  • sergpolly (28)
  • Phlya (16)
  • Yaoyx (16)
  • gfudenberg (8)
  • nvictus (7)
  • golobor (6)
  • agalitsyna (5)
  • efriman (5)
  • thomas-reimonn (3)
  • dmalzl (2)
  • gschuette (2)
  • gdolsten (1)
  • manzt (1)
  • kannandeepti (1)
  • alexyakyshev (1)
Top Labels
Issue Labels
enhancement (17) bug (9) question (6) help wanted (1)
Pull Request Labels

Dependencies

.github/workflows/pypi-publish.yml actions
  • actions/checkout v2 composite
  • actions/setup-python v2 composite
.github/workflows/pytest.yml actions
  • actions/checkout v2 composite
  • actions/setup-python v1 composite
.github/workflows/python-publish-test.yml actions
  • actions/checkout v2 composite
  • actions/setup-python v2 composite
docs/requirements.txt pypi
  • Sphinx *
  • bioframe *
  • click >=7.0
  • docutils <=0.16
  • ipython *
  • multiprocess *
  • nbsphinx *
  • numpy *
  • pygments <3,>=2.4.1
  • recommonmark *
  • sphinx-click *
  • sphinx-rtd-theme *
requirements-dev.txt pypi
  • pytest * development
  • pytest-cov * development
  • pytest-flake8 * development
requirements.txt pypi
  • bioframe >=0.3.3
  • click >=7
  • cooler >=0.9.1
  • cython *
  • joblib *
  • matplotlib *
  • multiprocess *
  • numba *
  • numpy *
  • pandas >=1.5.1
  • scikit-image *
  • scikit-learn >=1.1.2
  • scipy *
pyproject.toml pypi
setup.py pypi