hyppo

Python package for multivariate hypothesis testing

https://github.com/neurodata/hyppo

Science Score: 64.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: arxiv.org
  • Committers with academic emails
    4 of 24 committers (16.7%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (14.8%) to scientific vocabulary

Keywords

data-science hacktoberfest hypothesis-testing independence ksample-testing python

Keywords from Contributors

networks graph-statistics continual-learning decision-trees deep-neural-networks domain-adaptation random-forests transfer-learning network-simulation hacking
Last synced: 4 months ago

Repository

Python package for multivariate hypothesis testing

Basic Info
  • Host: GitHub
  • Owner: neurodata
  • License: mit-0
  • Language: Python
  • Default Branch: main
  • Homepage: https://hyppo.neurodata.io/
  • Size: 33.5 MB
Statistics
  • Stars: 231
  • Watchers: 5
  • Forks: 92
  • Open Issues: 46
  • Releases: 14
Topics
data-science hacktoberfest hypothesis-testing independence ksample-testing python
Created over 6 years ago · Last pushed 6 months ago
Metadata Files
Readme Contributing License Code of conduct Citation

README.md

hyppo


hyppo (HYPothesis Testing in PythOn, pronounced "Hippo") is an open-source software package for multivariate hypothesis testing. We decided to develop hyppo for the following reasons:

  • As the amount of data grows in many fields, hypothesis testing for high-dimensional and nonlinear data is increasingly important.
  • Libraries in R exist, but their interfaces are inconsistent, and most are not available in Python.

hyppo aims to be a comprehensive multivariate hypothesis testing package that runs on all major operating systems. It also includes novel tests not found in other packages. It is quick to install and free of charge. If you need multivariate hypothesis testing, be sure to give hyppo a try!
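To make "hypothesis testing for nonlinear data" concrete, here is a minimal standard-library sketch of a permutation-based independence test. It is an illustration only, not hyppo's code or API: the choice of distance covariance as the statistic, and the function names `centered_distances`, `dcov_sq`, and `permutation_independence_test`, are assumptions made for this example.

```python
import math
import random


def centered_distances(points):
    """Pairwise Euclidean distance matrix, double-centered."""
    n = len(points)
    d = [[math.dist(p, q) for q in points] for p in points]
    row_mean = [sum(row) / n for row in d]  # equals column means (d is symmetric)
    grand_mean = sum(row_mean) / n
    return [
        [d[i][j] - row_mean[i] - row_mean[j] + grand_mean for j in range(n)]
        for i in range(n)
    ]


def dcov_sq(a, b, perm):
    """Squared sample distance covariance with the rows/columns of b permuted."""
    n = len(a)
    total = sum(a[i][j] * b[perm[i]][perm[j]] for i in range(n) for j in range(n))
    return total / n**2


def permutation_independence_test(x, y, reps=200, seed=0):
    """Return (statistic, p-value) for the null hypothesis that x and y are independent."""
    rng = random.Random(seed)
    a, b = centered_distances(x), centered_distances(y)
    identity = list(range(len(x)))
    observed = dcov_sq(a, b, identity)
    exceed = 0
    for _ in range(reps):
        perm = identity[:]
        rng.shuffle(perm)  # shuffling y's sample order breaks any dependence on x
        if dcov_sq(a, b, perm) >= observed:
            exceed += 1
    # Add-one correction keeps the p-value strictly positive.
    return observed, (1 + exceed) / (1 + reps)


# Nonlinear example: y = x^2 on a symmetric grid has no linear trend in x,
# yet y is fully determined by x.
x = [(-1 + 2 * i / 39,) for i in range(40)]
y = [(t[0] ** 2,) for t in x]
stat, pvalue = permutation_independence_test(x, y)
print(f"statistic={stat:.4f}, p-value={pvalue:.4f}")
```

Each sample is a tuple, so the same functions accept multivariate points of any dimension. The pure-Python loops are quadratic in the sample size and only practical for small inputs.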

Website: https://hyppo.neurodata.io/

Installation

Dependencies

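hyppo requires the following (version bounds taken from the requirements.txt listed under Dependencies):

  • autograd (>=1.3)
  • joblib (>=0.17.0)
  • numba (>=0.46)
  • numpy (>=1.17, <=1.23.5)
  • scikit-learn (>=0.22)
  • scipy (>=1.4.0)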

User installation

The easiest way to install hyppo is using pip.

    pip install hyppo

To upgrade to a newer release, use the --upgrade flag.

    pip install --upgrade hyppo

The documentation includes more detailed installation instructions. hyppo is free software; you can redistribute it and/or modify it under the terms of the license.
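After installing or upgrading, one way to confirm which release is present is to query the package metadata from Python. The sketch below uses only the standard library; the helper name `installed_version` is made up for this example and is not part of hyppo.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package):
    """Return the installed version string of a distribution, or None if it is absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


found = installed_version("hyppo")
print(f"hyppo {found}" if found else "hyppo is not installed in this environment")
```

Reading the metadata avoids importing the package itself, so the check works even when an import would be slow or would fail for an unrelated reason.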

Release Notes

See the release notes for a history of notable changes to hyppo.

Development

We welcome new contributors of all experience levels. The hyppo community's goals are to be helpful, welcoming, and effective. The contributor guide has detailed information about contributing code, documentation, and tests.

Note: We recently moved our benchmarks folder, which contains the figure replication code for our papers, to a new repo. We plan to keep test development code and paper figure replication code there, and to add the relevant tests (with tutorials) to hyppo.

Contributors

Thanks go to these wonderful people:

[Image: contributor avatars, made with contrib.rocks]

Project History

hyppo is a rebranding of mgcpy, which was founded in November 2018. mgcpy was designed and written by @tpsatish95, @sampan501, @junhaobearxiong, @sundaysundya, @ananyas713, and @ronakdm. hyppo was designed and written by @sampan501 and first released in February 2020.

Owner

  • Name: neurodata
  • Login: neurodata
  • Kind: organization
  • Email: admin@neurodata.io
  • Location: everywhere

Citation (CITATION.cff)

# This CITATION.cff file was generated with cffinit.
# Visit https://bit.ly/cffinit to generate yours today!

cff-version: 1.2.0
title: hyppo
message: 'If you use this software, please cite it as below'
type: software
authors:
  - given-names: Sambit
    family-names: Panda
    email: spanda3@Jhu.edu
    affiliation: Johns Hopkins University
    orcid: 'https://orcid.org/0000-0001-8455-4243'
  - orcid: 'https://orcid.org/0000-0001-6115-719X'
    given-names: Satish
    family-names: Palaniappan
    affiliation: Johns Hopkins University
  - given-names: Eric W.
    family-names: Bridgeford
    affiliation: Johns Hopkins School of Public Health
  - given-names: Ronak
    family-names: Mehta
    affiliation: Johns Hopkins University
  - affiliation: University of Delaware
    given-names: Cencheng
    family-names: Shen
  - given-names: Joshua T.
    family-names: Vogelstein
    affiliation: Johns Hopkins University
    orcid: 'https://orcid.org/0000-0003-2487-6237'
identifiers:
  - type: doi
    value: 10.48550/arXiv.1907.02088
    description: arXiv
repository-code: 'https://github.com/neurodata/hyppo/tree/main'
url: 'https://hyppo.neurodata.io/'
repository: 'https://github.com/neurodata/hyppo-papers'
repository-artifact: 'https://github.com/neurodata/mgcpy-old'
abstract: >-
  We introduce hyppo, a unified library for performing
  multivariate hypothesis testing, including independence,
  two-sample, and k-sample testing. While many multivariate
  independence tests have R packages available, the
  interfaces are inconsistent and most are not available in
  Python. hyppo includes many state of the art multivariate
  testing procedures. The package is easy-to-use and is
  flexible enough to enable future extensions.
keywords:
  - Python
  - multivariate
  - independence
  - k-sample
  - hypothesis
preferred-citation:
  type: article
  authors:
  - family-names: "Panda"
    given-names: "Sambit"
    orcid: "https://orcid.org/0000-0001-8455-4243"
  - family-names: "Palaniappan"
    given-names: "Satish"
  - family-names: "Bridgeford"
    given-names: "Eric W."
  - family-names: "Mehta"
    given-names: "Ronak"
  - family-names: "Shen"
    given-names: "Cencheng"
  - family-names: "Vogelstein"
    given-names: "Joshua T."
    orcid: "https://orcid.org/0000-0003-2487-6237"
  doi: "10.48550/arXiv.1907.02088"
  journal: "arXiv"
  month: 9
  # start: 1 # First page number
  # end: 10 # Last page number
  title: "hyppo: A Multivariate Hypothesis Testing Python Package"
  # issue: 1
  # volume: 1
  year: 2021
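For reference managers that expect BibTeX, the `preferred-citation` block above can be converted mechanically. The following is a minimal sketch: the field values are copied from the CITATION.cff above, while the helper name `cff_to_bibtex` and the citation key `panda2021hyppo` are arbitrary choices for this example.

```python
def cff_to_bibtex(key, entry):
    """Format a CFF-style preferred-citation mapping as a BibTeX @article entry."""
    authors = " and ".join(
        f"{a['family-names']}, {a['given-names']}" for a in entry["authors"]
    )
    fields = {
        "title": entry["title"],
        "author": authors,
        "journal": entry["journal"],
        "year": str(entry["year"]),
        "doi": entry["doi"],
    }
    body = ",\n".join(f"  {name} = {{{value}}}" for name, value in fields.items())
    return f"@article{{{key},\n{body}\n}}"


# Values transcribed from the preferred-citation section of CITATION.cff.
preferred_citation = {
    "title": "hyppo: A Multivariate Hypothesis Testing Python Package",
    "authors": [
        {"family-names": "Panda", "given-names": "Sambit"},
        {"family-names": "Palaniappan", "given-names": "Satish"},
        {"family-names": "Bridgeford", "given-names": "Eric W."},
        {"family-names": "Mehta", "given-names": "Ronak"},
        {"family-names": "Shen", "given-names": "Cencheng"},
        {"family-names": "Vogelstein", "given-names": "Joshua T."},
    ],
    "journal": "arXiv",
    "year": 2021,
    "doi": "10.48550/arXiv.1907.02088",
}

bibtex = cff_to_bibtex("panda2021hyppo", preferred_citation)
print(bibtex)
```

The sketch hard-codes the mapping to stay dependency-free. Reading the file directly would need a YAML parser, which is not in the Python standard library.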

GitHub Events

Total
  • Create event: 3
  • Release event: 1
  • Issues event: 7
  • Watch event: 14
  • Delete event: 2
  • Member event: 2
  • Issue comment event: 40
  • Push event: 9
  • Pull request review comment event: 2
  • Pull request review event: 3
  • Pull request event: 18
  • Fork event: 9
Last Year
  • Create event: 3
  • Release event: 1
  • Issues event: 7
  • Watch event: 14
  • Delete event: 2
  • Member event: 2
  • Issue comment event: 40
  • Push event: 9
  • Pull request review comment event: 2
  • Pull request review event: 3
  • Pull request event: 18
  • Fork event: 9

Committers

Last synced: 10 months ago

All Time
  • Total Commits: 330
  • Total Committers: 24
  • Avg Commits per committer: 13.75
  • Development Distribution Score (DDS): 0.164
Past Year
  • Commits: 15
  • Committers: 5
  • Avg Commits per committer: 3.0
  • Development Distribution Score (DDS): 0.333
Top Committers
Name | Email | Commits
Sambit Panda | s****1@g****m | 276
allcontributors[bot] | 4****] | 14
sampan501 | s****3@j****u | 11
Ronan Perry | r****y@g****m | 3
SUKI | I****2@g****m | 3
Vivek Gopalakrishnan | v****4@j****u | 2
Alex1-ai | 7****i | 2
Jaewon Chung | j****8@g****m | 2
Ronak Mehta | r****3@j****u | 2
Satarupa Deb | 8****D | 1
Riccardo Orsi | 1****7 | 1
PoojaMandal | 6****l | 1
Jennifer Tam | 8****m | 1
Jayanta Dey | 5****4 | 1
Haoyin Xu | h****u@g****m | 1
Bijan Varjavand | b****d@g****m | 1
Benjamin Pedigo | b****o@g****m | 1
Alexander Oakley | 2****a | 1
Shruti Naik | 0****3@n****n | 1
Vatsalya Vyas | 1****s | 1
alyakin314 | a****4@g****m | 1
dependabot[bot] | 4****] | 1
ohpraveenprasad | 1****d | 1
saivythik | 6****k | 1
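The all-time summary figures can be reproduced from the per-committer counts above. The sketch below assumes the Development Distribution Score is defined as one minus the top committer's share of all commits. That definition is not stated on this page, but it matches the reported value.

```python
def development_distribution_score(commit_counts):
    """Assumed definition: 1 - (commits by the top committer / total commits)."""
    return 1 - max(commit_counts) / sum(commit_counts)


# Commit counts from the Top Committers table: nine committers with more than
# one commit, followed by fifteen committers with a single commit each.
all_time = [276, 14, 11, 3, 3, 2, 2, 2, 2] + [1] * 15

total_commits = sum(all_time)
total_committers = len(all_time)
average = total_commits / total_committers
dds = development_distribution_score(all_time)

print(total_commits, total_committers, average, round(dds, 3))
```

Under the same assumed formula, the past-year DDS of 0.333 on 15 commits would correspond to a top committer with 10 of those commits.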

Issues and Pull Requests

Last synced: 4 months ago

All Time
  • Total issues: 61
  • Total pull requests: 117
  • Average time to close issues: 6 months
  • Average time to close pull requests: about 2 months
  • Total issue authors: 31
  • Total pull request authors: 40
  • Average comments per issue: 3.25
  • Average comments per pull request: 2.58
  • Merged pull requests: 86
  • Bot issues: 0
  • Bot pull requests: 34
Past Year
  • Issues: 6
  • Pull requests: 20
  • Average time to close issues: 30 days
  • Average time to close pull requests: 28 days
  • Issue authors: 4
  • Pull request authors: 8
  • Average comments per issue: 2.67
  • Average comments per pull request: 2.5
  • Merged pull requests: 13
  • Bot issues: 0
  • Bot pull requests: 2
Top Authors
Issue Authors
  • sampan501 (14)
  • bdpedigo (6)
  • rflperry (4)
  • oakla (3)
  • adam2392 (3)
  • PSSF23 (2)
  • ebridge2 (2)
  • srecupe2 (2)
  • ameliecr (2)
  • darsh-patel (2)
  • victoris93 (1)
  • j1c (1)
  • Rayerdyne (1)
  • jovo (1)
  • transpersonify (1)
Pull Request Authors
  • sampan501 (30)
  • allcontributors[bot] (28)
  • oakla (7)
  • ricor07 (6)
  • dependabot[bot] (5)
  • j1c (4)
  • Alex1-ai (3)
  • tjoab (2)
  • ebridge2 (2)
  • ohpraveenprasad (2)
  • darsh-patel (2)
  • harsh204016 (2)
  • LaggyMan (2)
  • zdbzdb123123 (2)
  • vikram-8290 (2)
Top Labels
Issue Labels
  • enhancement (23)
  • bug (21)
  • ndd (15)
  • documentation (11)
  • good first issue (5)
  • question (4)
  • hyppo.ksample (4)
  • help wanted (1)
  • hyppo.discrim (1)
Pull Request Labels
  • documentation (3)
  • hyppo.independence (3)
  • bug (2)
  • enhancement (2)
  • dependencies (2)
  • python (2)

Packages

  • Total packages: 2
  • Total downloads:
    • pypi 152,370 last-month
  • Total docker downloads: 80
  • Total dependent packages: 3
    (may contain duplicates)
  • Total dependent repositories: 11
    (may contain duplicates)
  • Total versions: 15
  • Total maintainers: 1
pypi.org: hyppo

A comprehensive independence testing package

  • Versions: 14
  • Dependent Packages: 3
  • Dependent Repositories: 11
  • Downloads: 152,370 Last month
  • Docker Downloads: 80
Rankings
Docker downloads count: 3.0%
Downloads: 3.8%
Average: 4.2%
Dependent repos count: 4.4%
Stargazers count: 4.7%
Forks count: 4.7%
Dependent packages count: 4.8%
Maintainers (1)
Last synced: 4 months ago
conda-forge.org: hyppo
  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Forks count: 27.4%
Stargazers count: 27.6%
Dependent repos count: 34.0%
Average: 35.0%
Dependent packages count: 51.2%
Last synced: 4 months ago

Dependencies

dev-requirements.txt pypi
  • autograd * development
  • black * development
  • codecov * development
  • coverage * development
  • numba ==0.49.1 development
  • pytest * development
  • pytest-cov * development
  • twine * development
docs/requirements.txt pypi
  • docutils ==0.17.1
  • ipykernel ==5.4.3
  • ipython ==7.31.1
  • matplotlib *
  • myst_parser *
  • nbsphinx ==0.8.0
  • numpydoc ==0.7.0
  • pydata-sphinx-theme ==0.7.1
  • seaborn *
  • sphinx ==4.0.2
  • sphinx-gallery ==0.8.2
  • sphinx_sitemap *
  • sphinxcontrib-bibtex >=2.2.0
  • sphinxcontrib-rawfiles ==0.1.1
requirements.txt pypi
  • autograd >=1.3
  • joblib >=0.17.0
  • numba >=0.46
  • numpy >=1.17,<=1.23.5
  • scikit-learn >=0.22
  • scipy >=1.4.0
setup.py pypi
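
The numpy entry in requirements.txt above is the only one with both a lower and an upper bound (`>=1.17,<=1.23.5`). As a rough illustration of what that range admits, the sketch below compares plain numeric release strings. It is a simplification with hypothetical helper names: it does not handle pre-releases, post-releases, or other PEP 440 forms, for which the `packaging` library's `SpecifierSet` is the more robust choice.

```python
def parse_release(text):
    """Turn a plain numeric version such as '1.23.5' into a tuple of ints."""
    return tuple(int(part) for part in text.split("."))


def pad(release, length):
    """Right-pad a release tuple with zeros so that 1.17 compares equal to 1.17.0."""
    return release + (0,) * (length - len(release))


def within_bounds(version_text, lower, upper):
    """True if lower <= version <= upper, comparing numeric release segments only."""
    parts = [parse_release(t) for t in (version_text, lower, upper)]
    width = max(len(p) for p in parts)
    v, lo, hi = (pad(p, width) for p in parts)
    return lo <= v <= hi


# Bounds copied from the numpy line of requirements.txt.
NUMPY_LOWER, NUMPY_UPPER = "1.17", "1.23.5"

for candidate in ("1.16.6", "1.17", "1.23.5", "1.24.0"):
    print(candidate, within_bounds(candidate, NUMPY_LOWER, NUMPY_UPPER))
```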