hetGPy

hetGPy: Heteroskedastic Gaussian Process Modeling in Python - Published in JOSS (2025)

https://github.com/davidogara/hetgpy

Science Score: 100.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 8 DOI reference(s) in README and JOSS metadata
  • Academic publication links
    Links to: joss.theoj.org
  • Committers with academic emails
    2 of 5 committers (40.0%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
    Published in Journal of Open Source Software

Keywords from Contributors

mesh

Scientific Fields

Engineering Computer Science - 32% confidence
Last synced: 6 months ago · JSON representation ·

Repository

Python implementation of the hetGP R package

Basic Info
  • Host: GitHub
  • Owner: davidogara
  • License: lgpl-2.1
  • Language: Jupyter Notebook
  • Default Branch: main
  • Size: 11.7 MB
Statistics
  • Stars: 5
  • Watchers: 4
  • Forks: 2
  • Open Issues: 0
  • Releases: 6
Created over 2 years ago · Last pushed 8 months ago
Metadata Files
Readme Changelog Contributing License Citation

README.md

hetGPy: Heteroskedastic Gaussian Process Modeling in Python

DOI PyPI Downloads PyPI version

hetGPy is a Python implementation of the hetGP R library. Please see our JOSS paper for an in-depth introduction to the package.

This package has the goals of: * Matching the behavior of the R package * Having minimal Python and external dependencies, which are: * numpy and scipy for computations * matplotlib for visualization * joblib for parallelization * tqdm for progress bars * Eigen (C++) for fast calculations (when vectorization is non-obvious)

The motivation for such a package is due to the rising popularity of implementing simulation models (also known as computer experiments) in Python.

Documentation

The package documentation is available at: https://hetgpy.readthedocs.io/en/latest/

Installing and Environments

pypi

  • hetGPy is available on pypi:

pip install hetgpy

Development Version:

python -m pip install git+https://github.com/davidogara/hetGPy.git

  • To build from the source files:
  1. Clone the repository. Make sure to include --recurve-submodules if you do not already have Eigen installed on your system:

git clone --recurse-submodules https://github.com/davidogara/hetGPy.git

  1. With hetGPy as your current working directory: pip install -e .

We recommend installing in a virtual environment. One way to do this with venv is: python3.10 -m venv .venv

After this you should be able to run the examples in the examples folder.

Note on Dependencies

  • hetGPy requires scipy>=1.14.0 which fixed a memory leakage issue when using L-BFGS-B in scipy.optimize.minizmize. That version of scipy requires Python 3.10.

  • Since hetGPy is designed for large-scale problems, this was chosen as a necessary feature. Experienced users may be able to roll back some of the dependencies, but this is not the recommended use.

  • hetGPy also requires a c++17 compiler and Eigen for the underlying covariance functions. Eigen 3.4.0 is included with the source files (and is a submodule of the git repository), but experienced users may wish to link against their own installation.

Contact

For questions regarding this package, please contact:
David O'Gara
Division of Computational and Data Sciences, Washington University in St. Louis
david.ogara@wustl.edu

References

Binois M, Gramacy RB (2021). “hetGP: Heteroskedastic Gaussian Process Modeling and Sequential Design in R.” Journal of Statistical Software, 98(13), 1-44. doi:10.18637/jss.v098.i13 https://doi.org/10.18637/jss.v098.i13

Owner

  • Name: David O'Gara
  • Login: davidogara
  • Kind: user
  • Location: St. Louis, MO

PhD Student at WashU DCDS. Interested in Systems Science methods for Public Health.

JOSS Publication

hetGPy: Heteroskedastic Gaussian Process Modeling in Python
Published
February 05, 2025
Volume 10, Issue 106, Page 7518
Authors
David O'Gara ORCID
Division of Computationl and Data Sciences, Washington University in St. Louis, USA
Mickaël Binois ORCID
Acumes team, Université Côte d'Azur, Inria, Sophia Antipolis, France
Roman Garnett ORCID
Division of Computationl and Data Sciences, Washington University in St. Louis, USA, Department of Computer Science, McKelvey School of Engineering, Washington University in St. Louis, USA
Ross A. Hammond ORCID
Division of Computationl and Data Sciences, Washington University in St. Louis, USA, School of Public Health, Washington University in St. Louis, USA, Center on Social Dynamics and Policy, Brookings Institution, Washington DC, USA, Santa Fe Institute, Santa Fe, USA
Editor
Matthew Feickert ORCID
Tags
Gaussian Processes computer experiments Bayesian Optimization

Citation (CITATION.cff)

cff-version: "1.2.0"
authors:
- family-names: O'Gara
  given-names: David
  orcid: "https://orcid.org/0000-0002-1957-400X"
- family-names: Binois
  given-names: Mickaël
  orcid: "https://orcid.org/0000-0002-7225-1680"
- family-names: Garnett
  given-names: Roman
  orcid: "https://orcid.org/0000-0002-0152-5453"
- family-names: Hammond
  given-names: Ross A
  orcid: "https://orcid.org/0009-0005-1046-9296"
contact:
- family-names: O'Gara
  given-names: David
  orcid: "https://orcid.org/0000-0002-1957-400X"
doi: 10.5281/zenodo.14782341
message: If you use this software, please cite our article in the
  Journal of Open Source Software.
preferred-citation:
  authors:
  - family-names: O'Gara
    given-names: David
    orcid: "https://orcid.org/0000-0002-1957-400X"
  - family-names: Binois
    given-names: Mickaël
    orcid: "https://orcid.org/0000-0002-7225-1680"
  - family-names: Garnett
    given-names: Roman
    orcid: "https://orcid.org/0000-0002-0152-5453"
  - family-names: Hammond
    given-names: Ross A
    orcid: "https://orcid.org/0009-0005-1046-9296"
  date-published: 2025-02-05
  doi: 10.21105/joss.07518
  issn: 2475-9066
  issue: 106
  journal: Journal of Open Source Software
  publisher:
    name: Open Journals
  start: 7518
  title: "hetGPy: Heteroskedastic Gaussian Process Modeling in Python"
  type: article
  url: "https://joss.theoj.org/papers/10.21105/joss.07518"
  volume: 10
title: "`hetGPy`: Heteroskedastic Gaussian Process Modeling in Python"

GitHub Events

Total
  • Create event: 18
  • Release event: 10
  • Issues event: 26
  • Watch event: 4
  • Delete event: 11
  • Issue comment event: 50
  • Push event: 147
  • Pull request event: 15
  • Fork event: 1
Last Year
  • Create event: 18
  • Release event: 10
  • Issues event: 26
  • Watch event: 4
  • Delete event: 11
  • Issue comment event: 50
  • Push event: 147
  • Pull request event: 15
  • Fork event: 1

Committers

Last synced: 7 months ago

All Time
  • Total Commits: 508
  • Total Committers: 5
  • Avg Commits per committer: 101.6
  • Development Distribution Score (DDS): 0.03
Past Year
  • Commits: 269
  • Committers: 4
  • Avg Commits per committer: 67.25
  • Development Distribution Score (DDS): 0.052
Top Committers
Name Email Commits
David O'Gara d****a@w****u 493
Mickaël Binois 1****s 9
Dan Waxman d****n@s****u 4
dependabot[bot] 4****] 1
Lia Schattner l****r@L****l 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 14
  • Total pull requests: 22
  • Average time to close issues: 7 days
  • Average time to close pull requests: about 17 hours
  • Total issue authors: 4
  • Total pull request authors: 4
  • Average comments per issue: 3.71
  • Average comments per pull request: 0.18
  • Merged pull requests: 20
  • Bot issues: 0
  • Bot pull requests: 2
Past Year
  • Issues: 14
  • Pull requests: 22
  • Average time to close issues: 7 days
  • Average time to close pull requests: about 17 hours
  • Issue authors: 4
  • Pull request authors: 4
  • Average comments per issue: 3.71
  • Average comments per pull request: 0.18
  • Merged pull requests: 20
  • Bot issues: 0
  • Bot pull requests: 2
Top Authors
Issue Authors
  • Edenhofer (5)
  • DanWaxman (5)
  • davidogara (2)
  • matthewfeickert (2)
Pull Request Authors
  • davidogara (12)
  • mbinois (9)
  • DanWaxman (6)
  • dependabot[bot] (2)
Top Labels
Issue Labels
Pull Request Labels
dependencies (2) python (2)