hetGPy
hetGPy: Heteroskedastic Gaussian Process Modeling in Python - Published in JOSS (2025)
Science Score: 100.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 8 DOI reference(s) in README and JOSS metadata -
✓Academic publication links
Links to: joss.theoj.org -
✓Committers with academic emails
2 of 5 committers (40.0%) from academic institutions -
○Institutional organization owner
-
✓JOSS paper metadata
Published in Journal of Open Source Software
Keywords from Contributors
Scientific Fields
Repository
Python implementation of the hetGP R package
Basic Info
- Host: GitHub
- Owner: davidogara
- License: lgpl-2.1
- Language: Jupyter Notebook
- Default Branch: main
- Size: 11.7 MB
Statistics
- Stars: 5
- Watchers: 4
- Forks: 2
- Open Issues: 0
- Releases: 6
Metadata Files
README.md
hetGPy: Heteroskedastic Gaussian Process Modeling in Python
hetGPy is a Python implementation of the hetGP R library. Please see our JOSS paper for an in-depth introduction to the package.
This package has the goals of:
* Matching the behavior of the R package
* Having minimal Python and external dependencies, which are:
* numpy and scipy for computations
* matplotlib for visualization
* joblib for parallelization
* tqdm for progress bars
* Eigen (C++) for fast calculations (when vectorization is non-obvious)
The motivation for such a package is due to the rising popularity of implementing simulation models (also known as computer experiments) in Python.
Documentation
The package documentation is available at: https://hetgpy.readthedocs.io/en/latest/
Installing and Environments
pypi
hetGPyis available on pypi:
pip install hetgpy
Development Version:
python -m pip install git+https://github.com/davidogara/hetGPy.git
- To build from the source files:
- Clone the repository. Make sure to include
--recurve-submodulesif you do not already haveEigeninstalled on your system:
git clone --recurse-submodules https://github.com/davidogara/hetGPy.git
- With
hetGPyas your current working directory:pip install -e .
We recommend installing in a virtual environment. One way to do this with venv is:
python3.10 -m venv .venv
After this you should be able to run the examples in the examples folder.
Note on Dependencies
hetGPyrequiresscipy>=1.14.0which fixed a memory leakage issue when usingL-BFGS-Binscipy.optimize.minizmize. That version of scipy requires Python 3.10.Since
hetGPyis designed for large-scale problems, this was chosen as a necessary feature. Experienced users may be able to roll back some of the dependencies, but this is not the recommended use.hetGPyalso requires a c++17 compiler andEigenfor the underlying covariance functions. Eigen 3.4.0 is included with the source files (and is a submodule of the git repository), but experienced users may wish to link against their own installation.
Contact
For questions regarding this package, please contact:
David O'Gara
Division of Computational and Data Sciences, Washington University in St. Louis
david.ogara@wustl.edu
References
Binois M, Gramacy RB (2021). “hetGP: Heteroskedastic Gaussian Process Modeling and Sequential Design in R.” Journal of Statistical Software, 98(13), 1-44. doi:10.18637/jss.v098.i13 https://doi.org/10.18637/jss.v098.i13
Owner
- Name: David O'Gara
- Login: davidogara
- Kind: user
- Location: St. Louis, MO
- Repositories: 1
- Profile: https://github.com/davidogara
PhD Student at WashU DCDS. Interested in Systems Science methods for Public Health.
JOSS Publication
hetGPy: Heteroskedastic Gaussian Process Modeling in Python
Authors
Tags
Gaussian Processes computer experiments Bayesian OptimizationCitation (CITATION.cff)
cff-version: "1.2.0"
authors:
- family-names: O'Gara
given-names: David
orcid: "https://orcid.org/0000-0002-1957-400X"
- family-names: Binois
given-names: Mickaël
orcid: "https://orcid.org/0000-0002-7225-1680"
- family-names: Garnett
given-names: Roman
orcid: "https://orcid.org/0000-0002-0152-5453"
- family-names: Hammond
given-names: Ross A
orcid: "https://orcid.org/0009-0005-1046-9296"
contact:
- family-names: O'Gara
given-names: David
orcid: "https://orcid.org/0000-0002-1957-400X"
doi: 10.5281/zenodo.14782341
message: If you use this software, please cite our article in the
Journal of Open Source Software.
preferred-citation:
authors:
- family-names: O'Gara
given-names: David
orcid: "https://orcid.org/0000-0002-1957-400X"
- family-names: Binois
given-names: Mickaël
orcid: "https://orcid.org/0000-0002-7225-1680"
- family-names: Garnett
given-names: Roman
orcid: "https://orcid.org/0000-0002-0152-5453"
- family-names: Hammond
given-names: Ross A
orcid: "https://orcid.org/0009-0005-1046-9296"
date-published: 2025-02-05
doi: 10.21105/joss.07518
issn: 2475-9066
issue: 106
journal: Journal of Open Source Software
publisher:
name: Open Journals
start: 7518
title: "hetGPy: Heteroskedastic Gaussian Process Modeling in Python"
type: article
url: "https://joss.theoj.org/papers/10.21105/joss.07518"
volume: 10
title: "`hetGPy`: Heteroskedastic Gaussian Process Modeling in Python"
GitHub Events
Total
- Create event: 18
- Release event: 10
- Issues event: 26
- Watch event: 4
- Delete event: 11
- Issue comment event: 50
- Push event: 147
- Pull request event: 15
- Fork event: 1
Last Year
- Create event: 18
- Release event: 10
- Issues event: 26
- Watch event: 4
- Delete event: 11
- Issue comment event: 50
- Push event: 147
- Pull request event: 15
- Fork event: 1
Committers
Last synced: 7 months ago
Top Committers
| Name | Commits | |
|---|---|---|
| David O'Gara | d****a@w****u | 493 |
| Mickaël Binois | 1****s | 9 |
| Dan Waxman | d****n@s****u | 4 |
| dependabot[bot] | 4****] | 1 |
| Lia Schattner | l****r@L****l | 1 |
Committer Domains (Top 20 + Academic)
Issues and Pull Requests
Last synced: 6 months ago
All Time
- Total issues: 14
- Total pull requests: 22
- Average time to close issues: 7 days
- Average time to close pull requests: about 17 hours
- Total issue authors: 4
- Total pull request authors: 4
- Average comments per issue: 3.71
- Average comments per pull request: 0.18
- Merged pull requests: 20
- Bot issues: 0
- Bot pull requests: 2
Past Year
- Issues: 14
- Pull requests: 22
- Average time to close issues: 7 days
- Average time to close pull requests: about 17 hours
- Issue authors: 4
- Pull request authors: 4
- Average comments per issue: 3.71
- Average comments per pull request: 0.18
- Merged pull requests: 20
- Bot issues: 0
- Bot pull requests: 2
Top Authors
Issue Authors
- Edenhofer (5)
- DanWaxman (5)
- davidogara (2)
- matthewfeickert (2)
Pull Request Authors
- davidogara (12)
- mbinois (9)
- DanWaxman (6)
- dependabot[bot] (2)
