sbc

https://github.com/hyunjimoon/sbc

Science Score: 49.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
✓
DOI references
Found 7 DOI reference(s) in README
✓
Academic publication links
Links to: arxiv.org, biorxiv.org, springer.com
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (13.3%) to scientific vocabulary

Keywords

diagnostics simulation-based-inference

Last synced: 6 months ago · JSON representation

Repository

Basic Info

Host: GitHub
Owner: hyunjimoon
License: other
Language: R
Default Branch: master
Homepage: https://hyunjimoon.github.io/SBC
Size: 94.5 MB

Statistics

Stars: 61
Watchers: 7
Forks: 5
Open Issues: 47
Releases: 2

Topics

diagnostics simulation-based-inference

Created over 5 years ago · Last pushed 6 months ago

Metadata Files

Readme License Citation

Simulation-based Calibration: SBC

SBC provides tools to validate your Bayesian model and/or a sampling algorithm via the self-recovering property of Bayesian models. This package lets you run SBC easily and perform postprocessing and visualisations of the results to assess computational faithfulness.

Installation

To install the development version of SBC, run

r devtools::install_github("hyunjimoon/SBC")

Quick tour

To use SBC, you need a piece of code that generates simulated data that should match your model (a generator) and a statistical model + algorithm + algorithm parameters that can fit the model to data (a backend). SBC then lets you discover when the backend and generator don't encode the same data generating process (up to certain limitations).

For a quick example, we'll use a simple generator producing normally-distributed data (basically y <- rnorm(N, mu, sigma)) with a backend in Stan that mismatches the generator by wrongly assuming Stan parametrizes the normal distribution via precision (i.e. it has y ~ normal(mu, 1 / sigma ^ 2)).

```r library(SBC) gen <- SBCexamplegenerator("normal")

interface = "cmdstanr" or "rjags" is also supported

backendbad <- SBCexamplebackend("normalbad", interface = "rstan") ```

Note: Using the cmdstanr interface, a small number of rejected steps will be reported. Those are false positives and do not threaten validity (they happen during warmup). This is a result of difficulties in parsing the output of cmdstanr. We are working on a resolution.

You can use SBC_print_example_model("normal_bad") to inspect the model used.

We generate 50 simulated datasets and perform SBC:

r ds <- generate_datasets(gen, n_sims = 50) results_bad <- compute_SBC(ds, backend_bad)

The results then give us diagnostic plots that immediately show a problem: the distribution of SBC ranks is not uniform as witnessed by both the rank histogram and the difference between sample ECDF and the expected deviations from theoretical CDF.

r plot_rank_hist(results_bad) plot_ecdf_diff(results_bad)

We can then run SBC with a backend that uses the correct parametrization (i.e. with y ~ normal(mu, sigma)):

```r backendsd <- SBCexamplebackend("normalsd", interface = "rstan") resultssd <- computeSBC(ds, backend_sd)

plotrankhist(resultssd) plotecdfdiff(resultssd) ```

The diagnostic plots show no problems in this case. As with any other software test, we can observe clear failures, but absence of failures does not imply correctness. We can however make the SBC check more thorough by using a lot of simulations and including suitable derived quantities to guard against known limitations of vanilla SBC.

Paralellization

The examples above are very fast to compute, but in real use cases, you almost certainly want to let the computation run in parallel via the future package.

r library(future) plan(multisession)

More information

The package vignettes provide additional context and examples. Notably:

The main vignette has more theoretical background and instructions how to integrate your own simulation code and models with SBC.
Small model workflow discusses how SBC integrates with model implementation workflow and how you can use SBC to safely develop complex models step-by-step.

Currently SBC supports cmdstanr, rstan, and brms models out of the box. With a little additional work, you can integrate SBC with any exact or approximate fitting method as shown in the Implementing backends vignette.

Citing SBC and related software

Developing and maintaining open source software is an important yet often underappreciated contribution to scientific progress. Thus, whenever you are using open source software, please make sure to cite it appropriately so that developers get credit for their work.

When using SBC, please cite the following publication:

Modrk, M., Moon, A. H., Kim, S., Brkner, P., Huurre, N., Faltejskov, K., ... & Vehtari, A. (2023). Simulation-based calibration checking for Bayesian computation: The choice of test quantities shapes sensitivity. Bayesian Analysis, advance publication, DOI: 10.1214/23-BA1404.

Further, SBC relies on several other R packages and, of course, on R itself. To find out how to cite R and its packages, use citation function. SBC specifically rely on posterior for manipulating posterior draws and future for parallel processing. Also, do not forget to cite the probabilistic inference tool you use as backend (e.g. Stan, JAGS, brms, ...)

References

Theoretical support
- Simulation-Based Calibration Checking for Bayesian Computation: The Choice of Test Quantities Shapes Sensitivity Modrk, Moon, Kim, Brkner, Huurre, Faltejskov, Gelman, Vehtari, 2023
- Validating Bayesian Inference Algorithms with Simulation-Based Calibration Talts, Betancourt, Simpson, Vehtari, Gelman, 2018
- Graphical Test for Discrete Uniformity and its Applications in Goodness of Fit Evaluation and Multiple Sample Comparison Silynoja, Brkner, Vehtari, 2021
- Bayesian Workflow, Gelman et al., 2020
- Toward a principled Bayesian workflow in cognitive science Schad, Betancourt, Vasishth, 2021
- Bayes factor workflow Schad, Nicenboim, Brkner, Betancourt, Vasishth, 2021
Application support

FAQ

How does calibration relate to prediction accuracy?

Comparing the ground truth and the simulated result is a backbone of calibration and comparison target greatly affects the calibrated (i.e. trained) result, similar to reward in reinforcement learning. In this sense, if the U(a(y), theta) term is designed for prediction, the model will be calibrated to have best predictive result as possible.

Acknowledgements

Development of this package was supported by ELIXIR CZ research infrastructure project (Ministry of Youth, Education and Sports of the Czech Republic, Grant No: LM2018131) including access to computing and storage facilities.

Owner

Name: Angie.H Moon
Login: hyunjimoon
Kind: user
Company: @nextoptdev

Website: hyunjimoon.com
Repositories: 8
Profile: https://github.com/hyunjimoon

GitHub Events

Total

Issues event: 19
Watch event: 7
Issue comment event: 29
Push event: 16
Pull request event: 6
Fork event: 4

Last Year

Issues event: 19
Watch event: 7
Issue comment event: 29
Push event: 16
Pull request event: 6
Fork event: 4

Issues and Pull Requests

Last synced: 6 months ago

All Time

Total issues: 16
Total pull requests: 3
Average time to close issues: about 3 years
Average time to close pull requests: 5 months
Total issue authors: 5
Total pull request authors: 2
Average comments per issue: 1.81
Average comments per pull request: 1.67
Merged pull requests: 2
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 4
Pull requests: 2
Average time to close issues: about 1 month
Average time to close pull requests: 28 days
Issue authors: 4
Pull request authors: 2
Average comments per issue: 0.5
Average comments per pull request: 2.5
Merged pull requests: 1
Bot issues: 0
Bot pull requests: 0

View more stats

Top Authors

Issue Authors

hyunjimoon (11)
martinmodrak (6)
mhollanders (3)
n-kall (2)
lunafazio (1)
abhsarma (1)
emstruong (1)
rroyaute (1)

Pull Request Authors

bmfazio (3)
timostenz (2)
lunafazio (1)
hyunjimoon (1)

Top Labels

Issue Labels

scalable (2) metric (1) apply (1) enhancement (1)

Pull Request Labels

Dependencies

DESCRIPTION cran

R6 * imports
abind * imports
dplyr * imports
future.apply * imports
ggplot2 * imports
memoise * imports
posterior >= 1.0.0 imports
purrr * imports
stats * imports
stringi * imports
tidyr * imports
tidyselect * imports
MASS * suggests
MCMCpack * suggests
bayesplot * suggests
brms * suggests
cmdstanr >= 0.4.0 suggests
formula.tools * suggests
knitr * suggests
medicaldata * suggests
mvtnorm * suggests
patchwork * suggests
rmarkdown * suggests
rstan * suggests
testthat * suggests

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science

sbc