bayesianreasoning

R package with functions to plot and help understand Positive and Negative Predictive Values, and their relationship with Sensitivity, Specificity and Prevalence.

https://github.com/gorkang/bayesianreasoning

Science Score: 36.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
○
DOI references
✓
Academic publication links
Links to: zenodo.org
○
Committers with academic emails
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (11.3%) to scientific vocabulary

Keywords

bayesian-inference negative-predictive-value positive-predictive-value r

Last synced: 9 months ago · JSON representation

Repository

R package with functions to plot and help understand Positive and Negative Predictive Values, and their relationship with Sensitivity, Specificity and Prevalence.

Basic Info

Host: GitHub
Owner: gorkang
Language: R
Default Branch: main
Homepage: https://gorkang.shinyapps.io/BayesianReasoning/
Size: 69.2 MB

Statistics

Stars: 9
Watchers: 1
Forks: 1
Open Issues: 8
Releases: 6

Topics

bayesian-inference negative-predictive-value positive-predictive-value r

Created about 9 years ago · Last pushed 9 months ago

Metadata Files

Readme Changelog

Bayesian reasoning

Bayesian reasoning in medical contexts

This package includes a few functions to plot and help understand Positive and Negative Predictive Values, and their relationship with Sensitivity, Specificity and Prevalence.

The Positive Predictive Value of a medical test is the probability that a positive result will mean having the disease. Formally p(Disease|+)
The Negative Predictive Value of a medical test is the probability that a negative result will mean not having the disease. Formally p(Healthy|-)

The BayesianReasoning package has three main functions:

PPV_heatmap(): Plot heatmaps with PPV or NPV values for the given test and disease parameters.
PPV_diagnostic_vs_screening(): Plots the difference between the PPV of a test in a diagnostic context (very high prevalence; or a common study sample, e.g. ~50% prevalence) versus a screening context (lower prevalence).
min_possible_prevalence(): Calculates how high should the prevalence of a disease be to reach a desired PPV given certain test parameters.

If you want to install the package can use: remotes::install_github("gorkang/BayesianReasoning"). Please report any problems you find in the Issues Github page.

There is a shiny app implementation with most of the main features available.

PPV_heatmap()

Plot heatmaps with PPV or NPV values for a given specificity and a range of Prevalences and FP or FN (1 - Sensitivity). The basic parameters are:

min_Prevalence: Min prevalence in y axis. “min_Prevalence out of y”
max_Prevalence: Max prevalence in y axis. “1 out of max_Prevalence”
Sensitivity: Sensitivity of the test
max_FP: FP is 1 - specificity. The x axis will go from FP = 0% to max_FP
Language: “es” for Spanish or “en” for English

plot = PPV_heatmap(
  min_Prevalence = 1,
  max_Prevalence = 1000,
  Sensitivity = 100,
  limits_Specificity = c(90, 100),
  Language = "en"
)

NPV

You can also plot an NPV heatmap with PPV_NPV = “NPV”.

plot = PPV_heatmap(
  PPV_NPV = "NPV",
  min_Prevalence = 800,
  max_Prevalence = 1000,
  Specificity = 95,
  limits_Sensitivity = c(90, 100),
  Language = "en"
)

Area overlay

You can add different types of overlay to the plots.

For example, an area overlay showing the point PPV for a given prevalence and FP or FN:

plot = PPV_heatmap(
  min_Prevalence = 1,
  max_Prevalence = 1200,
  Sensitivity = 81,
  limits_Specificity = c(94, 100),
  label_subtitle = "Prenatal screening for Down Syndrome by Age",
  overlay = "area",
  overlay_labels = "40 y.o.",
  overlay_position_FP = 4.8,
  overlay_prevalence_1 = 1,
  overlay_prevalence_2 = 68
)

The area plot overlay can show more details about how the calculation of PPV/NPV is performed:

plot = PPV_heatmap(
  min_Prevalence = 1,
  max_Prevalence = 1200,
  Sensitivity = 81,
  limits_Specificity = c(94, 100),
  label_subtitle = "Prenatal screening for Down Syndrome by Age",
  overlay_extra_info = TRUE,
  overlay = "area",
  overlay_labels = "40 y.o.",
  overlay_position_FP = 4.8,
  overlay_prevalence_1 = 1,
  overlay_prevalence_2 = 68
)

Line overlay

Also, you can add a line overlay highlighting a range of prevalences and FP. This is useful, for example, to show how the PPV of a test changes with age:

plot = PPV_heatmap(
  min_Prevalence = 1, max_Prevalence = 1800, 
  Sensitivity = 90, 
  limits_Specificity = c(84, 100),
  label_subtitle = "PPV of Mammogram for Breast Cancer by Age",
  overlay = "line", 
  overlay_labels = c("80 y.o.", "70 y.o.", "60 y.o.", "50 y.o.", "40 y.o.", "30 y.o.", "20  y.o."),
  overlay_position_FP = c(6.5, 7, 8, 9, 12, 14, 14),
  overlay_prevalence_1 = c(1, 1, 1, 1, 1, 1, 1),
  overlay_prevalence_2 = c(22, 26, 29, 44, 69, 227, 1667)
  )

Another example. In this case, the FP is constant across age:

plot = PPV_heatmap(
  min_Prevalence = 1, max_Prevalence = 2000, Sensitivity = 81, 
  limits_Specificity = c(94, 100),
  label_subtitle = "Prenatal screening for Down Syndrome by Age",
  overlay = "line",
  overlay_labels = c("40 y.o.", "30 y.o.", "20 y.o."),
  overlay_position_FP = c(4.8, 4.8, 4.8),
  overlay_prevalence_1 = c(1, 1, 1),
  overlay_prevalence_2 = c(68, 626, 1068)
  )

PPV_diagnostic_vs_screening()

In scientific studies developing a new test for the early detection of a medical condition, it is quite common to use a sample where 50% of participants has a medical condition and the other 50% are normal controls. This has the unintended effect of maximizing the PPV of the test.

This function shows a plot with the difference between the PPV of a diagnostic context (very high prevalence; or a common study sample, e.g. ~50% prevalence) versus that of a screening context (lower prevalence).

PPV_diagnostic_vs_screening(max_FP = 10, 
                            Sensitivity = 100, 
                            prevalence_screening_group = 1000, 
                            prevalence_diagnostic_group = 2)

min_possible_prevalence()

Imagine you would like to use a test in a population and want to have a 98% PPV. That is, IF a positive result comes out in the test, you would like a 98% certainty that it is a true positive.

How high should the prevalence of the disease be in that group?

min_possible_prevalence(Sensitivity = 100, 
                        FP_test = 0.1, 
                        min_PPV_desired = 98)

To reach a PPV of 98 when using a test with 100 % Sensitivity and 0.1 % False Positive Rate, you need a prevalence of at least 1 out of 21

Another example, with a very good test, and lower expectations:

min_possible_prevalence(Sensitivity = 99.9, 
                        FP_test = .1, 
                        min_PPV_desired = 70)

To reach a PPV of 70 when using a test with 99.9 % Sensitivity and 0.1 % False Positive Rate, you need a prevalence of at least 1 out of 429

plot_cutoff()

Since v0.4.2 you can also plot the distributions of sick and healthy individuals and learn about how a cutoff point changes the True Positives, False Positives, True Negatives, False Negatives, Sensitivity, Specificity, PPV and NPV.

PLOTS = plot_cutoff(prevalence = 0.2,
                    cutoff_point = 33, 
                    mean_sick = 35, 
                    mean_healthy = 20, 
                    sd_sick = 3, 
                    sd_healthy = 5
                    )

PLOTS$final_plot

Then, with remove_layers_cutoff_plot() you can remove specific layers, to help you understand some of these concepts.

# Sensitivity
remove_layers_cutoff_plot(PLOTS$final_plot, delete_what = c("FP", "TN")) + ggplot2::labs(subtitle = "Sensitivity = TP/(TP+FN)")

# Specificity
remove_layers_cutoff_plot(PLOTS$final_plot, delete_what = c("FN", "TP")) + ggplot2::labs(subtitle = "Specificity = TN/(TN+FP)")

# PPV
remove_layers_cutoff_plot(PLOTS$final_plot, delete_what = c("TN", "FN")) + ggplot2::labs(subtitle = "PPV = TP/(TP+FP)")

# NPV
remove_layers_cutoff_plot(PLOTS$final_plot, delete_what = c("TP", "FP")) + ggplot2::labs(subtitle = "NPV = TN/(TN+FN)")

Owner

Name: Gorka Navarrete
Login: gorkang
Kind: user
Location: Spain

Website: https://gorkang.github.io/GN-web/
Twitter: gorkang
Repositories: 47
Profile: https://github.com/gorkang

Bayesian reasoning and shared medical decision making. Center for Social and Cognitive Neuroscience (CSCN). School of Psychology. Adolfo Ibañez University

GitHub Events

Total

Create event: 1
Release event: 1
Issues event: 1
Issue comment event: 1
Push event: 3

Last Year

Create event: 1
Release event: 1
Issues event: 1
Issue comment event: 1
Push event: 3

Committers

Last synced: over 2 years ago

All Time

Total Commits: 206
Total Committers: 1
Avg Commits per committer: 206.0
Development Distribution Score (DDS): 0.0

Past Year

Commits: 0
Committers: 0
Avg Commits per committer: 0.0
Development Distribution Score (DDS): 0.0

Top Committers

Name	Email	Commits
gorkang	g**g@g**m	206

Issues and Pull Requests

Last synced: 9 months ago

All Time

Total issues: 40
Total pull requests: 13
Average time to close issues: 6 months
Average time to close pull requests: less than a minute
Total issue authors: 3
Total pull request authors: 1
Average comments per issue: 0.43
Average comments per pull request: 0.0
Merged pull requests: 13
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 1
Pull requests: 0
Average time to close issues: 3 days
Average time to close pull requests: N/A
Issue authors: 1
Pull request authors: 0
Average comments per issue: 1.0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

View more stats

Top Authors

Issue Authors

gorkang (38)
teunbrand (1)
thomasp85 (1)

Pull Request Authors

gorkang (13)

Top Labels

Issue Labels

bug (18) enhancement (16) IMPORTANT (3) check me (2)

Pull Request Labels

Packages

Total packages: 1
Total downloads:
- cran 353 last-month

Total dependent packages: 0
Total dependent repositories: 0
Total versions: 7
Total maintainers: 1

cran.r-project.org: BayesianReasoning

Plot Positive and Negative Predictive Values for Medical Tests

Homepage: https://github.com/gorkang/BayesianReasoning
Documentation: http://cran.r-project.org/web/packages/BayesianReasoning/BayesianReasoning.pdf
License: CC0
Latest release: 0.4.3
published 9 months ago

Versions: 7
Dependent Packages: 0
Dependent Repositories: 0
Downloads: 353 Last month

Rankings

Stargazers count: 18.7%

Forks count: 21.9%

Dependent packages count: 29.8%

Average: 30.5%

Dependent repos count: 35.5%

Downloads: 46.6%

Maintainers (1)

gorkang@gmail.com

Last synced: 9 months ago

Dependencies

DESCRIPTION cran

R >= 3.5.0 depends
dplyr * imports
ggforce * imports
ggplot2 * imports
magrittr * imports
reshape2 * imports
stats * imports
tibble * imports
tidyr * imports
utils * imports
curl * suggests
httr * suggests
knitr * suggests
patchwork * suggests
purrr * suggests
rmarkdown * suggests
testthat * suggests

.github/workflows/rhub.yaml actions

r-hub/actions/checkout v1 composite
r-hub/actions/platform-info v1 composite
r-hub/actions/run-check v1 composite
r-hub/actions/setup v1 composite
r-hub/actions/setup-deps v1 composite
r-hub/actions/setup-r v1 composite

bayesianreasoning

Science Score: 36.0%

Keywords

Repository

Basic Info

Statistics

Topics

Metadata Files

README.md

Bayesian reasoning

Bayesian reasoning in medical contexts

PPV_heatmap()

NPV

Area overlay

Line overlay

PPV_diagnostic_vs_screening()

min_possible_prevalence()

plot_cutoff()

Owner

GitHub Events

Total

Last Year

Committers

All Time

Past Year

Top Committers

Issues and Pull Requests

All Time

Past Year

Top Authors

Issue Authors

Pull Request Authors

Top Labels

Issue Labels

Pull Request Labels

Packages

cran.r-project.org: BayesianReasoning

Rankings

Maintainers (1)

Dependencies