heplots

Visualizing Hypothesis Tests in Multivariate Linear Models, http://friendly.github.io/heplots/

https://github.com/friendly/heplots

Science Score: 49.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
✓
DOI references
Found 9 DOI reference(s) in README
✓
Academic publication links
Links to: zenodo.org
○
Committers with academic emails
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (16.7%) to scientific vocabulary

Keywords

linear-hypotheses matrices multivariate-linear-models plot repeated-measure-designs visualizing-hypothesis-tests

Last synced: 6 months ago · JSON representation

Repository

Visualizing Hypothesis Tests in Multivariate Linear Models, http://friendly.github.io/heplots/

Basic Info

Host: GitHub
Owner: friendly
Language: R
Default Branch: master
Homepage:
Size: 161 MB

Statistics

Stars: 10
Watchers: 4
Forks: 4
Open Issues: 3
Releases: 10

Topics

linear-hypotheses matrices multivariate-linear-models plot repeated-measure-designs visualizing-hypothesis-tests

Created over 12 years ago · Last pushed 6 months ago

Metadata Files

Readme Changelog Citation

README.Rmd

---
output: github_document
editor_options: 
  markdown: 
    wrap: 72
---



```{r, echo = FALSE}
knitr::opts_chunk$set(
  warning = FALSE,   # avoid warnings and messages in the output
  message = FALSE,
  collapse = TRUE,
  fig.width = 4,
  fig.height = 4,
  dpi = 96,
  comment = "#>",
  fig.path = "man/figures/README-"
)

par(mar=c(3,3,1,1)+.1)
```

```{r, echo=FALSE}
library(heplots)
```



[![Lifecycle:
stable](https://img.shields.io/badge/lifecycle-stable-brightgreen.svg)](https://lifecycle.r-lib.org/articles/stages.html#stable)
[![CRAN_Status_Badge](http://www.r-pkg.org/badges/version/heplots)](http://cran.r-project.org/package=heplots)
[![R_Universe](https://friendly.r-universe.dev/badges/heplots)](https://friendly.r-universe.dev)
[![Last Commit](https://img.shields.io/github/last-commit/friendly/heplots)](https://github.com/friendly/heplots/)
[![Downloads](http://cranlogs.r-pkg.org/badges/grand-total/heplots)](https://cran.r-project.org/package=heplots)
[![DOI](https://zenodo.org/badge/13908453.svg)](https://zenodo.org/badge/latestdoi/13908453)
[![Docs](https://img.shields.io/badge/pkgdown%20site-blue)](https://friendly.github.io/heplots/)


# heplots 

## **Visualizing Hypothesis Tests in Multivariate Linear Models** 


Version `r getNamespaceVersion("heplots")`

## Description 

The `heplots` package provides functions for visualizing hypothesis
tests in multivariate linear models ("MLM" = {MANOVA, multivariate multiple
regression, MANCOVA, and repeated measures designs}). It also provides other tools for
analysis and graphical display of MLMs.

HE plots represent sums-of-squares-and-products matrices for linear
hypotheses (**H**) and for error (**E**) using ellipses (in two
dimensions), ellipsoids (in three dimensions), or by line segments in
one dimension. For the theory and applications, see:

-   [Friendly (2007)](http://datavis.ca/papers/jcgs-heplots.pdf) for the basic theory on which this is based.
-   [Fox, Friendly and Monette (2009)]((http://datavis.ca/palers/FoxFriendlyMonette-2009.pdf)) for a brief introduction,
-   [Friendly (2010)](http://www.jstatsoft.org/v37/i04/paper) for the application of these ideas to repeated
    measure designs,
-   [Friendly, Monette and Fox (2013)](http://datavis.ca/palers/ellipses-STS402.pdf) for a general discussion of the role of elliptical geometry in statistical understanding, 
-   [Friendly & Sigal (2017)](https://doi.org/10.20982/tqmp.13.1.p020) for an applied R tutorial, 
-   [Friendly & Sigal (2018)](https://www.datavis.ca/papers/EqCov-TAS.pdf) for theory and examples of visualizing equality of covariance matrices.

If you use this work in teaching or research, please cite it as given by `citation("heplots")` or see [Citation](authors.html#citation).

Other topics now addressed here include:

-   robust MLMs, using iteratively re-weighted least squared to
    down-weight observations with large multivariate residuals,
    `robmlm()`.
-   `Mahalanobis()` calculates classical and _robust_ Mahalanobis squared
    distances using MCD and MVE estimators of center and covariance.
-   visualizing tests for equality of covariance matrices in MLMs (Box's
    M test), `boxM()` and `plot.boxM()`. Also: `bartlettTests()` and `LeveneTests()`
    for homogeneity of variance for each response in a MLM.
-   $\chi^2$ Q-Q plots for MLMs (`cqplot()`) to detect outliers and
    assess multivariate normality of residuals.
-   bivariate coefficient plots showing elliptical confidence regions
    (`coefplot()`).

In this respect, the `heplots` package now aims to provide a wide range
of tools for analyzing and visualizing multivariate response linear
models, together with other packages:



-   The related [`candisc`](https://friendly.github.io/candisc/) package
    provides HE plots in **canonical discriminant** space, the space of
    linear combinations of the responses that show the maximum possible
    effects and for canonical correlation in multivariate regression
    designs. See the [package documentation](https://friendly.github.io/candisc/)
    for details.



-   Another package,
    [`mvinfluence`](https://friendly.github.io/mvinfluence/), provides
    diagnostic measures and plots for **influential observations** in MLM
    designs. See the [package documentation](https://friendly.github.io/mvinfluence/)
    for details.


Several tutorial vignettes are also included. See
`vignette(package="heplots")`.

## Installation

+-------------------+----------------------------------------------------------------------------+
| CRAN version      | `install.packages("heplots")`                                              |
+-------------------+----------------------------------------------------------------------------+
| R-universe        | `install.packages("heplots", repos = c('https://friendly.r-universe.dev')` |
+-------------------+----------------------------------------------------------------------------+
| Development       | `remotes::install_github("friendly/heplots")`                              |
| version           |                                                                            |
+-------------------+----------------------------------------------------------------------------+


## HE plot functions

The graphical functions contained here all display multivariate model
effects in variable (**data**) space, for one or more response variables
(or contrasts among response variables in repeated measures designs).

-   `heplot()` constructs two-dimensional HE plots for model terms and
    linear hypotheses for pairs of response variables in multivariate
    linear models.

-   `heplot3d()` constructs analogous 3D plots for triples of response
    variables.

-   The `pairs` method, `pairs.mlm()` constructs a scatterplot matrix of
    pairwise HE plots.

-   `heplot1d()` constructs 1-dimensional analogs of HE plots for model
    terms and linear hypotheses for single response variables.

## Other functions

-   `glance.mlm()` extends `broom::glance.lm()` to multivariate response
    models, giving a one-line statistical summary for each response
    variable. `uniStats()` does something similar, but formatted more like a ANOVA table.

-   `boxM()` Calculates Box's *M* test for homogeneity of covariance
    matrices in a MANOVA design. A `plot` method displays a visual
    representation of the components of the test. Associated with this,
    `bartletTests()` and `levineTests()` give the univariate tests of
    homogeneity of variance for each response measure in a MLM.

-   `covEllipses()` draw covariance (data) ellipses for one or more
    group, optionally including the ellipse for the pooled within-group
    covariance. 
    
-   `coefplot()` for an MLM object draws bivariate confidence ellipses.

### Repeated measure designs

For repeated measure designs, between-subject effects and within-subject
effects must be plotted separately, because the error terms (**E**
matrices) differ. For terms involving within-subject effects, these
functions carry out a linear transformation of the matrix **Y** of
responses to a matrix **Y M**, where **M** is the model matrix for a
term in the intra-subject design and produce plots of the **H** and **E**
matrices in this transformed space. The vignette `"repeated"` describes
these graphical methods for repeated measures designs. (This
paper [HE plots for repeated measures
designs](http://www.jstatsoft.org/v37/i04/paper) is now provided as a PDF vignette.)

## Datasets

The package also provides a large collection of data sets illustrating a
variety of multivariate linear models of the types listed above,
together with graphical displays. The table below classifies these with
method tags. Their names are linked to their documentation with graphical output on the
`pkgdown` website, [].

```{r datasets, echo=FALSE}
library(here)
library(dplyr)
library(tinytable)
#dsets <- read.csv(here::here("extra", "datasets.csv"))
dsets <- read.csv("https://raw.githubusercontent.com/friendly/heplots/master/extra/datasets.csv")
dsets <- dsets |> dplyr::select(-X) |> arrange(tolower(dataset))

# link dataset to pkgdown doc
refurl <- "http://friendly.github.io/heplots/reference/"

dsets <- dsets |>
  mutate(dataset = glue::glue("[{dataset}]({refurl}{dataset}.html)")) 

tinytable::tt(dsets)
#knitr::kable(dsets)
```

## Examples

This example illustrates HE plots using the classic `iris` data set. How
do the means of the flower variables differ by `Species`? This dataset
was the impetus for R. A. Fisher (1936) to propose a method of
discriminant analysis using data collected by Edgar Anderson (1928).
Though some may rightly deprecate Fisher for being a supporter of
eugenics, Anderson's `iris` dataset should not be blamed.

A basic HE plot shows the **H** and **E** ellipses for the first two
response variables (here: `Sepal.Length` and `Sepal.Width`). The
multivariate test is significant (by Roy's test) *iff* the **H** ellipse
projects *anywhere* outside the **E** ellipse.

The positions of the group means show how they differ on the two
response variables shown, and provide an interpretation of the
orientation of the **H** ellipse: it is long in the directions of
differences among the means.

```{r iris1}
#| echo=-1,
#| out.width="70%",
#| fig.cap = "HE plot of sepal length and Sepal width for the iris data"
par(mar=c(4,4,1,1)+.1)
iris.mod <- lm(cbind(Sepal.Length, Sepal.Width, Petal.Length, Petal.Width) ~ 
                 Species, data=iris)
heplot(iris.mod)
```

### Contrasts

Contrasts or other linear hypotheses can be shown as well, and the
ellipses look better if they are filled. We create contrasts to test the
differences between `versacolor` and `virginca` and also between
`setosa` and the average of the other two. Each 1 df contrast plots as
degenerate 1D ellipse-- a line.

Because these contrasts are orthogonal, they add to the total 2 df
effect of `Species`. Note how the first contrast, labeled `V:V`,
distinguishes the means of *versicolor* from *virginica*; the second
contrast, `S:VV` distinguishes `setosa` from the other two.

```{r iris2, out.width="70%"}
#| fig.cap = "HE plot of sepal length and Sepal width for the iris data, showing lines reflecting two contrasts among iris species."
par(mar=c(4,4,1,1)+.1)
contrasts(iris$Species)<-matrix(c(0, -1, 1, 
                                  2, -1, -1), nrow=3, ncol=2)
contrasts(iris$Species)
iris.mod <- lm(cbind(Sepal.Length, Sepal.Width, Petal.Length, Petal.Width) ~ 
                 Species, data=iris)

hyp <- list("V:V"="Species1","S:VV"="Species2")
heplot(iris.mod, hypotheses=hyp, 
       fill=TRUE, fill.alpha=0.1)
```

### All pairwise HE plots

All pairwise HE plots are produced using the `pairs()` method for MLM
objects.In the plot, note how the means of most pairs of variables are very
highly correlated, in the order Setosa < Versicolor < Virginica, but this
pattern doesn't hold for relations with `Sepal.Width`.

```{r, iris3}
#| out.width="100%",
#| fig.height = 6,
#| fig.width = 6,
#| fig.cap = "Scatterplot matrix of pairwise HE plots for the iris data."
pairs(iris.mod, hypotheses=hyp, hyp.labels=FALSE,
      fill=TRUE, fill.alpha=0.1)
```

### Canonical discriminant view

For more than two response variables, a multivariate effect can be viewed more simply by projecting
the data into canonical space --- the linear combinations of the responses which show the greatest
differences among the group means relative to within-group scatter. The computations are performed
with the [`candisc`](https://github.com/friendly/candisc/) package, which has an `heplot.candisc()`
method.

```{r iris-can0}
library(candisc)
iris.can <- candisc(iris.mod) |> print()
```

The HE plot in canonical space shows that the differences among species are nearly
entirely one-dimensional. The weights for the variables on the first dimension
show how `Sepal.Width` differs from the other size variables.
```{r iris-can}
#| out.width = "60%",
#| echo = -1,
#| fig.cap = "Canonical HE plot for the iris data"
par(mar=c(4,4,1,1)+.1)
# HE plot in canonical space
heplot(iris.can, var.pos = 1, scale = 40)
```


### Covariance ellipses

MANOVA relies on the assumption that within-group covariance matrices are all equal.
It is useful to visualize these in the space of some of the predictors.
`covEllipses()` provides this both for classical and robust (`method="mve"`) estimates.
The figure below shows these for the three Iris species and the 
pooled covariance matrix, which is the same as the **E** matrix used
in MANOVA tests.
```{r iris4, out.width="80%"}
#| echo = -1,
#| fig.cap = "Covariance ellipses for the iris data, showing the classical and robust estimates."
par(mar=c(4,4,1,1)+.1)
covEllipses(iris[,1:4], iris$Species)
covEllipses(iris[,1:4], iris$Species, 
            fill=TRUE, method="mve", add=TRUE, labels="")
```

## References

Anderson, E. (1928). The Problem of Species in the Northern Blue Flags,
Iris versicolor L. and Iris virginica L. *Annals of the Missouri
Botanical Garden*, **13**, 241--313.

Fisher, R. A. (1936). The Use of Multiple Measurements in Taxonomic
Problems. *Annals of Eugenics*, **8**, 379--388.

Friendly, M. (2006).
[Data Ellipses, HE Plots and Reduced-Rank Displays for Multivariate Linear Models: 
SAS Software and Examples.](https://www.jstatsoft.org/article/view/v017i06) 
_Journal of Statistical Software_, **17**, 1-42.

Friendly, M. (2007).  [HE plots for Multivariate General Linear Models](http://datavis.ca/papers/jcgs-heplots.pdf).
_Journal of Computational and Graphical Statistics_, **16**(2) 421-444.
[DOI](http://dx.doi.org/10.1198/106186007X208407).


Fox, J., Friendly, M. & Monette, G. (2009). [Visualizing hypothesis
tests in multivariate linear models: The heplots package for
R](http://datavis.ca/palers/FoxFriendlyMonette-2009.pdf) *Computational
Statistics*, **24**, 233-246.


Friendly, M. (2010). [HE plots for repeated measures
designs](http://www.jstatsoft.org/v37/i04/paper). *Journal of
Statistical Software*, **37**, 1--37.

Friendly, M.; Monette, G. & Fox, J. (2013). [Elliptical Insights:
Understanding Statistical Methods Through Elliptical
Geometry](http://datavis.ca/palers/ellipses-STS402.pdf) *Statistical
Science*, **28**, 1-39.

Friendly, M. & Sigal, M. (2017). [Graphical Methods for Multivariate
Linear Models in Psychological Research: An R
Tutorial.](https://doi.org/10.20982/tqmp.13.1.p020) *The Quantitative
Methods for Psychology*, **13**, 20-45.

Friendly, M. & Sigal, M. (2018): [Visualizing Tests for Equality of
Covariance Matrices](https://www.datavis.ca/papers/EqCov-TAS.pdf), _The American Statistician_, [DOI](https://doi.org/10.1080/00031305.2018.1497537)

Owner

Name: Michael Friendly
Login: friendly
Kind: user
Location: Toronto
Company: York University

Website: https://datavis.ca
Twitter: datavisFriendly
Repositories: 57
Profile: https://github.com/friendly

GitHub Events

Total

Create event: 4
Issues event: 1
Release event: 3
Watch event: 1
Delete event: 1
Issue comment event: 2
Push event: 102
Pull request event: 4
Fork event: 1

Last Year

Create event: 4
Issues event: 1
Release event: 3
Watch event: 1
Delete event: 1
Issue comment event: 2
Push event: 102
Pull request event: 4
Fork event: 1

Committers

Last synced: over 1 year ago

All Time

Total Commits: 532
Total Committers: 5
Avg Commits per committer: 106.4
Development Distribution Score (DDS): 0.485

Past Year

Commits: 45
Committers: 1
Avg Commits per committer: 45.0
Development Distribution Score (DDS): 0.0

Top Committers

Name	Email	Commits
friendly	f**y@e**b	274
Michael Friendly	f**y@y**a	253
jfox	j**x@e**b	3
stefan7th	s**h@e**b	1
dmurdoch	m**n@g**m	1

Committer Domains (Top 20 + Academic)

yorku.ca: 1

Issues and Pull Requests

Last synced: 6 months ago

All Time

Total issues: 3
Total pull requests: 10
Average time to close issues: 3 months
Average time to close pull requests: about 2 hours
Total issue authors: 3
Total pull request authors: 4
Average comments per issue: 2.67
Average comments per pull request: 0.5
Merged pull requests: 9
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 1
Pull requests: 2
Average time to close issues: N/A
Average time to close pull requests: about 7 hours
Issue authors: 1
Pull request authors: 2
Average comments per issue: 2.0
Average comments per pull request: 0.0
Merged pull requests: 2
Bot issues: 0
Bot pull requests: 0

View more stats

Top Authors

Issue Authors

ghost (1)
philchalmers (1)
skreibig (1)

Pull Request Authors

friendly (8)
vincentarelbundock (2)
MichaelChirico (1)
dmurdoch (1)

Top Labels

Issue Labels

Pull Request Labels

Packages

Total packages: 1
Total downloads:
- cran 6,448 last-month
Total docker downloads: 43,935

Total dependent packages: 11
Total dependent repositories: 23
Total versions: 45
Total maintainers: 1

cran.r-project.org: heplots

Visualizing Hypothesis Tests in Multivariate Linear Models

Homepage: https://friendly.github.io/heplots/
Documentation: http://cran.r-project.org/web/packages/heplots/heplots.pdf
License: GPL-2 | GPL-3 [expanded from: GPL (≥ 2)]
Latest release: 1.7.8
published 6 months ago

Versions: 45
Dependent Packages: 11
Dependent Repositories: 23
Downloads: 6,448 Last month
Docker Downloads: 43,935

Rankings

Docker downloads count: 0.6%

Downloads: 4.5%

Dependent packages count: 5.0%

Dependent repos count: 5.7%

Average: 7.6%

Forks count: 12.2%

Stargazers count: 17.4%

Maintainers (1)

friendly@yorku.ca

Last synced: 6 months ago

Dependencies

DESCRIPTION cran

R >= 3.5.0 depends
car >= 3.0 depends
MASS * imports
grDevices * imports
graphics * imports
stats * imports
utils * imports
animation * suggests
candisc * suggests
corrgram * suggests
effects * suggests
gplots * suggests
lattice * suggests
mvinfluence * suggests
nlme * suggests
reshape * suggests
reshape2 * suggests
rgl * suggests

revdep/library/heplots/new/cli/DESCRIPTION cran

R >= 3.4 depends
utils * imports
callr * suggests
covr * suggests
digest * suggests
glue >= 1.6.0 suggests
grDevices * suggests
htmltools * suggests
htmlwidgets * suggests
knitr * suggests
methods * suggests
mockery * suggests
processx * suggests
ps >=1.3.4.9000 suggests
rlang >= 1.0.2.9003 suggests
rmarkdown * suggests
rprojroot * suggests
rstudioapi * suggests
testthat * suggests
tibble * suggests
whoami * suggests
withr * suggests

revdep/library/heplots/new/crayon/DESCRIPTION cran

grDevices * imports
methods * imports
utils * imports
mockery * suggests
rstudioapi * suggests
testthat * suggests
withr * suggests

revdep/library/heplots/new/heplots/DESCRIPTION cran

R >= 3.5.0 depends
broom * depends
car >= 3.0 depends
MASS * imports
grDevices * imports
graphics * imports
magrittr * imports
purrr * imports
stats * imports
tibble * imports
utils * imports
animation * suggests
bookdown * suggests
candisc * suggests
corrgram * suggests
dplyr * suggests
effects * suggests
ggplot2 * suggests
gplots * suggests
knitr * suggests
lattice * suggests
mvinfluence * suggests
nlme * suggests
reshape * suggests
reshape2 * suggests
rgl * suggests
rmarkdown * suggests
tidyr * suggests

revdep/library/heplots/new/jsonlite/DESCRIPTION cran

methods * depends
R.rsp * suggests
curl * suggests
httr * suggests
knitr * suggests
rmarkdown * suggests
sf * suggests
testthat * suggests
vctrs * suggests

revdep/library/heplots/new/rlang/DESCRIPTION cran

R >= 3.4.0 depends
winch * enhances
utils * imports
cli >= 3.1.0 suggests
covr * suggests
crayon * suggests
fs * suggests
glue * suggests
knitr * suggests
magrittr * suggests
methods * suggests
pillar * suggests
rmarkdown * suggests
stats * suggests
testthat >= 3.0.0 suggests
tibble * suggests
usethis * suggests
vctrs >= 0.2.3 suggests
withr * suggests

revdep/library/heplots/new/vctrs/DESCRIPTION cran

R >= 3.3 depends
cli >= 3.2.0 imports
glue * imports
rlang >= 1.0.2 imports
bit64 * suggests
covr * suggests
crayon * suggests
dplyr >= 0.8.5 suggests
generics * suggests
knitr * suggests
pillar >= 1.4.4 suggests
pkgdown >= 2.0.1 suggests
rmarkdown * suggests
testthat >=3.0.0 suggests
tibble >= 3.1.3 suggests
waldo >= 0.2.0 suggests
withr * suggests
xml2 * suggests
zeallot * suggests

revdep/library/heplots/old/heplots/DESCRIPTION cran

R >= 3.5.0 depends
car >= 3.0 depends
MASS * imports
grDevices * imports
graphics * imports
stats * imports
utils * imports
animation * suggests
candisc * suggests
corrgram * suggests
effects * suggests
gplots * suggests
lattice * suggests
mvinfluence * suggests
nlme * suggests
reshape * suggests
reshape2 * suggests
rgl * suggests

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science

heplots

Science Score: 49.0%

Keywords

Repository

Basic Info

Statistics

Topics

Metadata Files

README.Rmd

Owner

GitHub Events

Total

Last Year

Committers

All Time

Past Year

Top Committers

Committer Domains (Top 20 + Academic)

Issues and Pull Requests

All Time

Past Year

Top Authors

Issue Authors

Pull Request Authors

Top Labels

Issue Labels

Pull Request Labels

Packages

cran.r-project.org: heplots

Rankings

Maintainers (1)

Dependencies