https://github.com/friendly/vis-mlm-book
Github repo for my in-progress book, "Visualizing Multivariate Data and Models in R" to be published by Taylor & Francis (CRC Press), 2026
Science Score: 26.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (11.5%) to scientific vocabulary
Keywords
Repository
Github repo for my in-progress book, "Visualizing Multivariate Data and Models in R" to be published by Taylor & Francis (CRC Press), 2026
Basic Info
- Host: GitHub
- Owner: friendly
- Language: TeX
- Default Branch: master
- Homepage: https://friendly.github.io/Vis-MLM-book/
- Size: 623 MB
Statistics
- Stars: 13
- Watchers: 2
- Forks: 3
- Open Issues: 4
- Releases: 0
Topics
Metadata Files
README.md
Visualizing Multivariate Data and Models in R 
This is the main repository for my book, Visualizing Multivariate Data and Models in R, to be published by Chapman & Hall, CRC press.
This book is about graphical methods for multivariate data, and their uses in understanding relationships particularly when there are several aspects to be considered together in multiple response models such as multivariate analysis of variance and multivariate multiple regression.
Features
Some key substantive features of the book are:
Statistical data visualization is cast in a general framework by goal (see the data, visualize a model, diagnose problems), rather than a categorization by graphic types. It is best informed by principles and goals of communication, for example making graphic comparison easy and ordering factors and variables according to what should be seen (effect ordering).
Data visualization is seen as a combination of exposure---plotting the raw data---and summarization--- plotting statistical summaries---to highlight what should be noticed. For example, data ellipses and confidence ellipses are widely used as simple, effective summaries of data and fitted model parameters. When the data is complex, the idea of visual thinning can be used to balance the tradeoff.
The book exploits the rich connections among statistics, geometry and data visualization. Statistical ideas, particularly for multivariate data, can be more easily understood in terms of geometrical ones that can be seen in diagrams and data displays. Moreover, ideas from one domain can amplify what we can understand from another.
These graphical tools can be used to understand or explain a wide variety of statistical concepts, phenomena, and paradoxes such as Simpson's paradox, effects of measurement error, and so forth.
The HE ("hypothesis - error") plot framework provides a simple way to understand the results of statistical tests and the relations among response outcomes in the multivariate linear model.
Dimension reduction techniques such as PCA and discriminant analysis are presented as "multivariate juicers," able to squeeze the important information in high-dimensional data into informative two-dimensional views.
R packages
The book brings together a collection of novel techniques I and others have developed over the past 15 years and implemented in mature R packages. The principal multivariate analysis packages highlighted here are:
- Hyphothesis-Error plots:
heplots, - Canonical discriminant analysis:
candisc, - Multivariate influence:
mvinfluence - Visualizing collinearity diagnostics:
visCollin, - Generalized ridge trace plots for ridge regression:
genridge, - Matrix linear algebra:
matlib.
See my GiHub packages page for more details and other packages.
Contributing
I welcome contributions. If you have a suggestion or a bug report please post this as an issue on GitHub.
The online version is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
The work is written under a contributor code of conduct. By participating in this project you agree to abide by its terms and the terms of the license.
Owner
- Name: Michael Friendly
- Login: friendly
- Kind: user
- Location: Toronto
- Company: York University
- Website: https://datavis.ca
- Twitter: datavisFriendly
- Repositories: 57
- Profile: https://github.com/friendly
GitHub Events
Total
- Create event: 19
- Commit comment event: 1
- Issues event: 20
- Watch event: 7
- Delete event: 19
- Member event: 1
- Issue comment event: 41
- Push event: 345
- Pull request review event: 2
- Pull request event: 33
Last Year
- Create event: 19
- Commit comment event: 1
- Issues event: 20
- Watch event: 7
- Delete event: 18
- Member event: 1
- Issue comment event: 40
- Push event: 342
- Pull request review event: 2
- Pull request event: 31
Issues and Pull Requests
Last synced: 5 months ago
All Time
- Total issues: 14
- Total pull requests: 42
- Average time to close issues: 3 months
- Average time to close pull requests: 2 days
- Total issue authors: 2
- Total pull request authors: 3
- Average comments per issue: 0.5
- Average comments per pull request: 0.86
- Merged pull requests: 29
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 13
- Pull requests: 30
- Average time to close issues: 4 months
- Average time to close pull requests: 1 day
- Issue authors: 2
- Pull request authors: 1
- Average comments per issue: 0.54
- Average comments per pull request: 1.07
- Merged pull requests: 19
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- friendly (9)
- emstruong (5)
Pull Request Authors
- emstruong (32)
- udialter (8)
- friendly (2)






