palmerpenguins

A great intro dataset for data exploration & visualization (alternative to iris).

https://github.com/allisonhorst/palmerpenguins

Science Score: 46.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 11 DOI reference(s) in README
  • Academic publication links
    Links to: zenodo.org
  • Committers with academic emails
    1 of 7 committers (14.3%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (14.7%) to scientific vocabulary

Keywords from Contributors

dynamic ecology data-manipulation grammar
Last synced: 6 months ago

Repository

Basic Info
Statistics
  • Stars: 966
  • Watchers: 23
  • Forks: 231
  • Open Issues: 17
  • Releases: 1
Created almost 6 years ago · Last pushed over 1 year ago
Metadata Files
Readme License

README.Rmd

---
output: github_document
---



```{r, include = FALSE}
knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>",
  fig.path = "man/figures/README-",
  out.width = "75%",
  warning = FALSE,
  message = FALSE,
  fig.retina = 2,
  fig.align = 'center'
)
library(tidyverse)
library(palmerpenguins)
```

# palmerpenguins 



[![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.3960218.svg)](https://doi.org/10.5281/zenodo.3960218)
[![CRAN](https://www.r-pkg.org/badges/version/palmerpenguins)](https://cran.r-project.org/package=palmerpenguins)



The goal of palmerpenguins is to provide a great dataset for data exploration & visualization, as an alternative to `iris`. 

```{r flipper-bill, echo=FALSE}
flipper_bill <- ggplot(data = penguins,
                         aes(x = flipper_length_mm,
                             y = bill_length_mm)) +
  geom_point(aes(color = species, 
                 shape = species),
             size = 3,
             alpha = 0.8) +
  geom_smooth(method = "lm", se = FALSE, aes(color = species)) +
  theme_minimal() +
  scale_color_manual(values = c("darkorange","purple","cyan4")) +
  labs(title = "Flipper and bill length",
       subtitle = "Dimensions for Adelie, Chinstrap and Gentoo Penguins at Palmer Station LTER",
       x = "Flipper length (mm)",
       y = "Bill length (mm)",
       color = "Penguin species",
       shape = "Penguin species") +
  theme(legend.position = c(0.85, 0.15),
        plot.title.position = "plot",
        plot.caption = element_text(hjust = 0, face= "italic"),
        plot.caption.position = "plot")

flipper_bill
```



## Installation

You can install the released version of palmerpenguins from [CRAN](https://CRAN.R-project.org) with:

``` r
install.packages("palmerpenguins")
```

To install the development version from [GitHub](https://github.com/) use:

``` r
# install.packages("remotes")
remotes::install_github("allisonhorst/palmerpenguins")
```

## About the data

Data were collected and made available by [Dr. Kristen Gorman](https://www.uaf.edu/cfos/people/faculty/detail/kristen-gorman.php) and the [Palmer Station, Antarctica LTER](https://pallter.marine.rutgers.edu/), a member of the [Long Term Ecological Research Network](https://lternet.edu/). 

The palmerpenguins package contains two datasets. 

```{r showdata}
library(palmerpenguins)
data(package = 'palmerpenguins')
``` 

One is called `penguins`, and is a simplified version of the raw data; see `?penguins` for more info: 

```{r example-penguins}
head(penguins)
```


The second dataset is `penguins_raw`, and contains all the variables and original names as downloaded; see `?penguins_raw` for more info.

```{r example-penguins-raw}
head(penguins_raw)
```

Both datasets contain data for `r nrow(penguins)` penguins. There are `r length(unique(penguins$species))` different species of penguins in this dataset, collected from `r length(unique(penguins$island))` islands in the Palmer Archipelago, Antarctica. 

```{r}
str(penguins)
```
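
The counts quoted above can also be reproduced directly in base R. The following is a minimal sketch that assumes only that the palmerpenguins package is installed; the variable names `n_penguins`, `n_species`, and `n_islands` are illustrative and not part of the package.

```r
library(palmerpenguins)

# Number of rows (one row per penguin)
n_penguins <- nrow(penguins)

# Number of distinct species and islands
n_species <- length(unique(penguins$species))
n_islands <- length(unique(penguins$island))

c(penguins = n_penguins, species = n_species, islands = n_islands)

# Cross-tabulate species by island to see where each species was sampled
table(penguins$species, penguins$island)
```

The cross-tabulation is a quick way to check which species were sampled on which islands before grouping or faceting by either variable.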


We gratefully acknowledge Palmer Station LTER and the US LTER Network. Special thanks to Marty Downs (Director, LTER Network Office) for help regarding the data license & use. 


## Examples

You can find these and more code examples for exploring palmerpenguins in `vignette("examples")`.

Penguins are fun to summarize! For example:

```{r example, warning=FALSE, message=FALSE}
library(tidyverse)
penguins %>% 
  count(species)
penguins %>% 
  group_by(species) %>% 
  summarize(across(where(is.numeric), ~ mean(.x, na.rm = TRUE)))
```

Penguins are fun to visualize! For example:

```{r mass-flipper, warning = FALSE, message = FALSE, echo = FALSE, out.width='75%', fig.retina=2}
mass_flipper <- ggplot(data = penguins, 
                       aes(x = flipper_length_mm,
                           y = body_mass_g)) +
  geom_point(aes(color = species, 
                 shape = species),
             size = 3,
             alpha = 0.8) +
  theme_minimal() +
  scale_color_manual(values = c("darkorange","purple","cyan4")) +
  labs(title = "Penguin size, Palmer Station LTER",
       subtitle = "Flipper length and body mass for Adelie, Chinstrap, and Gentoo Penguins",
       x = "Flipper length (mm)",
       y = "Body mass (g)",
       color = "Penguin species",
       shape = "Penguin species") +
  theme(legend.position = c(0.2, 0.7),
        plot.title.position = "plot",
        plot.caption = element_text(hjust = 0, face= "italic"),
        plot.caption.position = "plot")

mass_flipper
```

```{r flipper-hist, echo=FALSE}
flipper_hist <- ggplot(data = penguins, aes(x = flipper_length_mm)) +
  geom_histogram(aes(fill = species), 
                 alpha = 0.5, 
                 position = "identity") +
  scale_fill_manual(values = c("darkorange","purple","cyan4")) +
  theme_minimal() +
  labs(x = "Flipper length (mm)",
       y = "Frequency",
       title = "Penguin flipper lengths")

flipper_hist
```

## Artwork

You can download palmerpenguins art (useful for teaching with the data) in `vignette("art")`. If you use this artwork, please cite with: "Artwork by @allison_horst".

### Meet the Palmer penguins

```{r, echo = FALSE, fig.align='center'}
knitr::include_graphics("man/figures/lter_penguins.png", dpi = 300)
```

### Bill dimensions

The culmen is the upper ridge of a bird's bill. In the simplified `penguins` data, culmen length and depth are renamed as variables `bill_length_mm` and `bill_depth_mm` to be more intuitive.  

For this penguin data, the culmen (bill) length and depth are measured as shown below (thanks Kristen Gorman for clarifying!):

```{r, echo = FALSE, fig.align='center'}
knitr::include_graphics("man/figures/culmen_depth.png", dpi = 300)
```
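
To see the renaming in practice, the sketch below compares the culmen columns in `penguins_raw` with the bill columns in `penguins`. It assumes the raw columns are named `Culmen Length (mm)` and `Culmen Depth (mm)` and that both datasets keep the same row order; check `names(penguins_raw)` if your installed version differs.

```r
library(palmerpenguins)

# Raw column names contain spaces and parentheses, so use [[ ]] or backticks
raw_length <- penguins_raw[["Culmen Length (mm)"]]
raw_depth  <- penguins_raw[["Culmen Depth (mm)"]]

# Compare with the renamed columns in the simplified data
same_length <- isTRUE(all.equal(raw_length, penguins$bill_length_mm))
same_depth  <- isTRUE(all.equal(raw_depth, penguins$bill_depth_mm))

c(length = same_length, depth = same_depth)
```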

## License

Data are available under the [CC-0](https://creativecommons.org/share-your-work/public-domain/cc0/) license in accordance with the [Palmer Station LTER Data Policy](https://pallter.marine.rutgers.edu/data/) and the [LTER Data Access Policy for Type I data](https://lternet.edu/data-access-policy/).

## Citation

To cite the palmerpenguins package, please use:

```{r}
citation("palmerpenguins")
```
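
If you manage references with BibTeX, base R's `toBibtex()` can convert the citation object. This is a small sketch using only the `utils` package that ships with R; the variable names `cit` and `bib` are illustrative.

```r
# Build the citation object for the installed package
cit <- citation("palmerpenguins")

# Convert it to a BibTeX entry that can be pasted into a .bib file
bib <- utils::toBibtex(cit)
print(bib)
```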


## Additional data use information

Anyone interested in publishing the data should contact [Dr. Kristen Gorman](https://www.uaf.edu/cfos/people/faculty/detail/kristen-gorman.php) about analysis and working together on any final products. From Gorman et al. (2014): "Individuals interested in using these data are expected to follow the US LTER Network’s Data Access Policy, Requirements and Use Agreement: https://lternet.edu/data-access-policy/."

## References

**Data originally published in:** 

+ Gorman KB, Williams TD, Fraser WR (2014). Ecological sexual dimorphism and environmental variability within a community of Antarctic penguins (genus *Pygoscelis*). PLoS ONE 9(3):e90081.  https://doi.org/10.1371/journal.pone.0090081


**Data citations:**

Adélie penguins: 

+ Palmer Station Antarctica LTER and K. Gorman, 2020. Structural size measurements and isotopic signatures of foraging among adult male and female Adélie penguins (*Pygoscelis adeliae*) nesting along the Palmer Archipelago near Palmer Station, 2007-2009 ver 5. Environmental Data Initiative. https://doi.org/10.6073/pasta/98b16d7d563f265cb52372c8ca99e60f (Accessed 2020-06-08).

Gentoo penguins:

+ Palmer Station Antarctica LTER and K. Gorman, 2020. Structural size measurements and isotopic signatures of foraging among adult male and female Gentoo penguin (*Pygoscelis papua*) nesting along the Palmer Archipelago near Palmer Station, 2007-2009 ver 5. Environmental Data Initiative. https://doi.org/10.6073/pasta/7fca67fb28d56ee2ffa3d9370ebda689 (Accessed 2020-06-08).

Chinstrap penguins: 

+ Palmer Station Antarctica LTER and K. Gorman, 2020. Structural size measurements and isotopic signatures of foraging among adult male and female Chinstrap penguin (*Pygoscelis antarcticus*) nesting along the Palmer Archipelago near Palmer Station, 2007-2009 ver 6. Environmental Data Initiative. https://doi.org/10.6073/pasta/c14dfcfada8ea13a17536e73eb6fbe9e (Accessed 2020-06-08).

Owner

  • Name: Allison Horst
  • Login: allisonhorst
  • Kind: user
  • Company: Observable

Teaching | Data Science | Art | Science Communication

GitHub Events

Total
  • Issues event: 4
  • Watch event: 73
  • Issue comment event: 3
  • Fork event: 28
Last Year
  • Issues event: 4
  • Watch event: 73
  • Issue comment event: 3
  • Fork event: 28

Committers

Last synced: 7 months ago

All Time
  • Total Commits: 192
  • Total Committers: 7
  • Avg Commits per committer: 27.429
  • Development Distribution Score (DDS): 0.203
Past Year
  • Commits: 1
  • Committers: 1
  • Avg Commits per committer: 1.0
  • Development Distribution Score (DDS): 0.0
Top Committers
| Name | Email | Commits |
|---|---|---|
| Allison Horst | a****t@u****u | 153 |
| Alison Presmanes Hill | a****l@g****m | 31 |
| Hadley Wickham | h****m@g****m | 4 |
| Jannik Buhr | 1****r | 1 |
| Jennifer (Jenny) Bryan | j****n@g****m | 1 |
| Trang Le | g****r@g****m | 1 |
| deanerik | 3****k | 1 |

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 30
  • Total pull requests: 72
  • Average time to close issues: 4 months
  • Average time to close pull requests: about 18 hours
  • Total issue authors: 29
  • Total pull request authors: 12
  • Average comments per issue: 1.97
  • Average comments per pull request: 0.24
  • Merged pull requests: 66
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 3
  • Pull requests: 1
  • Average time to close issues: N/A
  • Average time to close pull requests: about 4 hours
  • Issue authors: 3
  • Pull request authors: 1
  • Average comments per issue: 0.0
  • Average comments per pull request: 0.0
  • Merged pull requests: 1
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • allisonhorst (2)
  • MonkmanMH (1)
  • jhk0530 (1)
  • john-sandall (1)
  • RaphaelS1 (1)
  • friendly (1)
  • davidhodge931 (1)
  • markvanderloo (1)
  • Bassei (1)
  • amrrs (1)
  • ttimbers (1)
  • gupton1944 (1)
  • moon-jong (1)
  • karaesmen (1)
  • charliejhadley (1)
Pull Request Authors
  • allisonhorst (33)
  • apreshill (26)
  • hadley (4)
  • deanerik (2)
  • gungorMetehan (2)
  • jennybc (1)
  • koliharesh (1)
  • trangdata (1)
  • brunj7 (1)
  • ttimbers (1)
  • friendly (1)
  • jmbuhr (1)
Top Labels
Issue Labels
CRAN :penguin: (1)

Packages

  • Total packages: 2
  • Total downloads:
    • cran 44,173 last-month
  • Total docker downloads: 48,971
  • Total dependent packages: 39
    (may contain duplicates)
  • Total dependent repositories: 151
    (may contain duplicates)
  • Total versions: 4
  • Total maintainers: 1
cran.r-project.org: palmerpenguins

Palmer Archipelago (Antarctica) Penguin Data

  • Versions: 2
  • Dependent Packages: 38
  • Dependent Repositories: 149
  • Downloads: 44,173 Last month
  • Docker Downloads: 48,971
Rankings
Forks count: 0.3%
Stargazers count: 0.3%
Downloads: 1.6%
Dependent repos count: 1.7%
Average: 2.4%
Dependent packages count: 2.6%
Docker downloads count: 7.9%
Maintainers (1)
Last synced: 6 months ago
conda-forge.org: r-palmerpenguins
  • Versions: 2
  • Dependent Packages: 1
  • Dependent Repositories: 2
Rankings
Forks count: 14.0%
Stargazers count: 14.6%
Average: 19.4%
Dependent repos count: 20.1%
Dependent packages count: 29.0%
Last synced: 6 months ago

Dependencies

DESCRIPTION cran
  • R >= 2.10 depends
  • dplyr * suggests
  • ggplot2 * suggests
  • knitr * suggests
  • recipes * suggests
  • rmarkdown * suggests
  • tibble * suggests
  • tidyr * suggests
.github/workflows/pkgdown.yaml actions
  • JamesIves/github-pages-deploy-action 4.1.4 composite
  • actions/checkout v2 composite
  • r-lib/actions/setup-pandoc v2 composite
  • r-lib/actions/setup-r v2 composite
  • r-lib/actions/setup-r-dependencies v2 composite