Appendix A — Data

Caution

This section is still being developed. The contents are subject to change.

The data packages used throughout the manual are listed below, along with the code to install them.

# Data packages used in the manual
data_pkgs <- c("palmerpenguins",
               "fivethirtyeight",
               "ggplot2movies",
               "babynames")
# Install all of them from CRAN in one call
install.packages(data_pkgs)

A.0.1 palmerpenguins::penguins

The majority of the graphs in the manual are built using the palmerpenguins::penguins data.
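As a quick way to preview this dataset, the sketch below inspects its structure, first rows, and missing values. It assumes palmerpenguins is already installed, and it refers to the data with the `palmerpenguins::` prefix so that it is not confused with any other object named `penguins`.

```r
# Assumes palmerpenguins has been installed (see install.packages() above).
penguins <- palmerpenguins::penguins

# Column names, types, and a few values per column.
str(penguins)

# First few rows.
head(penguins)

# Count of missing values per column; field data like this is not complete.
colSums(is.na(penguins))
```

The missing values are worth noting before plotting, since many plotting functions will warn about or drop incomplete rows.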

…so…many…PENGUINS!

Artwork by allison_horst

Source: https://github.com/allisonhorst/palmerpenguins/

A.0.2 fivethirtyeight

To see which datasets are available in the fivethirtyeight package, view the Data Frame Name and Article Title columns in the datasets_master table.
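If the table is not at hand, a similar listing can be produced from an R session. The sketch below is not the datasets_master table itself; it uses base R's `data()` function to list what the installed package ships, and `datasets_538` is just a name chosen here for illustration.

```r
# Assumes fivethirtyeight has been installed.
# data() with a package argument returns a listing without loading any dataset.
listing <- data(package = "fivethirtyeight")

# The results matrix has one row per dataset: "Item" is the data frame name
# and "Title" is the title from the package documentation.
datasets_538 <- as.data.frame(listing$results[, c("Item", "Title")])

# Preview the listing and count the available datasets.
head(datasets_538)
nrow(datasets_538)
```

Any dataset in the listing can then be accessed with the `fivethirtyeight::` prefix followed by its `Item` name.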

Source: https://github.com/fivethirtyeight/data

A.0.3 ggplot2movies::movies

Source: https://www.imdb.com/

A.0.4 babynames::babynames

Source: http://www.ssa.gov/oact/babynames/limits.html

Why not manually create the graph datasets with data.frame() or tibble()/tribble()?

In my opinion, manually generated data are great for reproducible examples, but they rarely look like data ‘caught in the wild.’ The data packages above are also well maintained, and together they offer enough variety to support a wide range of examples.
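To make the contrast concrete, here is a minimal sketch of a hand-built dataset using base R's `data.frame()`. The values are hypothetical and typed in purely for illustration; `toy_penguins` is not part of any package.

```r
# Hypothetical values, entered by hand for illustration only.
toy_penguins <- data.frame(
  species        = c("Adelie", "Chinstrap", "Gentoo"),
  bill_length_mm = c(39.1, 48.7, 47.5),
  body_mass_g    = c(3750, 3800, 5000)
)

# Small, tidy, and easy to paste into a reproducible example.
str(toy_penguins)

# Hand-made data is complete by construction, unlike most collected data.
anyNA(toy_penguins)
```

A data frame like this is convenient for demonstrating a single function call, but it has none of the missing values, uneven group sizes, or odd distributions that the packaged datasets bring with them.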