
Scatter plots
Description
Scatter plots are used to display two continuous variables.
If one of the continuous variables is assumed to affect the other (i.e., outcome vs. predictor), place the outcome on the y axis and the predictor on the x axis.
Getting set up
PACKAGES:
Install packages.
Code
install.packages("palmerpenguins")
library(palmerpenguins)
library(ggplot2)DATA:

The penguins data
Code
penguins <- palmerpenguins::penguins
glimpse(penguins)Rows: 344
Columns: 8
$ species <fct> Adelie, Adelie, Adelie, Adelie, Adelie, Adelie, Adel…
$ island <fct> Torgersen, Torgersen, Torgersen, Torgersen, Torgerse…
$ bill_length_mm <dbl> 39.1, 39.5, 40.3, NA, 36.7, 39.3, 38.9, 39.2, 34.1, …
$ bill_depth_mm <dbl> 18.7, 17.4, 18.0, NA, 19.3, 20.6, 17.8, 19.6, 18.1, …
$ flipper_length_mm <int> 181, 186, 195, NA, 193, 190, 181, 195, 193, 190, 186…
$ body_mass_g <int> 3750, 3800, 3250, NA, 3450, 3650, 3625, 4675, 3475, …
$ sex <fct> male, female, female, NA, female, male, female, male…
$ year <int> 2007, 2007, 2007, 2007, 2007, 2007, 2007, 2007, 2007…
The grammar
CODE:
Create labels with labs()
Initialize the graph with ggplot() and provide data
Map bill_length_mm to the x axis
Map flipper_length_mm to the y axis
Add geom_point()
Code
labs_scatter <- labs(
title = "Bill Length vs. Flipper Length",
x = "Bill Length (mm)", y = "Flipper length (mm)")
ggp2_scatter <- penguins |>
ggplot(
aes(x = bill_length_mm,
y = flipper_length_mm)) +
geom_point()
ggp2_scatter +
labs_scatterGRAPH:
If multiple points occupy the same value position, use transparency (alpha) to improve visibility.
