Scatter plots
Description
Scatter plots are used to display two continuous variables.
If one of the continuous variables is assumed to affect the other (i.e., outcome
vs. predictor
), place the outcome
on the y
axis and the predictor
on the x
axis.
Getting set up
PACKAGES:
Install packages.
Code
install.packages("palmerpenguins")
library(palmerpenguins)
library(ggplot2)
DATA:
The penguins
data
Code
<- palmerpenguins::penguins
penguins glimpse(penguins)
Rows: 344
Columns: 8
$ species <fct> Adelie, Adelie, Adelie, Adelie, Adelie, Adelie, Adel…
$ island <fct> Torgersen, Torgersen, Torgersen, Torgersen, Torgerse…
$ bill_length_mm <dbl> 39.1, 39.5, 40.3, NA, 36.7, 39.3, 38.9, 39.2, 34.1, …
$ bill_depth_mm <dbl> 18.7, 17.4, 18.0, NA, 19.3, 20.6, 17.8, 19.6, 18.1, …
$ flipper_length_mm <int> 181, 186, 195, NA, 193, 190, 181, 195, 193, 190, 186…
$ body_mass_g <int> 3750, 3800, 3250, NA, 3450, 3650, 3625, 4675, 3475, …
$ sex <fct> male, female, female, NA, female, male, female, male…
$ year <int> 2007, 2007, 2007, 2007, 2007, 2007, 2007, 2007, 2007…
The grammar
CODE:
Create labels with labs()
Initialize the graph with ggplot()
and provide data
Map bill_length_mm
to the x
axis
Map flipper_length_mm
to the y
axis
Add geom_point()
Code
<- labs(
labs_scatter title = "Bill Length vs. Flipper Length",
x = "Bill Length (mm)", y = "Flipper length (mm)")
<- penguins |>
ggp2_scatter ggplot(
aes(x = bill_length_mm,
y = flipper_length_mm)) +
geom_point()
+
ggp2_scatter labs_scatter
GRAPH:
If multiple points occupy the same value position, use transparency (alpha
) to improve visibility.