Exploratory visualization

Jonathan Dushoff

September 2021

Exploring data

Rote analysis vs. snooping

Spurious correlations

There’s a whole website about this

What can you do?

The best you can

Individual variables

Individual variables

Orchard data

A standard plot

A terrible plot

What does this one mean?

What does this one mean?

A non-parametric plot

Bike example

Just the means

Standard errors

Standard errors

Standard deviations (2 sd, in fact)

Data shape

Data shape

Shape and weight

Shape and weight

Shape and weight

Shape and weight

Log scales

Bivariate data

Banking

Sunspots

Sunspots

Code (with built-in data)

Is smoking good for you?

Smoking data

Smoking data

Smoking data

Scatter plots

Scatter plot

Seeing the density better

Seeing the density worse

Maybe fixed

A loess trend line

Two loess trend lines

Many loess trend lines

Theory of loess

Robust methods

rlm fits

rlm fits

Density plots

Contours

Contours

Contours

Hexes

Hexes

Hexes

Color principles

Multiple dimensions

Multiple dimensions

Pairs example

Multiple factors