STA5092Z EDA Lecture Content

Lecture Material Possible resources
1-2 [html], [slides], [pdf] R installation, basics, workflows JB2
Visualizing raw data with ggplot HW3, RP6
3-4 [html], [slides], [pdf] Managing data frames with dplyr JB5–7, HW5, RP3
Filter, select, arrange, mutate, summarize Some from DSFI
5-6 [html], [slides], [pdf] EDA checklist RP4, HW7
Right questions, correlation, missing values, outliers
7-8 [html], [slides], [pdf] Reshaping, tidying, joining dataframes JB8, JB14–16, HW12–13
tidyr, more dplyr
9–10 Principles of good graphics JB24–27, RP5, RP14–15
11–12 Exploring time series data JB10–13, HW14–16
13 Exploring spatial data
14 mapview, leaflet, sf
15 Functional programming with R
16 Writing functions, purrr
17 Visualising data using animations RP11–12
18 gganimate
19–20 Version control with Git and GitHub JB3, some from DSFI
21 Dashboards JB42
22 flexdashboard, shiny
23–24 Dimensionality reduction and Clustering RP13