Chapter 3 Exploratory Data Analysis I: Data Wrangling
Description
Presentation of the Markdown language. Creation of a dynamic report. Presentation of the R language syntax. Use of R packages for data management. Presentation of CRAN and OPENCSI for packet access. Presentation of the principles of open science and reproducible research.
Concepts discussed :
1 markdown language
2 R language
3 open science
Pre-Session Activities/Resources
Harkness, Timandra. 2020. “Stop Flaunting Those Curves! Time for Stats to Get down and Dirty with the Public.” Harvard Data Science Review, July. https://doi.org/10.1162/99608f92.caab8ba0.
Wickham, Hadley. 2014. “Tidy Data.” Journal of Statistical Software 59 (1, 1): 1-23. https://doi.org/10.18637/jss.v059.i10.
Activities/Resources during the session
Use and manipulation of an anonymized dataset of customers of a U.S. retail company.
Familiarization with basic R data manipulation functions.
Post-session Activities/Resources
General Resources
Happy Git and GitHub for the useR by Jennifer Bryan
R Markdown: The Definitive Guide by Yihui Xie, J.J. Allaire & Garrett Grolemund
papaja: Reproducible APA manuscripts with R Markdown by Frederik Aust & Marius Barth