This book aims to help business school students move beyond the spreadsheet and gain a basic understanding of data science, including data literacy, techniques such as data curation, basic coding environments such as R, and visualization platforms to guide a team’s data analyses.

For this book, you will need to have R installed on your machine or have created an account on as well as on GitHub.

To cite this book:

Thierry Warin. 2021. Data Pipeline with R.

    title = {Data Pipeline with {R}},
    url = {},
    abstract = {},
    author = {Warin, Thierry},
    year = {2021},
    doi = {}


I would like to thank Marine Leroi and Martin Paquette, as well as CIRANO (Montréal), for their help with this material. Any errors or omissions are mine.