Textbooks and other resources
The following is a non-exhaustive list of free online textbooks and resources that use R
Textbooks/Readings
R Programming
R for Data Science – Garrett Grolemund and Hadley Wickham
- Open-source online version is available for free; Available for purchase online
- No official solution manual for the book exercises exists, but several can be found online, like this version by Jeffrey B. Arnold. Your exact solutions may vary, but these are a good starting point.
Hands-On Programming with R by Garrett Grolemund. This is a non-statistical introduction to R programming with many hands-on examples.
Advanced R – Hadley Wickham
- Hardcover available online, but the online version is free
- A deeper dive into R as a programming language, not just a tool for data science. Most of this material is best covered on your own after you are familiar with R.
Modern Data Science with R – Benjamin S. Baumer, Daniel T. Kaplan, Nicholas J. Horton
Statistics with R
Modern Dive: A moderndive into R and the tidyverse by Chester Ismay and Albert Y. Kim
Learning Statistics with R by Danielle Navarro
OpenIntro Statistics Open-source online version is available for free
An Introduction to Statistical Learning: with Applications in R – Gareth James, Daniela Witten, Trevor Hastie, and Robert Tibshirani
- Each chapter includes code that demonstrates how to implement different methods. Unfortunately, their code use a lot of base R functions and syntax, whereas our emphasis is on getting things done with the
tidyversecollection of R packages. However, this is still a great book and the code provided is useful. - You can download a free PDF of the entire book from the authors’ site
- Each chapter includes code that demonstrates how to implement different methods. Unfortunately, their code use a lot of base R functions and syntax, whereas our emphasis is on getting things done with the
Broadening Your Statistical Horizons is an applied textbook on generalized linear models, with all of the examples / code in R.
Tidy Modeling with R The purpose of this book is to demonstrate how the tidyverse and tidymodels can be used to produce high quality models.
Text Mining with R by Julia Silge and David Robinson. What happens if your data is text, rather than numbers? What if you wanted to do sentiment analysis?
Visualisations
The de-facto standard for visualisations in R is the ggplot2 package. If you want to read Hadley Wickham’s paper that implemented the grammar of graphics into R, you can find it here
Data Visualization: A Practical Introduction by Kieran Healy.
Fundamentals of Data Visualization by Claus O. Wilke.
R Graphics Cookbook A practical guide by Winston Chang that provides any specific examples/ recipes to help you generate high-quality graphs quickly. I use it as quick reference to get my ggplot working.
Interactive web-based data visualization with R, plotly, and shiny
Online resources
Data science and statistical programming can be challenging. Computers are dumb and tiny errors in your code can cause hours of frustration (even if you’ve been doing this stuff for years!).
Fortunately, there are tons of online resources to help you with this. Two of the most important are StackOverflow (a Q&A site with thousands of answers to all sorts of statistical and programming questions) and RStudio Community (a forum specifically designed for people using RStudio and the tidyverse).
I highly recommend subscribing to the R Weekly newsletter which is sent every Monday and is full of helpful tutorials and ideas on how to do stuff with R.
RStudio Cheatsheets Printable cheat sheets for common R tasks and features
Software
Typora is a lightweight, stand alone editor for Markdown documents
Companies, Government Agencies, and NGOs Using R
Podcasts
- Tim Harford’s More or Less explains and debunks the numbers and statistics used in political debate, the news and everyday life. A great episode on sampling can be found here
- Everything Hertz: A podcast by scientists, for scientists. Methodology, scientific life, and bad language