This project provides R code to accompany the book "Real-World Machine Learning" as described in my original blog post about this project. It is meant to be a supplement to (not a replacement of) the book for those who wish to learn how to implement the code listings and generate the figures using the R programming language (instead of Python). Check out the rwml-R project page for HTML renderings of the markdown files.
The renv package is used to manage a reproducible environment of R packages.
The tidyverse
collection of packages is used for (``a coherent system of packages for data manipulation, exploration and visualization that share a common design philosophy'')[https://rviews.rstudio.com/2017/06/08/what-is-the-tidyverse/]. The main exceptions to this philosophy are the use of data.table for larger data (> 2 Gb), "Visualizing Categorical Data" (vcd
)
for exploring categorical data (including mosaic plots), and (`gridExtra`)[https://cran.r-project.org/package=gridExtra]
for combining and organizing plots.
Quarto is used for document authoring and publishing.
Assuming your development environment is configured correctly with renv
and Quarto
installed, and you have downloaded any proprietary data that is required, the markdown files should render as-is. If you want to run the code in interactive mode, you will need to update the project_dir
variable in the first code cell of each markdown document.