fivethirtyeight

An R package that provides access to the code and data sets published by FiveThirtyEight https://github.com/fivethirtyeight/data. Note that while we received guidance from editors at 538, this package is not officially published by 538.

Installation

Get the latest released version from CRAN:

install.packages("fivethirtyeight")

Or the development version from GitHub:

# If you haven't installed the remotes package yet, do so:
# install.packages("remotes")
remotes::install_github("rudeboybert/fivethirtyeight", build_vignettes = TRUE)

Usage

All data in the fivethirtyeight package are lazy-loaded, so you can access any dataset without running data():

library(fivethirtyeight)

head(bechdel)
?bechdel

# If using RStudio:
View(bechdel)

To see a detailed list of all 128 datasets, including information on the corresponding articles published on FiveThirtyEight.com, click here.

Add-on Package

There are also 19 datasets that could not be included in fivethirtyeight due to CRAN package size restrictions:

#>  [1] "castle_solutions"           "castle_solutions_2"        
#>  [3] "castle_solutions_3"         "comic_characters"          
#>  [5] "goose"                      "house_district_forecast"   
#>  [7] "mayweather_mcgregor_tweets" "mlb_elo"                   
#>  [9] "nba_all_elo"                "nba_carmelo"               
#> [11] "nba_elo"                    "nfl_elo"                   
#> [13] "quasi_winshares"            "raptor_by_player"          
#> [15] "raptor_by_team"             "ratings"                   
#> [17] "senators"                   "spi_matches"               
#> [19] "twitter_presidents"

These 19 datasets are included in the fivethirtyeightdata add-on package, which you can install by running:

install.packages('fivethirtyeightdata', repos = 'https://fivethirtyeightdata.github.io/drat/', type = 'source')

So for example, to load the senators dataset, run:

library(fivethirtyeight)
library(fivethirtyeightdata)
senators

Article in “Technology Innovations in Statistics Education”

The fivethirtyeight package was featured in The fivethirtyeight R Package: “Tame Data” Principles for Introductory Statistics and Data Science Courses by Kim, Ismay, and Chunn (2018) published in Volume 11, Issue 1 of the journal “Technology Innovations in Statistics Education”.

Abstract: As statistics and data science instructors, we often seek to use data in our courses that are rich, real, realistic, and relevant. To this end we created the fivethirtyeight R package of data and code behind the stories and interactives at the data journalism website FiveThirtyEight.com. After a discussion on the conflicting pedagogical goals of “minimizing prerequisites to research” (Cobb 2015) while at the same time presenting students with a realistic view of data as it exists “in the wild,” we articulate how a desired balance between these two goals informed the design of the package. The details behind this balance are articulated as our proposed “Tame data principles for introductory statistics and data science courses.” Details of the package’s construction and example uses are included as well.

Data Analysis Examples in Vignettes

For some data sets, there are user-contributed example analyses in the form a package vignette. For example, look at “Bechdel analysis using the tidyverse” based on the bechdel dataset used in the article The Dollar-And-Cents Case Against Hollywood’s Exclusion of Women. For a complete list of vignettes run

vignette("user_contributed_vignettes", package = "fivethirtyeightdata")

More Information

Andrew Flowers gave a great demonstration of the package and the bechdel vignette during his rstudio::conf talk in Orlando, Florida in January 2017. The video of his talk is available here.
Click this Google Sheet for a master spreadsheet connecting
1. the original 538 data on GitHub with
2. the data frames in the package with
3. information on the corresponding article

Name		Name	Last commit message	Last commit date
Latest commit History 671 Commits
.github		.github
R		R
data-raw		data-raw
data		data
images_for_guide		images_for_guide
inst		inst
man		man
vignettes		vignettes
.Rbuildignore		.Rbuildignore
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
DESCRIPTION		DESCRIPTION
LICENSE		LICENSE
NAMESPACE		NAMESPACE
NEWS.md		NEWS.md
README.Rmd		README.Rmd
README.md		README.md
cran-comments.md		cran-comments.md
data_import_procedure.md		data_import_procedure.md
fivethirtyeight.Rproj		fivethirtyeight.Rproj

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

fivethirtyeight

Installation

Usage

Add-on Package

Article in “Technology Innovations in Statistics Education”

Data Analysis Examples in Vignettes

More Information

About

Releases 8

Packages

Contributors 26

Languages

License

rudeboybert/fivethirtyeight

Folders and files

Latest commit

History

Repository files navigation

fivethirtyeight

Installation

Usage

Add-on Package

Article in “Technology Innovations in Statistics Education”

Data Analysis Examples in Vignettes

More Information

About

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases 8

Packages 0

Contributors 26

Languages

Packages