Bayesian Measurement Models for Online Sparse Settings

The rise of popular mobile education applications produced data where a large number of students answers a small subset of items in a large question bank. Traditional linking approaches from the education measurement literature cannot scale to this context.

We propose and evaluate a set of models designed to overcome traditional limitations. The models take advantage of factorization techniques and Bayesian variational inference to meet the needs of this context.

Author:

Zhaolei (Henry) Shi -- [email protected]

Introduction

Our code base leverage Pyro's infrastructure for Bayesian stochastic variational inference.

We implement the following models, Y_ij = 1 denotes that student i responded correctly to question j:

2param model: p( Y_ij = 1 | theta_i, alpha_i, beta_j) = 1 / (1 + exp(- alpha_j * (theta_i - beta_j))
Factorization model: p(Y_ij = 1 | theta_i, delta_j, beta_j) = 1 / (1 + exp(- (inner_prod(theta_i, delta_j) - beta_j))
Hierarchical model: similar to the factorization model, but now delta_j is replaced with the concatenation of 1) a vector of trainable parameters, 2) one or more matmul(H, X_j), the matrix transformations of observed question characteristics. H is a matrix of trainable parameters.

Code structure

pyro_model.py implements the models and the likelihood functions.
holder.py wraps models in Model classes to allow for easy training and prediction, also contains classes for data loaders.
dataset.py implements functions to parse the dataset and return the appropriate data loader.
experiment.py implements the experiment classes to automate training, auto-stop after convergence, and evaluation.
custom_tvt.py implements customized ways to divide the dataset into training, validation, and test sets (e.g. ensuring any question will show up in all three sets).
make_prediction.py creates predictions for learning curve analysis.
run_experiment.py creates models and data loaders and runs training and prediction.

Running an example

You can run the model on a small dataset using the following line in a terminal:

python run_experiment.py

Reading through run_experiment.py will give you a good sense of how the different pieces fit together.

Learning Pyro

To get a better sense of how Pyro works, you can follow the examples in pyro_tutorial (copied from Pyro's documentation). Also, consider reading through a minimal implementation of Pyro. This was instrumental for my own understanding of Pyro.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
pyro_tutorial		pyro_tutorial
work_dir		work_dir
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py
custom_tvt.py		custom_tvt.py
dataset.py		dataset.py
experiment.py		experiment.py
holder.py		holder.py
make_prediction.py		make_prediction.py
model_utils.py		model_utils.py
pyro_model.py		pyro_model.py
run_experiment.py		run_experiment.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Bayesian Measurement Models for Online Sparse Settings

Introduction

Code structure

Running an example

Learning Pyro

About

Releases

Packages

Languages

gsbDBI/bm_model

Folders and files

Latest commit

History

Repository files navigation

Bayesian Measurement Models for Online Sparse Settings

Introduction

Code structure

Running an example

Learning Pyro

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages