This repository provides an implementation of two machine learning models for classifying the MNIST dataset: a Soft Decision Tree (SDT) and a Gradient Boosted Soft Decision Tree (GB_SDT). The goal of these models is to demonstrate the effectiveness of decision-tree-based approaches on image classification tasks, using MNIST as the benchmark.
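For intuition about the SDT architecture: a soft decision tree routes each input through every internal node with a sigmoid gate and mixes per-leaf class distributions according to the resulting path probabilities. The snippet below is a minimal, self-contained sketch of that idea in PyTorch; the class name and layout are hypothetical and are not the implementation found in this repository.

```python
# Minimal sketch of a soft decision tree (illustrative only; names are hypothetical).
import torch
import torch.nn as nn


class TinySoftDecisionTree(nn.Module):
    def __init__(self, in_features=28 * 28, num_classes=10, depth=5):
        super().__init__()
        self.depth = depth
        n_inner = 2 ** depth - 1           # internal (routing) nodes
        n_leaf = 2 ** depth                # leaf nodes
        # Each internal node is a linear unit followed by a sigmoid gate.
        self.inner = nn.Linear(in_features, n_inner)
        # Each leaf holds an unnormalised class distribution.
        self.leaf_logits = nn.Parameter(torch.randn(n_leaf, num_classes))

    def forward(self, x):
        x = x.view(x.size(0), -1)
        gate = torch.sigmoid(self.inner(x))            # (batch, n_inner)
        # Probability of reaching each node, computed level by level.
        path = torch.ones(x.size(0), 1, device=x.device)
        idx = 0
        for d in range(self.depth):
            n_nodes = 2 ** d
            g = gate[:, idx:idx + n_nodes]             # gates at this level
            # Go left with probability 1 - g, right with probability g.
            path = torch.stack((path * (1 - g), path * g), dim=2)
            path = path.view(x.size(0), n_nodes * 2)
            idx += n_nodes
        # Mix the leaf class distributions by the path probabilities.
        leaf_probs = torch.softmax(self.leaf_logits, dim=1)   # (n_leaf, classes)
        return path @ leaf_probs                               # (batch, classes)
```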
Before you begin, ensure you have met the following requirements:
- Python 3.6 or later
- PyTorch
- torchvision
- numpy
These dependencies can be installed using the requirements.txt file included in the repository.
To install the necessary packages, follow these steps:
- Clone the repository:

  ```bash
  git clone https://your-repository-url.git
  cd your-repository-directory
  ```

- Install the Python version listed in .python-version.
- Install the dependencies from requirements.txt:

  ```bash
  pip install -r requirements.txt
  ```
Train the Soft Decision Tree (SDT) model:

```bash
python sdt_train.py --data_dir ./data/mnist --epochs 50 --batch_size 128
```
Train the Gradient Boosted Soft Decision Tree (GB_SDT) model:

```bash
python gb_sdt_train.py --epochs 50 --batch_size 128 --n_trees 4 --depth 5
```
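As background for the --n_trees and --depth flags: gradient boosting typically trains trees one at a time, each fitted to the negative gradient (the residual) of the loss at the current ensemble prediction. The sketch below illustrates only that generic boosting loop, using a plain linear model as a stand-in base learner to stay short; it is an assumption about the general technique, not the training procedure implemented in gb_sdt_train.py.

```python
# Hypothetical sketch of the gradient-boosting loop behind a GB_SDT-style ensemble.
# Each new learner is fitted to the negative gradient of cross-entropy w.r.t. the
# current ensemble logits; a linear model stands in for the soft decision tree here.
import torch
import torch.nn as nn


def fit_boosted_ensemble(x, y, n_trees=4, num_classes=10, shrinkage=0.1, epochs=20):
    """x: (N, D) float tensor, y: (N,) long tensor of class labels."""
    ensemble = []
    logits = torch.zeros(x.size(0), num_classes)            # current ensemble output F(x)
    y_onehot = nn.functional.one_hot(y, num_classes).float()
    for _ in range(n_trees):
        # Negative gradient of cross-entropy w.r.t. the logits (the "residual").
        residual = y_onehot - torch.softmax(logits, dim=1)
        learner = nn.Linear(x.size(1), num_classes)          # stand-in base learner
        opt = torch.optim.Adam(learner.parameters(), lr=1e-2)
        for _ in range(epochs):                              # fit learner to the residual
            opt.zero_grad()
            loss = nn.functional.mse_loss(learner(x), residual)
            loss.backward()
            opt.step()
        ensemble.append(learner)
        with torch.no_grad():
            logits = logits + shrinkage * learner(x)         # shrinkage update
    return ensemble
```

Prediction from such an ensemble is the shrinkage-weighted sum of the individual learners' outputs, followed by a softmax over classes.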
For more options and customization, refer to the help of each script:

```bash
python sdt_train.py --help
python gb_sdt_train.py --help
```
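The --data_dir flag above points at ./data/mnist; one common way to materialise MNIST at such a path is through torchvision, as in the sketch below. This reflects a typical pipeline and is an assumption, not necessarily how sdt_train.py loads its data.

```python
# Hypothetical sketch: downloading MNIST into ./data/mnist with torchvision.
# The normalisation constants are the usual MNIST mean/std; the repository's
# scripts may use different transforms.
import torch
from torchvision import datasets, transforms

transform = transforms.Compose([
    transforms.ToTensor(),
    transforms.Normalize((0.1307,), (0.3081,)),
])
train_set = datasets.MNIST("./data/mnist", train=True, download=True, transform=transform)
test_set = datasets.MNIST("./data/mnist", train=False, download=True, transform=transform)

train_loader = torch.utils.data.DataLoader(train_set, batch_size=128, shuffle=True)
test_loader = torch.utils.data.DataLoader(test_set, batch_size=128, shuffle=False)
```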
- Training loss suddenly turns into NaN
  - Reason: the sigmoid function used in the internal nodes of the SDT can be unstable during training, since its gradient is very close to 0 when the absolute value of its input is large (see the sketch below).
  - Solution: using a smaller learning rate typically works.
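To make the saturation concrete, the short snippet below (illustrative only) evaluates the sigmoid gradient for a few input magnitudes; once an internal node's pre-activation becomes large, the gradient is essentially zero, so a large learning rate can push the gates into this flat region and destabilise training.

```python
# Illustrative only: sigma'(x) = sigma(x) * (1 - sigma(x)) collapses towards 0
# as |x| grows, which is what stalls or destabilises the internal-node gates.
import torch

x = torch.tensor([0.0, 2.0, 5.0, 10.0], requires_grad=True)
torch.sigmoid(x).sum().backward()
print(x.grad)  # roughly 0.25, 0.105, 0.0066, 4.5e-05 -- nearly zero for large inputs
```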
After training for 40 epochs with batch_size 128, the best testing accuracies of SDT models of depth 5 and 7 are 94.15% and 94.38%, respectively, which is close to the accuracy reported in the original paper. The related hyper-parameters are available in main.py. Better and more stable performance can be achieved by fine-tuning the hyper-parameters.
Below are the testing accuracy curve and training loss curve. The testing accuracy of SDT is evaluated after each training epoch.
This package was originally developed in Python 3.11.5. The names and versions of the packages used in SDT and GB_SDT are listed below. In my experience, it also works fine under other versions of Python or PyTorch.
- torch 2.1.2
- torchaudio 2.1.2
- torchvision 0.16.2