Skip to content

tgritsaev/coursework-tabular

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Researching neural network qualities in the tabular data domain (TDD)

This repository contains code for:

  • visualizing neural network losslandscapes in the TDD
  • the presence of the grokking effect in the TDD
  • neural networks ability to pruning in the TDD
  • experiments for finding optimal hyperparameter search algorithm in the TDD

Relevant papers

Starting points

Overview of the code

The following "guidelines" are not strict at all, but they may be helpful:

  • bin for experimental code (i.e. where the research is happening) and for pipelines that produce artifacts
  • lib for code that is unlikely to change
  • dev private directory, do not publish
  • research for research *.ipynb files

Set up the environment

Software

Preliminaries:

# clone the repo somehow
export PROJECT_DIR=$(pwd)/tabular-hpo  # path to the repository root
cd $PROJECT_DIR

conda create -n template python=3.10.9
conda activate template

pip install torch==1.13.1+cu116 -f https://download.pytorch.org/whl/torch_stable.html
pip install -r requirements.txt

conda env config vars set PYTHONPATH=${PROJECT_DIR}:
conda env config vars set PROJECT_DIR=${PROJECT_DIR}

conda deactivate
conda activate template

Data

cd $PROJECT_DIR
wget https://zenodo.org/record/7612634/files/data.tar?download=1 -O data.tar

Example

Once everything is ready, this should work:

python bin/go.py exp/mlp/california/0-tuning

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published