Neural Network Training with Difference of Convex Programming (DCP)

This is a research project (in joint work with Chuan He, ISyE) featuring neural network training with difference of convex probramming (DCP). When training neural networks, SGD usually stagnates at a bad local minima after some epochs. We aim to use DCP to avoid such situations. Towards this end, we proposed three versions of the algorithm specifically for training neural network, aiming for different scenaria:

One-pass DC algorithm: deals with small DC subproblem
Two-pass DC algorithm: deals with large DC subproblem
Stochastic DC algorithm: scales to large training size

Documentation and examples

The API documentation (all exported functions) of this package can be found in index.html. For an illustration of basic usage for some key functions, check examples.ipynb in the root directory.

Installation

Run the following to install Python pakcage nndcp locally:

# on a virtualenv or conda environment
git clone https://github.umn.edu/liu00980/nndcp.git
cd nndcp
pip install -e .[dev]

Usage

nndcp currently provides four modules:

`data`

Module data contains three processed real datasets Communities and Crime Data Set, Wine Quality Data Set and California Housing, ready for training. We provide three forms for each dataset: pandas dataframe with missing values imputed as medians, predictors (the first colum is all-one vector) with numerical variables standardized and categorical variables one-hot encoded, and standardized response. To get them, call:

calhousing = load_calhousing()
# pandas dataframe
calhousing_df = calhousing["calhousing_df"]
# predictors
X = calhousing["X"]
# response
Y = calhousing["Y"]

`utils.util`

Subpackage utils contains some helper functions. For now, it only has one module utils.util consisting of functions for transformation between different Python objects, intermediate calculation and etc. Please refer to the documentation of nndcp to check functions provided by this module.

`SGDtraining`

This module provides a pipeline for training neural network with SGD.

# example of using trainnn_sgd
trainnn_sgd(
    train_ds = train_dataset,
    val_ds = validation_dataset,
    batch_size = batch_size,
    train_model = train_model,
    loss_fn = torch.nn.MSELoss(),
    nepochs = 100,
    lr = 0.1,
    verbose = True,
    show = 10
)

`DCshallow`

This module implements the one-pass DC algorithm with shallow ReLU network for small DC subproblem. To perform the algorithm, call:

# example of using trainnn_dcshallow
trainnn_dcshallow(
    train_ds = train_dataset,
    val_ds = validation_dataset,
    width = width,
    train_model = train_model,
    lmda = 10.0,
    iterations = 20,
    verbose = True,
    solver = 'MOSEK'
)

The starting point can be a SGD-trained network model, which is specified by train_model. When the network is deep, one-pass algorithm might not be suitable due to computation efficiency.

Developing `nndcp`

To install nndcp, along with the tools you need to develop and run tests, run the following in your virtualenv or conda environment:

pip install -e .[dev]

This command builds a symbolic link to your Python project and install your package locally. Compared to usual pip install, this avoids transporting between your local project and the one in site-packages. It is helpful if you are constantly trying and testing the package.

Testing with `pytest`

One can test (as a developer) the usage of trainnn_dcshallow and trainnn_sgd with pytest. First, we need to install pytest:

pip install pytest

Then under tests/, run the following to check two existed test files:

pytest -s

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
__pycache__		__pycache__
build/lib		build/lib
dist		dist
docs		docs
src		src
tests		tests
LICENSE.txt		LICENSE.txt
MANIFEST.in		MANIFEST.in
README.md		README.md
examples.ipynb		examples.ipynb
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Neural Network Training with Difference of Convex Programming (DCP)

Documentation and examples

Installation

Usage

`data`

`utils.util`

`SGDtraining`

`DCshallow`

Developing `nndcp`

Testing with `pytest`

About

Releases

Packages

Languages

License

yifei-liu-stat/python-library-nndcp

Folders and files

Latest commit

History

Repository files navigation

Neural Network Training with Difference of Convex Programming (DCP)

Documentation and examples

Installation

Usage

data

utils.util

SGDtraining

DCshallow

Developing nndcp

Testing with pytest

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

`data`

`utils.util`

`SGDtraining`

`DCshallow`

Developing `nndcp`

Testing with `pytest`

Packages