Skip to content

Latest commit

 

History

History
179 lines (121 loc) · 10.3 KB

index.rst

File metadata and controls

179 lines (121 loc) · 10.3 KB

Deep Learning with Tensorflow Documentation

This project is a collection of various Deep Learning algorithms implemented using the TensorFlow library. This package is intended as a command line utility you can use to quickly train and evaluate popular Deep Learning models and maybe use them as benchmark/baseline in comparison to your custom models/datasets.

Requirements

  • python 2.7
  • tensorflow >= 0.8 (tested on tf 0.8 and 0.9)

Installation

Through pip::

pip install yadlt

Through github:

  • cd in a directory where you want to store the project, e.g. /home/me
  • clone the repository: git clone https://github.com/blackecho/Deep-Learning-TensorFlow.git
  • cd Deep-Learning-TensorFlow/dlmodels
  • now you can configure (see below) the software and run the models!

Command line utility Configuration

Available models

Below you can find a list of the available models along with an example usage from the command line utility. Please note that the parameters are not optimized in any way, I just put random numbers to show you how to use the program.

Convolutional Networks

Cmd example usage::

python cmd_line/convolutional/run_conv_net.py --dataset custom --main_dir convnet-models --model_name my.Awesome.CONVNET --train_dataset path/to/train_set.npy --valid_dataset path/to/valid_set.npy --test_set path/to/test_set.npy --layers conv2d-5-5-32-1,maxpool-2,conv2d-5-5-64-1,maxpool-2,full-1024,softmax --batch_size 50 --verbose 1 --learning_rate 1e-4 --opt adam --dropout 0.5

This command trains a Convolutional Network using the provided training, validation and testing sets, and the specified training parameters. The architecture of the model, as specified by the --layer argument, is:

  • 2D Convolution layer with 5x5 filters with 32 feature maps and stride of size 1
  • Max Pooling layer of size 2
  • 2D Convolution layer with 5x5 filters with 64 feature maps and stride of size 1
  • Max Pooling layer of size 2
  • Fully connected layer with 1024 units
  • Softmax layer

For the default training parameters please see cmd_line/run_conv_net.py. The TensorFlow trained model will be saved in config.models_dir/convnet-models/my.Awesome.CONVNET.

Recurrent Neural Network (LSTM)

Cmd example usage::

python cmd_line/recurrent/run_lstm.py --dataset ptb --main_dir lstm-models --ptb_dir /path/to/ptb/dataset

Instructions to download the ptb dataset:

Restricted Boltzmann Machine

Cmd example usage::

python cmd_line/boltzmann/run_rbm.py --dataset custom --main_dir rbm-models --model_name my.Awesome.RBM --train_dataset path/to/train_set.npy --valid_dataset path/to/valid_set.npy --num_hidden 250 --num_epochs 10 --batch_size 128 --learning_rate 0.0001 --gibbs_sampling_steps 3 --verbose 1

This command trains a RBM with 250 hidden units using the provided training and validation sets, and the specified training parameters. For the default training parameters please see cmd_line/run_rbm.py. The TensorFlow trained model will be saved in config.models_dir/rbm-models/my.Awesome.RBM.

Deep Belief Network

Stack of Restricted Boltzmann Machines used to build a Deep Network for supervised learning.

Cmd example usage::

python cmd_line/boltzmann/run_dbn.py --dataset mnist --main_dir dbn-models --model_name my-deeper-dbn  --verbose 1 --rbm_layers 512,256 --rbm_learning_rate 0.005 --rbm_num_epochs 15 --rbm_batch_size 25 --finetune_batch_size 25 --finetune_learning_rate 0.001 --finetune_num_epochs 10 --finetune_loss_func softmax_cross_entropy --finetune_dropout 0.7 --finetune_act_func relu

This command trains a DBN on the MNIST dataset. Two RBMs are used in the pretraining phase, the first is 784-512 and the second is 512-256. The training parameters of the RBMs can be specified layer-wise: for example we can specify the learning rate for each layer with: --rbm_learning_rate 0.005,0.1. In this case the fine-tuning phase uses dropout and the ReLU activation function.

Deep Autoencoder

Stack of Restricted Boltzmann Machines used to build a Deep Network for unsupervised learning.

Cmd example usage::

python cmd_line/boltzmann/run_deep_autoencoder.py --dataset cifar10 --cifar_dir path/to/cifar10 --main_dir deep-autoencoder --model_name deeper-is-better --rbm_layers 8192,2048,512,256 --rbm_batch_size 128 --finetune_batch_size 128 --finetune_loss_func cross_entropy --verbose 1

This command trains a Deep Autoencoder built as a stack of RBMs on the cifar10 dataset. The layers in the finetuning phase are 3072 -> 8192 -> 2048 -> 512 -> 256 -> 512 -> 2048 -> 8192 -> 3072, that's pretty deep.

Denoising Autoencoder

Cmd example usage::

python cmd_line/autoencoders/run_autoencoder.py --n_components 1024 --batch_size 64 --num_epochs 20 --verbose 1 --learning_rate 0.05 --corr_type masking --corr_frac 0.5 --enc_act_func sigmoid --dec_act_func sigmoid --loss_func cross_entropy --opt momentum --momentum 0.9 --main_dir dae-models --model_name dae1024

This command trains a Denoising Autoencoder on MNIST with 1024 hidden units, sigmoid activation function for the encoder and the decoder, and 50% masking noise. You can also initialize an Autoencoder to an already trained model by passing the parameters to its build_model() method. If you are using the command line, you can add the options --weights /path/to/file.npy, --h_bias /path/to/file.npy and --v_bias /path/to/file.npy. If you want to save the reconstructions of your model, you can add the option --save_reconstructions /path/to/file.npy and the reconstruction of the test set will be saved. You can also save the parameters of the model by adding the option --save_paramenters /path/to/file. Three files will be generated: file-enc_w.npy, file-enc_b.npy and file-dec_b.npy.

Stacked Denoising Autoencoder

Stack of Denoising Autoencoders used to build a Deep Network for supervised learning.

Cmd example usage::

python cmd_line/autoencoders/run_stacked_autoencoder_supervised.py --dae_layers 1024,784,512,256 --dae_batch_size 25 --dae_num_epochs 5 --verbose 1 --dae_corr_type masking --dae_corr_frac 0.0 --finetune_num_epochs 25 --finetune_batch_size 32 --finetune_opt momentum --momentum 0.9 --finetune_learning_rate 0.05 --dae_enc_act_func sigmoid --dae_dec_act_func sigmoid --dae_loss_func cross_entropy --finetune_act_func relu --finetune_loss_func softmax_cross_entropy --dropout 0.7

This command trains a Stack of Denoising Autoencoders 784 <-> 1024, 1024 <-> 784, 784 <-> 512, 512 <-> 256, and then performs supervised finetuning with ReLU units. This basic command trains the model on the training set (MNIST in this case), and print the accuracy on the test set. If in addition to the accuracy you want also the predicted labels on the test set, just add the option --save_predictions /path/to/file.npy. You can also get the output of each layer on the test set. This can be useful to analyze the learned model and to visualized the learned features. This can be done by adding the --save_layers_output /path/to/file. The files will be saved in the form file-layer-1.npy, file-layer-n.npy.

Stacked Deep Autoencoder

Stack of Denoising Autoencoders used to build a Deep Network for unsupervised learning.

Cmd example usage::

python cmd_line/autoencoders/run_stacked_autoencoder_unsupervised.py --dae_layers 512,256,128 --dae_batch_size 25 --dae_num_epochs 5 --verbose 1 --dae_corr_type masking --dae_corr_frac 0.0 --finetune_num_epochs 25 --finetune_batch_size 32 --finetune_opt gradient_descent --finetune_learning_rate 0.05 --dae_enc_act_func sigmoid --dae_dec_act_func sigmoid --dae_loss_func cross_entropy --finetune_enc_act_func tanh --finetune_dec_act_func sigmoid --finetune_loss_func cross_entropy --dropout 0.7

This command trains a Stack of Denoising Autoencoders 784 <-> 512, 512 <-> 256, 256 <-> 128, and from there it constructs the Deep Autoencoder model. The final architecture of the model is 784 <-> 512, 512 <-> 256, 256 <-> 128, 128 <-> 256, 256 <-> 512, 512 <-> 784. If you want to get the reconstructions of the test set performed by the trained model you can add the option --save_reconstructions /path/to/file.npy. Like for the Stacked Denoising Autoencoder, you can get the layers output by calling --save_layers_output_test /path/to/file for the test set and --save_layers_output_train /path/to/file for the train set. The Deep Autoencoder accepts, in addition to train validation and test sets, reference sets. These are used as reference samples for the model. For example, if you want to reconstruct frontal faces from non-frontal faces, you can pass the non-frontal faces as train/valid/test set and the frontal faces as train/valid/test reference. If you don't pass reference sets, they will be set equal to the train/valid/test set.

MultiLayer Perceptron

Just train a Stacked Denoising Autoencoder of Deep Belief Network with the --do_pretrain false option.

Utilities

Each model has the following utilities:

  • --seed n: set numpy and tensorflow random number generators to n
  • --restore_previous_model: restore a previously trained model with the same model_name and model architecture of the current model. Note: when using this feature with models that support pretraining (e.g. stacked_denoising_autoencoder) you should keep the --do_pretrain option to true and set the --num_epochs option to 0.

TODO list

  • Add Performace file with the performance of various algorithms on benchmark datasets
  • Variational Autoencoders
  • Reinforcement Learning implementation (Deep Q-Learning)