Long short-term memory (LSTM) networks are powerful machine learning models that are currently used for a wide range of applications such as speech recognition, music composition, human action recognition, and time-series prediction. This versatility stems from the universal nature of LSTM networks: given enough neural units and a suitable weight matrix, an LSTM can model and compute any conventional computing problem.
In our study, we stacked layers of LSTM networks for better performance. Taking inspiration from ConvNets for image classification, we implemented residual connections and highway connections between the LSTM layers. We then compared our models on various datasets.
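As a rough illustration of these inter-layer connections, the sketch below shows how a residual (additive skip) connection and a highway (gated skip) connection can be wired between stacked LSTM layers. It is a minimal TF 1.x sketch under our own naming assumptions, not the repository's exact code:

import tensorflow as tf

def highway_skip(x, h):
    # Highway connection: a learned transform gate T mixes the new
    # activation h with the layer input x.
    size = x.get_shape().as_list()[-1]
    W_T = tf.Variable(tf.random_normal([size, size], stddev=0.2))
    b_T = tf.Variable(tf.constant(-1.0, shape=[size]))  # bias toward carrying x
    T = tf.sigmoid(tf.tensordot(x, W_T, axes=1) + b_T)
    return T * h + (1.0 - T) * x

def stacked_skip_lstm(x, n_hidden, n_layers, use_highway=False):
    # Assumes x has shape [batch, time, n_hidden] so skip shapes match.
    output = x
    for layer in range(n_layers):
        cell = tf.nn.rnn_cell.LSTMCell(n_hidden)
        h, _ = tf.nn.dynamic_rnn(cell, output, dtype=tf.float32,
                                 scope='lstm_layer_%d' % layer)
        # Residual connection: add the layer input back to the layer output,
        # as in ResNets; the highway variant gates the mix instead.
        output = highway_skip(output, h) if use_highway else output + h
    return output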
We used TensorFlow as the deep learning framework and used its GPU support on an NVIDIA GeForce GTX 960M graphics card for faster parallel batch processing.
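For reference, a minimal sketch of pointing a TF 1.x session at the GPU without pre-allocating all of its memory (the exact session setup in our code may differ):

import tensorflow as tf

# Grow GPU memory usage on demand instead of reserving the whole card,
# which is helpful on a laptop GPU shared with the display.
session_config = tf.ConfigProto()
session_config.gpu_options.allow_growth = True
with tf.Session(config=session_config) as sess:
    sess.run(tf.global_variables_initializer())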
Dependencies: numpy, tensorflow, matplotlib, sklearn
main_runner.py sets all hyper-parameters and runs the models in a loop to find the best combination.
Classes with the data_handler suffix are responsible for data input and can be configured for different datasets without altering the model definitions.
Each model can also be run by itself to test its package (by default, the HAR dataset is used).
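The shape such a handler takes is roughly the following; the class and method names here are illustrative, not the repository's actual API:

import numpy as np

class ExampleDataHandler(object):
    """Hypothetical data_handler: loads one dataset into the
    [series, time_steps, features] layout the models expect."""
    def __init__(self, data_dir):
        self.data_dir = data_dir

    def load(self):
        # Return (X_train, y_train, X_test, y_test) as numpy arrays;
        # a real handler would parse the dataset's files here.
        raise NotImplementedError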
class DeepLSTMConfig(Config):
    def __init__(self):
        super(DeepLSTMConfig, self).__init__()
        self.train_count = len(X_train)      # 7352 training series
        self.test_data_count = len(X_test)   # 2947 testing series
        self.n_steps = len(X_train[0])       # 128 time steps per series

        # Training
        self.learning_rate = 0.005
        self.lambda_loss_amount = 0.0015
        self.training_epochs = 300
        self.batch_size = 1500

        # LSTM structure
        self.n_inputs = len(X_train[0][0])  # == 9 features: three 3D sensors over time
        self.n_hidden = 32                  # number of neurons per hidden layer
        self.n_classes = 6                  # final output classes
        self.W = {
            'hidden': tf.Variable(tf.random_normal([self.n_inputs, self.n_hidden])),
            'output': tf.Variable(tf.random_normal([self.n_hidden, self.n_classes]))
        }
        self.biases = {
            'hidden': tf.Variable(tf.random_normal([self.n_hidden], mean=1.0)),
            'output': tf.Variable(tf.random_normal([self.n_classes]))
        }
        self.keep_prob_for_dropout = 0.85
        self.bias_mean = 0.3
        self.weights_stddev = 0.2
        self.n_layers_in_highway = 0
        self.n_stacked_layers = 3
        self.batch_norm_enabled = True
        self.also_add_dropout_between_stacked_cells = False
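main_runner.py then sweeps the remaining hyper-parameters in nested loops, launching one training run per combination: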
if __name__ == '__main__':
    run_with_config = single_layer_lstm.run_with_config
    config = single_layer_lstm.config

    for learning_rate in [0.005, 0.0025, 0.003, 0.0005]:  # also tried [0.01, 0.007, 0.001, 0.0007, 0.0001]
        for decay in [0.9]:  # also tried [0.005, 0.01]
            for bn_enabled in [True, False]:
                for n_stacked in [1]:  # also tried 2, 3, 6
                    for epoch_count in [200, 300, 450]:
                        config.training_epochs = epoch_count
                        # must stay False: multiple runs clobber the log summary folder
                        config.tensor_board_logging_enabled = False
                        config.n_stacked_layers = n_stacked
                        config.batch_norm_enabled = bn_enabled
                        config.learning_rate = learning_rate
                        config.decay = decay
                        run_with_config(config)  # (trX, trY, teX, teY)
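With the value lists above, the grid expands to 4 learning rates x 1 decay x 2 batch-norm settings x 1 stack depth x 3 epoch counts = 24 training runs.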