If this is your first foray into Neural Networks, welcome! I hope you enjoy yourself as much as I have.
This project is my first attempt at creating an application that allows for quick interactions with a basic neural network. I find that I learn better when I have something quick and easy to tinker with that allows me to analyze pieces of a concept at my own pace. Hopefully, this will help some of you that learn in a similar manner.
This project is written in C# and uses C# 6.0 Syntax. You will need an environment that is capable of compiling the C# 6.0 syntax in order to use this program.
Great question!
A Neural Network can be thought of as a series of nodes (or neurons) that are interconnected, much like they are in the brain. The network can have any number (N) of inputs and any number (M) of outputs. In between the inputs and outputs are a series of "hidden" neurons that make up the hidden layers of the network. These hidden layers provide the meat of the network and allow for some of the neat functionalities we can get out of a Neural Network.
Before explaining the pieces of a neural network, it might be helpful to start with an example.
Building off of this excellent article from 2013, let's use the concept of Exclusive Or (XOR). XOR will output true when the inputs differ:
Input A | Input B | Output |
---|---|---|
false | false | false |
false | true | true |
true | false | true |
true | true | false |
Considering this, let's break down a Neural Network into its three basic parts:
- The Inputs
- These are the inputs into the Neural Network. From the XOR example above, the inputs would be Input A and Input B.
- Each input can be considered a Neuron whose output is the initial input value.
- The Hidden Layers
- This is the meat of the Neural Network. This is where the magic happens. The Neurons in this layer are assigned weights for each of their inputs. These weights start off fairly random, but as the network is "trained" (discussed below), the weights are adjusted in order to make the neuron's output, and therefore the Neural Network's output closer to the expected result.
- The Outputs
- These are the outputs from the system. From the XOR example above, the output from the system would be either 'true' or 'false'. In the Neural Network, the Outputs are the last line of Neurons. These Neurons are also assigned a weight for each of their inputs and are "fed" by the Neurons in the hidden layer.
Using the XOR example, if we were to give our Neural Network the inputs 'true' and 'false' we would expect the system to return 'true'.
Because I love examples, here's another:
Input A | Input B | Input C | Output |
---|---|---|---|
true | false | false | true |
false | true | true | false |
true | fase | true | false |
true | true | true | true |
In the above table, we can infer the following patterns:
- The output is true if the number of inputs set to true is odd OR the number of inputs set to false is even.
- The output is false if the number of inputs set to true is even OR the number of inputs set to false is odd.
The job of the Neural Network is to try and figure out that pattern. It does this via training.
Training the Neural Network is accomplished by giving it a set of input data and the expected results for those inputs. This data is then continuously run through the Neural Network until we can be reasonably sure that it has a grasp of the patterns present in that data.
In this project, the Neural Network is trained via the following common Neural Network training methods:
- Back-Propagation
- After each set of inputs is run through the system and an output generated, that output is validated against the expected output.
- The percentage of error that results is then propagated backwards (hence the name) through the Hidden Layers of the Neural Network. This adjusts the weights assigned to each of a neuron's inputs in the Hidden Layers.
- Ideally, each Back-Propagation will bring the Neural Network's output closer to the expected output of the provided inputs.
- Biases
- Biases allow us to modify our activation function (discussed below) in order to generate a better output for each neuron.
- See Here for an excellent explanation as to what a bias does for a Neural Network.
- Momentum
- Used to prevent the system from converging to a local minimum.
- See Here
- Learning Rate
- This will change the overall learning speed of the system.
- See Here
A Neuron's output is defined by an Activation Function.
In our case, we are using a Sigmoid Function to define each Neuron's output. The Sigmoid function will convert any value to a value between 0 and 1. In the Neural Network, the Sigmoid functions will be used to generate initial weights and to help update percent errors.
This program is fairly simple to use.
When it starts, you will be presented with the option to load the Neural Network configuration from a data file. The file is currently set up to use the common XOR example. This means that the network will be initialized with the following:
- Number of Inputs: 2
- Number of Hidden Neurons: 10
- Number of Outputs: 1
Test Data:
Input A | Input B | Output |
---|---|---|
0 | 0 | 0 |
0 | 1 | 1 |
1 | 0 | 1 |
1 | 1 | 0 |
To change the file (data.txt):
- Make the first line as follows: {{number of inputs}} {{number of hidden neurons}} {{number of outputs}}
- Add any number of lines representing your data. Simply put in your inputs (space delmited) followed by your expected outputs (space delimited).
Alternatively, you can manually type in your own manual configuration and data sets. There is a fair amount of validation in place to guide you through the process.
Once the data sets are...set... the program will begin training the Neural Network. The network can be trained by number of epochs or by setting a minimum error. All of this can be set via the constants in the program. If minimum error is used, the network will be trained until the error threshold is reached. If the error threshold can't be reached, the training will stop when the number of epochs run reaches the maximum integer value.
After the Neural Network is trained, you will be asked to enter input data and verify the output. If the output is wrong, you can enter the correct output. The network will then be "retrained". If the input is correct, the network will be "encouraged".
If you used the XOR example and gave inputs of 0 and 1, you might see output values like: 0.985902....
This is the Neural Network attempting to get as close as possible to the expected output of 1. As the network is encouraged, this number will get higher.
I'm continuing to learn about neural networks and their applications. Currently, I'm working on an object-oriented Recurrent Neural Network. Once that project nears completion, I will put it into a repository and add the link here.
Feel free to reuse any part of this program.
The Network and its supporting classes are self-contained, meaning that the "UI" portion of the program only serves to gather the necessary information to instantiate the Network object and its supporting classes. You could theoretically take the Network and supporting classes and bring it into your own application with little to no modification. The network only requires the number of inputs, number of hidden neurons, the number of outputs and (optionally) a specified learning rate and momentum.
I forgot that people sometimes want to close their applications via the command line. When the program starts asking you to verify outputs, you can type "exit" to quit. I'm a bit tired and will implement a better way to do this in the future.
Hopefully my code is readable and and reusable for you. I put a lot of effort into maintaining best practices. It's a learning process and I welcome critique.
I used a few resources while building this project. I'm super thankful for those who have done a lot of work previously.
I am Trask - A Neural Network in 11 Lines of Python - This piqued my intrest in Neural Networks when it popped up on Reddit recently.
The Nature of Code - Chapter 10: Neural Networks - This was often able to answer some of my questions and made for a great read.
C# Backpropagation Tutorial - This was the initial C# project I looked at. I took and modified a few elements that I really liked such as the Sigmoid and Neuron classes.
A Step by Step Backpropagation Example - This was an excellent explanation of Back-Propagation and helped me tremendously with some of the math involved.
Coding Neural Network Back-Propagation Using C# - This was another great C# example. Dr. James McCaffrey (the author) has a lot of great insights in this article and others that he has written on the subject.
Simple C# Artificial Neural Network - This article played a large role in the November 2015 refactoring. This convinced me to get rid of the layer class altogether and helped clean up the network training code.