This repository contains code for constructing basic neural networks from scratch. Since it is a ground-up implementation, the use of third-party libraries is kept to a minimum: only numpy (for efficient matrix computation) and matplotlib (to visualise the results) are used, at least up till Version 2 of the network.
Dataset used: MNIST digits database
To better understand the code and computation in the algorithm, this series by 3Blue1Brown is highly recommended. Here, he gives a comprehensive overview and provides intuition for the math behind the algorithm.
- Latest Updates
- Installation Guide
- Usage
- Overview of My Results
- Interpreting Different Graphs
- Expansion With Data Augmentation
- Credits
Note that at the time of writing, I realised the axis display for the graphs was slightly off by one (results for epoch 1 were displayed at epoch 0 and so on). There was also a slight error in the backpropagation formula for network 2 that will yield slightly different results from the sample shown below, even with the exact same set-up and clone (after training for a certain number of epochs, this margin becomes negligible; still, for correctness' sake, the updated formula was committed).
The results graph for network 1 has been correctly formatted as shown. That said, the remaining models have already been trained and their result images saved and displayed, and I will leave them as such. Feel free to clone and run the program for the respective segments to verify whether there is any noticeable difference, or just tinker with the parameters and train your network to obtain better results.
NEW ADDITION: To further improve the results of your model, see here!
- Clone with
git clone https://github.com/4ndrelim/DeepLearningBuildNN.git
- Install third-party libraries (note that this is sufficient up till Network V2.2; a requirements.txt will be made for Network 3). On the command line, run:
$ pip3 install numpy
and
$ pip3 install matplotlib
- Training your network (via command-line on the terminal)
- To view just the numerical results (only the finalised network of each version is included here), run test.py in your preferred code editor or in the terminal:
$ python3 test.py
- To visualise the results of the different versions and sub-versions, run visualise.py in your preferred code editor or in the terminal:
$ python3 visualise.py
- Comment out unwanted training sections to save time
- Training your network (IDLE)
- If you do not already have IDLE (Python's bundled development environment) installed, install it here.
- To run the networks, launch test.py or visualise.py via IDLE and run the program.
- Toggling of hyperparameters
- Networks in visualise.py and test.py have their hyperparams specified and toggled in hyperparams.py (a rough sketch of what such a file might contain is shown after this list)
- It may be better to simply re-write the hyperparams in each individual Python file (in sub-directories under /visualisation) if, for some reason, one wishes to run several versions of Network 2 with different hyperparams in the same program (e.g. visualise.py)
- It should be unsurprising that the results of each epoch and the final accuracy are the same over many iterations of the program, since the same pseudo-random seed was specified; comment out the seed if this behaviour is unwanted
- Hyperparams can still be better selected to further improve the model; the most common way of selection is empirical, as shown in exploring suitable parameters
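For reference, here is a minimal sketch of what hyperparams.py might contain; the variable names and values below are assumptions for illustration, not the repo's actual contents:

```python
# Hypothetical sketch of hyperparams.py; actual names/values in the repo may differ.
import numpy as np

np.random.seed(0)       # fixed seed for reproducible runs; comment out if unwanted

EPOCHS = 30             # passes over the full training set
MINI_BATCH_SIZE = 10    # samples per stochastic gradient descent update
LEARNING_RATE = 0.5     # step size (eta) for gradient descent
LAMBDA = 5.0            # L2 regularization strength (Network V2.x onward)
```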
- Saving & loading your network
- A save and load function has been included in network_2.py to save the weights and biases of your trained model
- Instructions on usage are documented with the function
- Note that network.py does not have this save function, but you should be able to copy-paste it (save for minor edits) since the implementation of the save and load functions is independent of the network features; a rough sketch of the idea follows
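For illustration, a minimal sketch of how such save and load functions could be written; the names and details here are assumptions and may differ from the actual implementation in network_2.py:

```python
import json
import numpy as np

def save(net, filename):
    # Serialize layer sizes, weights, and biases to JSON. Assumes the network
    # keeps lists of numpy arrays in net.weights and net.biases.
    data = {"sizes": net.sizes,
            "weights": [w.tolist() for w in net.weights],
            "biases": [b.tolist() for b in net.biases]}
    with open(filename, "w") as f:
        json.dump(data, f)

def load(network_cls, filename):
    # Rebuild a network of class `network_cls` from a saved JSON file.
    with open(filename) as f:
        data = json.load(f)
    net = network_cls(data["sizes"])
    net.weights = [np.array(w) for w in data["weights"]]
    net.biases = [np.array(b) for b in data["biases"]]
    return net
```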
With the hyperparameters specified in hyperparams.py:
Network V1.0: 95.12% success rate
Network V2.2: 97.24% success rate
Network V3.0: PENDING
Results can be emulated by running test.py and can likely be further improved by training for more epochs.
The different graphs below can be reproduced by running visualise.py.
A conventional implementation of a neural network; its core feedforward step is sketched below. Read more here.
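As a rough illustration of the core computation, here is a minimal feedforward sketch in the Nielsen style this repository builds on; the actual code in network.py may differ:

```python
import numpy as np

def sigmoid(z):
    # Squash each weighted input into the (0, 1) range.
    return 1.0 / (1.0 + np.exp(-z))

def feedforward(weights, biases, a):
    # Propagate input activation `a` through every layer:
    # each layer computes sigmoid(W . a + b).
    for w, b in zip(weights, biases):
        a = sigmoid(np.dot(w, a) + b)
    return a
```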
Here we explore and build a stronger network with the following modifications (sketched in code after this list):
a) L2 regularization
b) Cross-entropy cost function
c) Better initialization of weights
d) Early stopping
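A minimal sketch of how modifications (a) to (c) typically look in code; the function names are illustrative, and the actual Network 2 implementation may differ:

```python
import numpy as np

def cross_entropy_cost(a, y):
    # Cross-entropy between output activations `a` and targets `y`;
    # np.nan_to_num guards against log(0) when activations saturate.
    return np.sum(np.nan_to_num(-y * np.log(a) - (1 - y) * np.log(1 - a)))

def l2_weight_update(w, nabla_w, eta, lmbda, n, batch_size):
    # L2 regularization acts as "weight decay": each update first shrinks w
    # by a factor (1 - eta * lmbda / n), then applies the usual gradient step.
    return (1 - eta * lmbda / n) * w - (eta / batch_size) * nabla_w

def init_weights(sizes):
    # Better initialization: scale Gaussian weights by 1/sqrt(fan-in)
    # so early activations do not saturate the sigmoid.
    return [np.random.randn(y, x) / np.sqrt(x)
            for x, y in zip(sizes[:-1], sizes[1:])]
```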
- Version 2.0
- Network is implemented with L2 regularization and uses the cross-entropy cost function to improve performance. Read more here.
- Version 2.1
- Network implemented with better initialization of weights.
- Version 2.2
- Network implemented with early stopping (a rough sketch of the idea follows). Read more here.
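A minimal sketch of the early-stopping idea (halt once validation accuracy has not improved for a set number of epochs); the callables below are assumptions, not the repo's actual API:

```python
def sgd_with_early_stopping(train_one_epoch, validation_accuracy,
                            max_epochs=100, patience=10):
    # `train_one_epoch` runs one epoch of SGD; `validation_accuracy`
    # evaluates the model on held-out data.
    best, since_best = 0.0, 0
    for epoch in range(max_epochs):
        train_one_epoch()
        acc = validation_accuracy()
        if acc > best:
            best, since_best = acc, 0
        else:
            since_best += 1
        if since_best >= patience:  # no improvement for `patience` epochs
            break
    return best
```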
The network is further extended with convolutional layers before the dense layers. Coming soon!
The original dataset of 50,000 training images was expanded to 250,000 images with a series of up-down-left-right translations of pixels. Data augmentation generally helps to regularize the network and avoid over-fitting. The greatly expanded dataset gave the model more training examples with variation; hence, re-running network 2 in test.py easily gave results of > 98% accuracy (98.12%).
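A minimal sketch of the kind of one-pixel translation involved (the actual expand_mnist_data.py may implement this differently); the original image plus four shifts yields five images each, matching the 50,000-to-250,000 expansion:

```python
import numpy as np

def shift_image(image, dx, dy):
    # Translate a flattened 28x28 MNIST image by (dx, dy) pixels.
    img = image.reshape(28, 28)
    shifted = np.roll(img, shift=(dy, dx), axis=(0, 1))
    # np.roll wraps pixels around the edges, so zero out the wrapped rows/cols.
    if dy > 0:   shifted[:dy, :] = 0
    elif dy < 0: shifted[dy:, :] = 0
    if dx > 0:   shifted[:, :dx] = 0
    elif dx < 0: shifted[:, dx:] = 0
    return shifted.reshape(784, 1)

# The original plus one-pixel shifts in four directions: 5x the training data.
offsets = [(0, 0), (1, 0), (-1, 0), (0, 1), (0, -1)]
```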
If you wish to download the dataset first but not run the network yet:
- Run expand_mnist_data.py in the root folder and the expanded data should be in data/
- In mnist_loader.py, simply uncomment the line to load the expanded data file
- Launch the respective programs and run as per normal
Otherwise, if you wish to run the network directly:
- In mnist_loader.py, simply uncomment the 2 lines to check for and load the expanded data file (a sketch of the loading step follows this list)
- Launch the respective programs and run as per normal
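For reference, a minimal sketch of what loading the expanded dataset inside mnist_loader.py might look like; the file path and variable names below follow Nielsen's conventions and are assumptions that may differ in this repo:

```python
import gzip
import pickle

# Default dataset:
# path = "data/mnist.pkl.gz"
# Uncomment to use the expanded dataset instead:
path = "data/mnist_expanded.pkl.gz"

with gzip.open(path, "rb") as f:
    training_data, validation_data, test_data = pickle.load(f, encoding="latin1")
```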
Note that only a basic form of augmentation was done here; one can consider implementing the following to achieve even better results (a rotation sketch follows this list):
- Rotating (in the context of digits, however, rotation must not be done to a large extent; case in point: '6' and '9' may be misclassified)
- More shifts
- Random elastic deformations
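A minimal sketch of small-angle rotation, assuming scipy is available (note that it is not among the repo's current dependencies):

```python
import numpy as np
from scipy.ndimage import rotate  # assumes scipy is installed

def rotate_image(image, max_degrees=10):
    # Rotate a flattened 28x28 image by a small random angle; angles are kept
    # small so digits like '6' and '9' remain distinguishable.
    angle = np.random.uniform(-max_degrees, max_degrees)
    img = image.reshape(28, 28)
    out = rotate(img, angle, reshape=False, mode="constant", cval=0.0)
    return np.clip(out, 0.0, 1.0).reshape(784, 1)
```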
Referenced "Neural Networks and Deep Learning".
Note: Some of the content in this repository was originally forked from "neural-networks-and-deep-learning" by Michael Nielsen, but I eventually decided to make a personal repository given the extent of my own changes.
Implementation of some parts may differ significantly from the author's, to allow for:
a) Further optimization
b) Additional features
c) Compatibility with Python3
d) My own learning!