
MNIST Dataset - Classifying Handwritten Digits

Last Updated: 06/09/24

Outstanding Tasks

  1. None

Index

  1. Analysis Goal
  2. Modeling
    1. Simple feed-forward neural network
    2. Feed-forward neural network with data augmentation
    3. Convolutional neural network with data augmentation

1. Analysis Goal

Classify handwritten digits from the MNIST database

2. Modeling

A. Simple feed-forward neural network

98.44% Accuracy

A fully connected neural network that takes a 28x28 grayscale image as input, flattens it, passes it through three hidden dense layers with ReLU activation, and outputs a probability distribution over 10 classes using softmax activation:

Model Architecture

  1. Input: (28, 28, 1)
  2. Flatten: (784)
  3. Dense: (784) to (512), ReLU
  4. Dense: (512) to (256), ReLU
  5. Dense: (256) to (128), ReLU
  6. Output Dense: (128) to (10), softmax

The model was compiled with the Adam optimizer, categorical cross-entropy loss, and accuracy as the evaluation metric, and trained for 20 epochs.
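
Below is a minimal Keras sketch of this architecture. The layer sizes follow the list above; details not stated in this README (batch size, use of the test set for validation) are illustrative assumptions.

```python
import tensorflow as tf
from tensorflow.keras import layers, models

# Load MNIST, scale pixels to [0, 1], and add a channel dimension -> (28, 28, 1)
(x_train, y_train), (x_test, y_test) = tf.keras.datasets.mnist.load_data()
x_train = x_train[..., None] / 255.0
x_test = x_test[..., None] / 255.0
y_train = tf.keras.utils.to_categorical(y_train, 10)
y_test = tf.keras.utils.to_categorical(y_test, 10)

model = models.Sequential([
    layers.Input(shape=(28, 28, 1)),
    layers.Flatten(),                       # (28, 28, 1) -> (784,)
    layers.Dense(512, activation="relu"),
    layers.Dense(256, activation="relu"),
    layers.Dense(128, activation="relu"),
    layers.Dense(10, activation="softmax"),
])

model.compile(optimizer="adam",
              loss="categorical_crossentropy",
              metrics=["accuracy"])
model.fit(x_train, y_train, epochs=20,
          batch_size=128,                   # batch size is an assumption
          validation_data=(x_test, y_test))
```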

B. Feed-forward neural network with data augmentation

98.71% Accuracy

This model reuses the feed-forward network described above, with data augmentation applied to the training data to improve the model's generalizability:

Data Augmentation

  • Rotation: Randomly rotate up to 10 degrees
  • Zoom: Randomly zoom in up to 10%
  • Width shift: Randomly shift horizontally up to 10% of width
  • Height shift: Randomly shift vertically up to 10% of height

The model was compiled with the Adam optimizer, categorical cross-entropy loss, and accuracy as the evaluation metric, and trained for 20 epochs.
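
A sketch of how this augmentation can be applied with Keras' ImageDataGenerator, reusing the `model`, `x_train`/`y_train`, and test arrays from the previous sketch; the batch size is again an assumption.

```python
from tensorflow.keras.preprocessing.image import ImageDataGenerator

datagen = ImageDataGenerator(
    rotation_range=10,        # rotate up to 10 degrees
    zoom_range=0.10,          # zoom in up to 10%
    width_shift_range=0.10,   # shift horizontally up to 10% of width
    height_shift_range=0.10,  # shift vertically up to 10% of height
)

# Train on a stream of randomly augmented batches; the validation data is untouched.
model.fit(datagen.flow(x_train, y_train, batch_size=128),
          epochs=20,
          validation_data=(x_test, y_test))
```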

C. Convolutional neural network with data augmentation

99.43% Accuracy

A convolutional neural network (CNN) that takes a 28x28 grayscale image as input, processes it through several convolutional and pooling layers followed by dense layers, and outputs a probability distribution over 10 classes using softmax activation. The same data augmentation used for the feed-forward neural network is applied to the training data to improve the model's generalizability:

Model Architecture

  1. Input: (28, 28, 1), Conv2D with 32 filters, kernel size (3, 3), ReLU
  2. MaxPooling2D: Pool size (2, 2)
  3. Conv2D: 64 filters, kernel size (3, 3), ReLU
  4. MaxPooling2D: Pool size (2, 2)
  5. Conv2D: 64 filters, kernel size (3, 3), ReLU
  6. Flatten: (None, 576)
  7. Dense: (576) to (128), ReLU
  8. Dropout: rate 0.50
  9. Output Dense: (128) to (10), softmax

The model was compiled with the Adam optimizer, categorical cross-entropy loss, and accuracy as the evaluation metric, and trained for 20 epochs.
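
A minimal Keras sketch of this CNN, reusing the augmentation generator and data arrays from the previous sketches. The README does not state the convolution padding; with Keras' default 'valid' padding the flattened feature map has 3x3x64 = 576 values, and the batch size is an assumption.

```python
from tensorflow.keras import layers, models

cnn = models.Sequential([
    layers.Input(shape=(28, 28, 1)),
    layers.Conv2D(32, (3, 3), activation="relu"),  # -> (26, 26, 32)
    layers.MaxPooling2D((2, 2)),                   # -> (13, 13, 32)
    layers.Conv2D(64, (3, 3), activation="relu"),  # -> (11, 11, 64)
    layers.MaxPooling2D((2, 2)),                   # -> (5, 5, 64)
    layers.Conv2D(64, (3, 3), activation="relu"),  # -> (3, 3, 64)
    layers.Flatten(),                              # -> (576,)
    layers.Dense(128, activation="relu"),
    layers.Dropout(0.50),                          # regularization
    layers.Dense(10, activation="softmax"),
])

cnn.compile(optimizer="adam",
            loss="categorical_crossentropy",
            metrics=["accuracy"])
cnn.fit(datagen.flow(x_train, y_train, batch_size=128),
        epochs=20,
        validation_data=(x_test, y_test))
```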
