Introduction to Tensorflow2.0 with Python

About this Repo

The first part of the project was to learn Tensorflow2.0. I followed the tutorial given by Nour Islam Mokhtari. So, i you are interested in the work or want to learn Tensorflow, make sure to check him out.:grin:
The second part of the repo is to reproduce the work given by CHRIS DEOTTE, where he posted a great article about How to choose CNN Architecture MNIST👈 on Kaggle.com.
Here I chose several models mentioned in this article and create a comparison chart in regard of the validation accuracy. For more details, make sure to check it out if you are interested.:thumbsup:

Dataset

Both the first and second part of the project uses MNIST dataset, the very same dataset I used in the other repo, simple-neural-network-python. In that repo, I built a fully connected neural network from scratch using python. Feel free to check it out if you are interested.:raised_hands:

Part 1. Learning Tensorflow2.0, Keras

The first goal of this project is to learn how to use Tensorflow2.0 to classify handwritten digits. I demonstrated three methods to build a model in this framework. For detail implementation, please see ./models.py

Sequential API (The implementation can be found in ./mnist_example.py)
Functional Model API
Model Subclassing

Part 2. Model Analysis

The second goal was to reproduce the work of Chris Deotte. In his post, he shared the strategy of finding the best CNN architecture for MNIST. Among all the architectures he mentioned, I chose four of them and plot the validation accuracy performance of 5 and 10 epochs respectively. Here are the models,

Basic: 32C5-P2-64C5-p2-D128-D10
Deeper CNN: 32C3-32C3-P2-64C3-64C3-P2-D128-D10
Basic w/Dropout: 32C5-P2-Dp40%-64C5-P2-Dp40%-D128-Dp40%-D10
Deeper CNN w/Dropout,BatchNormalization: 32C3-BN-32C3-BN-P2-Dp40%-64C3-BN-64C3-BN-P2-BN-Dp40%-D128-D10
(32C5 means a convolution layer with 32 feature maps using a 5x5 filter and stride 1. P2 means max pooling using 2x2 filter and stride 2. BN means BatchNormalizer. Dp40% means 40% Dropout.)
To prevent repeating myself and for a cleaner code, I chose functional model API to build the model.(Detail implementation in ./models.py) and the rest of the code can be found in ./mnist_kaggle_best.py.

Result

Trains for 10 epochs
By comparing model 1(32C5) and model 2(32C3-32C3), we can see how adding convolution layers can effect the accuracy.
By comparing model 1(32C5) and model 3(32C5-Drop), we can see how adding dropout layers can effect the performance.
Finally, in model 4(32C3-32C3-BN-Drop), we get the best performance by adding more convolution layers, dropout layers, and batch normalization layers.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
result		result
.gitignore		.gitignore
README.md		README.md
mnist_example.py		mnist_example.py
mnist_kaggle_best.py		mnist_kaggle_best.py
models.py		models.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Introduction to Tensorflow2.0 with Python

About this Repo

Dataset

Part 1. Learning Tensorflow2.0, Keras

Part 2. Model Analysis

Result

About

Uh oh!

Releases

Packages

Languages

mike1393/intro-to-tensorflow2.0-python

Folders and files

Latest commit

History

Repository files navigation

Introduction to Tensorflow2.0 with Python

About this Repo

Dataset

Part 1. Learning Tensorflow2.0, Keras

Part 2. Model Analysis

Result

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages