A from-scratch implementation of a neural network for handwritten digit classification (0-9) using only NumPy, featuring the Adam optimization algorithm. This project demonstrates the fundamentals of deep learning by building a complete neural network without high-level frameworks.
This project implements a 3-layer neural network that classifies handwritten digits from the MNIST dataset. The implementation compares the performance of Adam optimization against standard gradient descent, showcasing the effectiveness of adaptive learning rate methods.
- Pure NumPy Implementation: No deep learning frameworks (TensorFlow, PyTorch) used for the core network
- Adam Optimization: Full implementation of the Adam optimizer as described in the original paper
- Performance Comparison: Side-by-side comparison of Adam-optimized vs. standard gradient descent
- High Accuracy: Achieves 100% accuracy on the training set with Adam optimization
- Educational: Clear, documented code ideal for learning neural network fundamentals
```
├── Image_classification_neural_network_numpy-Adam Optimization.ipynb
├── README.md
└── LICENCE
```
The network consists of three layers:
| Layer | Type | Neurons | Activation |
|---|---|---|---|
| Input | Dense | 784 (28×28 pixels) | - |
| Hidden 1 | Dense | 128 | ReLU |
| Hidden 2 | Dense | 40 | ReLU |
| Output | Dense | 10 (digits 0-9) | Softmax |
Loss Function: Mean Squared Error (MSE)
Optimization: Adam (β₁=0.9, β₂=0.99, ε=1e-8)
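The layer shapes in the table above can be set up directly in NumPy. The sketch below uses He-style scaling, a common choice for ReLU layers; the notebook's exact initialization scheme may differ, so treat this as illustrative:

```python
import numpy as np

rng = np.random.default_rng(0)

# Layer sizes from the architecture table: 784 -> 128 -> 40 -> 10.
layer_sizes = [784, 128, 40, 10]

# He-style initialization (scale by sqrt(2 / fan_in)) suits ReLU layers;
# this is an assumption, not necessarily what the notebook uses.
params = {}
for l in range(1, len(layer_sizes)):
    fan_in = layer_sizes[l - 1]
    params[f"W{l}"] = rng.standard_normal((layer_sizes[l], fan_in)) * np.sqrt(2.0 / fan_in)
    params[f"b{l}"] = np.zeros((layer_sizes[l], 1))
```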
- Source: Kaggle Digit Recognizer Competition
- Training Set: 42,000 labeled images
- Test Set: 28,000 unlabeled images
- Image Format: 28×28 grayscale pixels (784 features)
```
numpy
pandas
matplotlib
scikit-learn
tensorflow  # only used for validation metrics
pillow
```
1. Clone the repository

   ```
   git clone https://github.com/jvachier/Image_classification_neural_network_numpy-Adam-Optimization.git
   cd Image_classification_neural_network_numpy-Adam-Optimization
   ```

2. Install dependencies

   ```
   pip install numpy pandas matplotlib scikit-learn tensorflow pillow
   ```

3. Download the dataset
   - Download `train.csv` and `test.csv` from the Kaggle Digit Recognizer competition
   - Place them in the project directory

4. Run the notebook

   ```
   jupyter notebook "Image_classification_neural_network_numpy-Adam Optimization.ipynb"
   ```
The Adam (Adaptive Moment Estimation) optimizer combines the advantages of two popular methods:
- RMSprop: Uses adaptive learning rates
- Momentum: Accelerates convergence in relevant directions
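A single Adam update can be sketched in a few lines of NumPy, using the hyperparameters quoted above (β₁=0.9, β₂=0.99, ε=1e-8). The helper name `adam_step` and the toy quadratic objective are illustrative, not taken from the notebook:

```python
import numpy as np

def adam_step(theta, grad, m, v, t, lr=0.001, beta1=0.9, beta2=0.99, eps=1e-8):
    """One Adam update for parameter theta at step t (t starts at 1)."""
    m = beta1 * m + (1 - beta1) * grad        # first moment (momentum term)
    v = beta2 * v + (1 - beta2) * grad ** 2   # second moment (RMSprop term)
    m_hat = m / (1 - beta1 ** t)              # bias correction
    v_hat = v / (1 - beta2 ** t)
    theta = theta - lr * m_hat / (np.sqrt(v_hat) + eps)
    return theta, m, v

# Toy example: minimise f(x) = x^2, whose gradient is 2x.
x = np.array(5.0)
m, v = np.zeros(()), np.zeros(())
for t in range(1, 201):
    x, m, v = adam_step(x, 2 * x, m, v, t, lr=0.1)
```

Because of the bias correction, the very first step has magnitude close to the learning rate regardless of the gradient's scale, which is part of why Adam converges quickly early in training.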
Forward propagation through the network is:

- Layer 1: `Z[1] = W[1]X + b[1]`, `A[1] = ReLU(Z[1])`
- Layer 2: `Z[2] = W[2]A[1] + b[2]`, `A[2] = ReLU(Z[2])`
- Layer 3: `Z[3] = W[3]A[2] + b[3]`, `A[3] = Softmax(Z[3])`
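The three layers above translate to a short NumPy forward pass. Parameter names follow the `W[l]`, `b[l]` notation; the tiny random initialization here is purely illustrative:

```python
import numpy as np

def relu(z):
    return np.maximum(0, z)

def softmax(z):
    # Subtract the column-wise max for numerical stability.
    e = np.exp(z - z.max(axis=0, keepdims=True))
    return e / e.sum(axis=0, keepdims=True)

def forward(X, p):
    """X: (784, m) batch of flattened images; p: dict with W1..W3, b1..b3."""
    Z1 = p["W1"] @ X + p["b1"]; A1 = relu(Z1)
    Z2 = p["W2"] @ A1 + p["b2"]; A2 = relu(Z2)
    Z3 = p["W3"] @ A2 + p["b3"]; A3 = softmax(Z3)
    return A3

rng = np.random.default_rng(0)
p = {"W1": rng.standard_normal((128, 784)) * 0.01, "b1": np.zeros((128, 1)),
     "W2": rng.standard_normal((40, 128)) * 0.01,  "b2": np.zeros((40, 1)),
     "W3": rng.standard_normal((10, 40)) * 0.01,   "b3": np.zeros((10, 1))}
A3 = forward(rng.random((784, 5)), p)  # each column is a probability vector
```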
Gradients are computed using the chain rule and used to update weights and biases through the Adam optimizer.
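A standard way to verify chain-rule gradients like these is a finite-difference check. The single linear layer with MSE loss below is a simplified stand-in for the notebook's full backward pass; the same comparison can be applied to any layer's gradient:

```python
import numpy as np

rng = np.random.default_rng(1)
W = rng.standard_normal((3, 4))
x = rng.standard_normal((4, 1))
y = rng.standard_normal((3, 1))

def loss(W):
    # MSE over the 3 outputs of a linear layer.
    return np.mean((W @ x - y) ** 2)

# Analytic gradient via the chain rule: dL/dW = (2/3) * (W x - y) x^T.
grad = (2 / 3) * (W @ x - y) @ x.T

# Central finite differences as an independent check.
num = np.zeros_like(W)
eps = 1e-6
for i in range(W.shape[0]):
    for j in range(W.shape[1]):
        Wp = W.copy(); Wp[i, j] += eps
        Wm = W.copy(); Wm[i, j] -= eps
        num[i, j] = (loss(Wp) - loss(Wm)) / (2 * eps)
```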
- Training Accuracy: 100% (with Adam optimization)
- Convergence: Significantly faster with Adam compared to standard gradient descent
- Visualization: Includes training curves for loss, accuracy, MSE, and R² score
- Kingma, D. P., & Ba, J. (2014). Adam: A Method for Stochastic Optimization. arXiv preprint arXiv:1412.6980.
- Kaggle Notebook: Classification with Neural Network - Adam - NumPy
- Dataset: Kaggle Digit Recognizer
This project is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). See the LICENCE file for details.
jvachier
Created: July 2022
This project was created as an educational exercise to understand the inner workings of neural networks and optimization algorithms by implementing them from scratch.