Deep learning architecture comparison using digit recognition as the learning problem.
The repository makes use of the following architectures deep learning architectures:
- Feedforward Networks (MLPs)
- Convolutional Neural Networks (CNNs)
- Transformer-based Architectures (ViT)
- Learning Rate Scheduler
- Gradient Accumulation
- Gradient Checkpointing
- Layer Freezing
- Knowledge Distillation