This tutorial demonstrates how to implement a Transformer model using both TensorFlow and PyTorch. It covers all steps from data preprocessing to training and inference for a language translation task.
- Introduction
- Setup
- The Data
- Download and Prepare the Dataset
- Create a Dataset (`tf.data` for TensorFlow and a standard PyTorch `Dataset`)
- Text Preprocessing
- Standardization
- Text Vectorization
- Process the Dataset
- Model Components
- The Encoder
- The Attention Layer
- The Decoder
- Training
- Training the Model
- Inference
- Exporting the Model
- Optional
- Using a Dynamic Loop (TensorFlow only)
- Additional Resources
This tutorial provides side-by-side implementations of Transformers using TensorFlow and PyTorch, showcasing their unique features and APIs.
- TensorFlow: Inspired by the official TensorFlow Transformer tutorial.
- PyTorch: Implements similar concepts using PyTorch's flexible API.
- Environment Setup: Ensure you have TensorFlow or PyTorch installed based on the framework you plan to use.
- Follow the Sections: Start from data preparation, proceed through preprocessing, build the model, and train it.
- Experiment with Both Frameworks: Compare TensorFlow's `tf.data` pipeline and PyTorch's `Dataset` class to understand their respective advantages.
- TensorFlow: Provides dynamic loops and seamless data pipelines using `tf.data`.
- PyTorch: Offers greater flexibility and is highly customizable, with a focus on hands-on control.
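The PyTorch equivalent of the pipeline above is an explicit map-style `Dataset` paired with a `DataLoader`. A minimal sketch, again with made-up sentence pairs, showing the hands-on control PyTorch favors: you write `__len__` and `__getitem__` yourself, and the `DataLoader` handles batching.

```python
from torch.utils.data import Dataset, DataLoader


class TranslationPairs(Dataset):
    """Minimal map-style dataset over (source, target) sentence pairs."""

    def __init__(self, pairs):
        self.pairs = pairs

    def __len__(self):
        return len(self.pairs)

    def __getitem__(self, idx):
        return self.pairs[idx]  # a (source, target) tuple of strings


# Hypothetical example data, matching the tf.data sketch in spirit.
pairs = [("hello", "hola"), ("goodbye", "adios"), ("thanks", "gracias")]
dataset = TranslationPairs(pairs)

# DataLoader batches samples; its default collation groups the sources
# and targets into parallel lists per batch.
loader = DataLoader(dataset, batch_size=2, shuffle=False)
batches = list(loader)
```

In practice `__getitem__` would also tokenize and numericalize each pair, and a custom `collate_fn` would pad variable-length sequences.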
- TensorFlow Users: Visit the official TensorFlow tutorials for more.
- PyTorch Users: Explore the PyTorch documentation for further reading.
- François Chollet
- tensorflow.org
- pytorch.org