TinyTorch is a lightweight deep learning training framework implemented from scratch in C++.
For more details, please refer to my blog post: *Write a nn training framework from scratch*
- PyTorch-Style API: Naming conventions similar to PyTorch's (`Tensor`, `Functions`, `nn.Module`, `Optimizer`); a usage sketch follows this list.
- Pure C++ Implementation: No dependency on external deep learning libraries.
- CPU & CUDA Support: Runs on both CPU and CUDA-enabled GPUs.
- Mixed Precision: Supports FP16, FP32, BF16.
- Distributed: Multi-machine, multi-GPU training & inference.
- LLM Inference: Supports inference for Llama/Qwen/Mistral models: https://github.com/keith2018/TinyGPT
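To give a taste of the API style, here is a minimal, hypothetical training-loop sketch. It only illustrates the PyTorch-like surface: the header name, the `tinytorch` namespace, and every signature below are assumptions modeled on PyTorch, not TinyTorch's verified interface.

```cpp
// Hypothetical sketch of PyTorch-style usage. Header, namespace, and all
// signatures are assumptions modeled on PyTorch, not the verified API.
#include "TinyTorch.h"  // assumed umbrella header

using namespace tinytorch;  // assumed namespace

int main() {
  nn::Linear fc(4, 2);                           // a single linear module
  optim::SGD opt(fc.parameters(), /*lr=*/0.1f);  // optimizer over its params

  Tensor x = Tensor::randn({8, 4});       // batch of 8 random inputs
  Tensor target = Tensor::randn({8, 2});  // matching random targets

  for (int step = 0; step < 100; step++) {
    Tensor pred = fc(x);                      // forward pass
    Tensor loss = nn::mseLoss(pred, target);  // loss from the module list
    opt.zeroGrad();                           // clear stale gradients
    loss.backward();                          // reverse-mode autograd
    opt.step();                               // gradient descent update
  }
  return 0;
}
```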
Implemented components, grouped by category:

| Category | Components |
|---|---|
| Activation | `relu`, `gelu`, `silu`, `softmax`, `logSoftmax` |
| Math | `add`, `sub`, `mul`, `div`, `matmul`, `sin`, `cos`, `sqrt`, `pow`, `maximum`, `minimum` |
| Comparison & logic | `lt`, `le`, `gt`, `ge`, `eq`, `ne`, `logicNot`, `logicAnd`, `logicOr` |
| Reduce | `min`, `argmin`, `max`, `argmax`, `sum`, `mean`, `var` |
| Transform | `reshape`, `view`, `permute`, `transpose`, `flatten`, `unflatten`, `squeeze`, `unsqueeze`, `split`, `concat`, `stack`, `hstack`, `vstack`, `narrow`, `topk`, `sort`, `cumsum`, `gather`, `scatter` |
| NN modules | `linear`, `dropout`, `maxPool2d`, `conv2d`, `embedding`, `layerNorm`, `rmsNorm`, `sdpAttention`, `mseLoss`, `nllLoss` |
| Optimizers | `SGD`, `Adagrad`, `RMSprop`, `AdaDelta`, `Adam`, `AdamW` |
| Data | `Dataset`, `DataLoader`, `data.Transform` |
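To make the tables concrete, a short hypothetical fragment combining ops from several categories (same assumed API as the sketch above; free-function vs. method spelling and the shape syntax are guesses):

```cpp
// Hypothetical combination of the listed ops; spelling and the shape
// API are assumptions, not the verified interface.
Tensor a = Tensor::randn({2, 3});  // random 2x3 tensor
Tensor b = a.transpose(0, 1);      // transform: shape becomes {3, 2}
Tensor c = matmul(a, b);           // math: {2, 3} x {3, 2} -> {2, 2}
Tensor d = relu(c);                // activation, element-wise
Tensor s = d.sum();                // reduce: scalar total
```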
TinyTorch's automatic differentiation (AD) is implemented by building a computation graph. Each operation on a `Tensor` is represented by a `Function` object, which is responsible for both the forward and backward passes. The `Function` nodes are connected via a `nextFunctions` field, forming the dependency graph. During the `backward()` call, the framework traverses this graph in reverse order, computing and propagating gradients using the chain rule.
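The mechanism is easiest to see at toy scale. Below is a self-contained, compilable sketch of the same idea for scalar values: each op records a `Function` node whose `nextFunctions` links form the graph, and `backward()` walks it applying the chain rule. This is not TinyTorch's actual code; apart from the `nextFunctions`/`backward()` terminology, every type and name is invented for illustration.

```cpp
// Toy, self-contained sketch of graph-based reverse-mode AD. This is NOT
// TinyTorch's actual code: everything except the `nextFunctions` and
// `backward()` terminology is invented here for illustration.
#include <cstdio>
#include <functional>
#include <memory>
#include <vector>

struct Function;

// Scalar stand-in for Tensor: data, accumulated gradient, and a pointer
// to the Function that produced it (null for leaf values).
struct Value {
  float data = 0.f;
  float grad = 0.f;
  std::shared_ptr<Function> gradFn;
};
using ValuePtr = std::shared_ptr<Value>;

// One node of the computation graph. `nextFunctions` links to the producers
// of this node's inputs, forming the dependency graph described above.
struct Function {
  std::vector<ValuePtr> inputs;
  std::vector<std::shared_ptr<Function>> nextFunctions;
  std::function<void(float)> backwardStep;  // pushes dL/dout to the inputs
};

ValuePtr makeLeaf(float x) {
  auto v = std::make_shared<Value>();
  v->data = x;
  return v;
}

// Wires up the graph node for a binary op.
ValuePtr makeOp(const ValuePtr& a, const ValuePtr& b, float out,
                std::function<void(float)> backwardStep) {
  auto v = std::make_shared<Value>();
  v->data = out;
  auto fn = std::make_shared<Function>();
  fn->inputs = {a, b};
  if (a->gradFn) fn->nextFunctions.push_back(a->gradFn);
  if (b->gradFn) fn->nextFunctions.push_back(b->gradFn);
  fn->backwardStep = std::move(backwardStep);
  v->gradFn = fn;
  return v;
}

// Forward computes the result; the recorded closure applies the chain rule.
ValuePtr mul(const ValuePtr& a, const ValuePtr& b) {
  return makeOp(a, b, a->data * b->data, [a, b](float g) {
    a->grad += g * b->data;  // d(a*b)/da = b
    b->grad += g * a->data;  // d(a*b)/db = a
  });
}

ValuePtr add(const ValuePtr& a, const ValuePtr& b) {
  return makeOp(a, b, a->data + b->data, [a, b](float g) {
    a->grad += g;  // d(a+b)/da = 1
    b->grad += g;  // d(a+b)/db = 1
  });
}

// Walks the graph from the output, propagating gradients. This simple
// recursion assumes a tree-shaped graph; a real framework visits nodes in
// reverse topological order so shared subgraphs are handled correctly.
void backward(const ValuePtr& out) {
  out->grad = 1.f;
  std::function<void(const ValuePtr&)> visit = [&](const ValuePtr& v) {
    if (!v->gradFn) return;
    v->gradFn->backwardStep(v->grad);
    for (const auto& in : v->gradFn->inputs) visit(in);
  };
  visit(out);
}

int main() {
  auto x = makeLeaf(2.f), w = makeLeaf(3.f), b = makeLeaf(1.f);
  auto y = add(mul(w, x), b);  // y = w*x + b = 7
  backward(y);
  std::printf("y=%g dy/dw=%g dy/dx=%g dy/db=%g\n",
              y->data, w->grad, x->grad, b->grad);  // 7 2 3 1
  return 0;
}
```

Compiled standalone (e.g. `g++ -std=c++17 sketch.cpp`), this prints `y=7 dy/dw=2 dy/dx=3 dy/db=1`, matching the analytic derivatives of y = w*x + b.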
- CMake
- A compiler with C++17 support (or newer)
- CUDA Toolkit 11.0+ (optional)
Build:

```bash
mkdir build
cmake -B ./build -DCMAKE_BUILD_TYPE=Release
cmake --build ./build --config Release
```

Run the demo:

```bash
cd demo/bin
./TinyTorch_demo
```

Run the tests:

```bash
cd build
ctest
```

This code is licensed under the MIT License (see LICENSE).
