Fast-SCNN (PyTorch)

PyTorch implementation of Fast-SCNN for fast semantic segmentation, based on the paper below.

Paper

Fast-SCNN: Fast Semantic Segmentation Network

Repository layout

Path	Description
`models/`	Fast-SCNN model definition
`loss/`	Training losses (e.g. Dice)
`utils/`	Dataset, transforms, training helpers
`paper/`	Paper figures (e.g. architecture diagram)
`train.py`	Training entry point
`test.py`	Example inference on images / video

Architecture

Table 1 — Network structure

Input	Block	t	c	n	s
1024 × 2048 × 3	Conv2D	-	32	1	2
512 × 1024 × 32	DSConv	-	48	1	2
256 × 512 × 48	DSConv	-	64	1	2
128 × 256 × 64	bottleneck	6	64	3	2
64 × 128 × 64	bottleneck	6	96	3	2
32 × 64 × 96	bottleneck	6	128	3	1
32 × 64 × 128	PPM	-	128	-	-
32 × 64 × 128	FFM	-	128	-	-
128 × 256 × 128	DSConv	-	128	2	1
128 × 256 × 128	Conv2D	-	nums of classes	1	1

Table 2 — Operator notation

Input	Operator	Output
h × w × c	Conv2D 1/1, f	h × w × tc
h × w × tc	DWConv 3/s, f	h/s × w/s × tc
h/s × w/s × tc	Conv2D 1/1, −	h/s × w/s × c′

Usage

Environment

Create a virtual environment (recommended), then install dependencies:

pip install -r requirements.txt

Note: requirements.txt pins older PyTorch/CUDA builds. Adjust torch / torchvision to match your CUDA or CPU setup if install fails.

Dataset layout

Point --data-root at the folder that contains your images, labels, and list files. Each line in train.txt / val.txt is: image path and label path, both relative to data-root (space-separated).

Example:

my_dataset/
  train.txt
  val.txt
  img/
    image1.jpg
    image2.jpg
  label/
    image1.png
    image2.png

train.txt / val.txt (paths relative to my_dataset/):

img/image1.jpg label/image1.png
img/image2.jpg label/image2.png

Training

Single GPU (default device is CUDA if available, else CPU):

python train.py --data-root /path/to/my_dataset --num-classes 21

Common options:

Option	Default	Description
`--data-root`	`voc2012`	Dataset root
`--train-list`	`<data-root>/train.txt`	Training list file
`--val-list`	`<data-root>/val.txt`	Validation list file
`--num-classes`	`21`	Class count (e.g. 21 for VOC)
`--epochs`	`2000`	Total epochs
`--batch-size`	`160`	Train batch size
`--base-lr`	`0.01`	Initial learning rate
`--input-h`, `--input-w`	`320`, `320`	Crop size
`--save-dir`	`save`	Checkpoint directory
`--save-freq`	`20`	Save every N epochs
`--log-dir`	`runs`	TensorBoard logs
`--resume`	—	Checkpoint path to continue training

Resume from a checkpoint:

python train.py --data-root /path/to/my_dataset --resume save/train_100.pth

TensorBoard:

tensorboard --logdir runs

Multi-GPU with DataParallel:

python train.py --data-root /path/to/my_dataset --multigpu

Distributed training (typical launcher; adjust for your PyTorch version):

python -m torch.distributed.launch --nproc_per_node=N train.py --data-root /path/to/my_dataset --dist

Pretrained weights (optional)

A VOC2012-oriented checkpoint trained at 540×540 is shared for faster initialization (Chinese README link; extract password from original mirror if needed):

百度网盘 · password: v98k

Inference (`test.py`)

test.py loads weights from WEIGHTS_PATH (default save/train_1999.pth), uses CUDA, and expects class count / resolution consistent with the script (MDL_CLS, seg.hw). Before running:

Place a compatible checkpoint at save/train_1999.pth (or edit WEIGHTS_PATH in test.py).
For video, ensure test.mp4 exists or change the path in if __name__ == '__main__'.
Create an output folder if required (e.g. ./result/ for processVideo).

Then run:

python test.py

Adapt testImg('your_image.jpg') vs processVideo('your_video.mp4') in the __main__ block for image vs video.

TODO

Support

If you want to support this project:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fast-SCNN (PyTorch)

Paper

Repository layout

Architecture

Table 1 — Network structure

Table 2 — Operator notation

Usage

Environment

Dataset layout

Training

Pretrained weights (optional)

Inference (`test.py`)

TODO

Support

Star History

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
loss		loss
models		models
paper		paper
utils		utils
.gitignore		.gitignore
AGENTS.md		AGENTS.md
LICENSE		LICENSE
readme.md		readme.md
requirements.txt		requirements.txt
test.py		test.py
train.py		train.py

Folders and files

Latest commit

History

Repository files navigation

Fast-SCNN (PyTorch)

Paper

Repository layout

Architecture

Table 1 — Network structure

Table 2 — Operator notation

Usage

Environment

Dataset layout

Training

Pretrained weights (optional)

Inference (test.py)

TODO

Support

Star History

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Inference (`test.py`)

Packages