This project segments dash-cam video frames using pretrained UNET or DeepLabV3Plus models and renders the output as a GIF for quick review. It accepts either a local video file or a YouTube URL as input.
The dataset used for this project can be found on Kaggle.
- Python 3.9+
- Dependencies: pip install -r requirements.txt

python -m venv .venv
source .venv/bin/activate
python -m pip install --upgrade pip
python -m pip install -r requirements.txt

# Segment a YouTube video
python main.py --youtube "https://youtu.be/h1u5OzTdbpc"
# Segment a local file
python main.py --file ./video/demo_video.mp4
# Optional: verbose logs
python main.py --file ./video/demo_video.mp4 --verbose

- Obtained a mean_iou score of 0.65907 on test data.
- Segmented frames are written to processed_frames/.
- Final GIF is written to images/.
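
For readers unfamiliar with the metric, here is a minimal sketch of how a mean IoU (intersection over union) score can be computed from flattened label masks. It is an illustration only: the `mean_iou` function below, its signature, and the choice to skip classes absent from both masks are assumptions, not the project's actual evaluation code, which may average differently.

```python
def mean_iou(y_true, y_pred, num_classes):
    """Average per-class IoU over classes that appear in either mask.

    y_true and y_pred are flat sequences of integer class labels,
    one entry per pixel. Hypothetical helper, not the project's code.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    intersection = [0] * num_classes  # pixels where both masks agree on class c
    true_count = [0] * num_classes    # pixels labelled c in the ground truth
    pred_count = [0] * num_classes    # pixels labelled c in the prediction

    for t, p in zip(y_true, y_pred):
        true_count[t] += 1
        pred_count[p] += 1
        if t == p:
            intersection[t] += 1

    ious = []
    for c in range(num_classes):
        union = true_count[c] + pred_count[c] - intersection[c]
        if union > 0:  # assumption: skip classes absent from both masks
            ious.append(intersection[c] / union)

    return sum(ious) / len(ious) if ious else 0.0


if __name__ == "__main__":
    # Tiny made-up example with three classes and six pixels.
    truth = [0, 0, 1, 1, 2, 2]
    prediction = [0, 1, 1, 1, 2, 0]
    print(mean_iou(truth, prediction, num_classes=3))
```

A score of 1.0 would mean every pixel is labelled correctly for every class, so the reported 0.65907 indicates that, on average, a bit under two thirds of each class's combined predicted and true area overlaps.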

