A Fast Multimodal Data Loading Pipeline (FastDLP)

Training long-video generation models or large-scale robot learning models presents significant data loading challenges. Whether the model is an in-context robot learning model that processes thousands of frames per sequence or a video generation model that synthesizes extended temporal horizons, the bottleneck often lies in loading and processing these massive multimodal datasets efficiently. Prior data loading approaches struggle to keep pace with modern GPU training speeds when handling hours of synchronized video, proprioception, and action data.

Overview

The objective of FastDLP is to maximize data loading throughput for multimodal data, with a particular focus on handling long video sequences efficiently. The pipeline achieves high performance through:

Optimized video frame loading using parallel JPEG reading and decoding
Efficient batch collation with minimal memory copies
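
The two techniques above can be illustrated with a minimal sketch. This is not FastDLP's actual implementation: `load_sequence`, its parameters, and the `decode` hook are hypothetical names chosen for illustration. The sketch assumes every frame decodes to the same number of raw bytes (for example, H * W * 3 for uint8 RGB), reads and decodes frames on a thread pool, and has each worker write directly into its own slice of one preallocated buffer, so no per-frame list has to be stacked afterwards.

```python
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path
from typing import Callable, Sequence


def load_sequence(
    paths: Sequence[Path],
    decode: Callable[[bytes], bytes],
    frame_nbytes: int,
    max_workers: int = 8,
) -> bytearray:
    """Read and decode frames in parallel into one preallocated buffer.

    `decode` is a hypothetical hook that maps encoded file bytes to raw
    pixel bytes of exactly `frame_nbytes`.
    """
    # One allocation for the whole sequence.
    buffer = bytearray(frame_nbytes * len(paths))

    with memoryview(buffer) as view:

        def load_one(index: int) -> None:
            pixels = decode(paths[index].read_bytes())
            if len(pixels) != frame_nbytes:
                raise ValueError(
                    f"frame {index}: expected {frame_nbytes} bytes, got {len(pixels)}"
                )
            # Each worker fills a disjoint slice, so no locking is needed.
            start = index * frame_nbytes
            view[start:start + frame_nbytes] = pixels

        with ThreadPoolExecutor(max_workers=max_workers) as pool:
            # Consuming the iterator re-raises any worker exception here.
            list(pool.map(load_one, range(len(paths))))

    return buffer
```

In practice, `decode` would wrap a real JPEG decoder. Threads help here because file reads release the GIL, and C-backed decoders typically do as well. If NumPy is available, `numpy.frombuffer(buffer, dtype=numpy.uint8)` followed by a reshape to `(T, H, W, 3)` gives an array view over the same memory without another copy.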

With high enough throughput, we hope to train models at 100% GPU utilization on large-scale robot datasets containing hundreds of hours of video. The pipeline is optimized in particular for sequences of 1000+ timesteps, which are common in robotic manipulation tasks.

Happy to discuss and collaborate on the design of this system! Reach me via Issues or email: max.fu.letian@berkeley.edu

Installation

pip install -e .
