This repo contains the resources used for 2 short courses presented at SDSS 2025:
- Building Containerized Applications for Data Science
- Accelerating Data Science Workflows with Kubernetes
The goal of these resources is to enable Data Science students, practitioners, experts, and researchers to build familiarity and competency with using Docker and Kubernetes for Data Science workflows
All of the compute used for these tutorials was performed on the National Research Platform
Content Authors:
J. Alex Hurt, University of Missouri
- Containerization
- Slides - The slides used for the course
- Walkthough Markdown Files (01-08.md) - Markdown files walking through the hands-on concepts
- Examples
- Building Custom Containers - Example of building a simple Python Application for Image Processing
- Dashboard - Example of a Data Science Dashboard in Docker
- Custom Jupyter Containers - Example of Building a Custom Container with Jupyter Capabilities
- Kubernetes
- Slides - The slides used for the course
- Jupyter Notebook Walkthroughs
- Jupyter Notebook Examples