# data_pipeline_dbt_aws_demo

A lightweight end-to-end dbt demo pipeline using SQLite — perfect for learning, analytics, and AWS-style data transformations.
- 🚀 Project Overview
- 🧩 Data Flow
- 📁 Project Structure
- 🛠️ Setup & Run
- 📊 Models Summary
- ✅ Example Output
- 💡 Future Improvements
- 📚 Resources
## 🚀 Project Overview

This project simulates a small e-commerce dataset with customers, products, orders, and order items. It demonstrates how to use dbt for:

- 🗂️ Data ingestion (via `dbt seed`)
- 🧮 Data transformation (via SQL models)
- 📊 Aggregation and reporting (customer spend summaries)
- ✅ Data validation checks
## 🧩 Data Flow

CSV Seeds → dbt Seed → Staging Models → Transform Models → Summary Tables

Flow in this project:

```text
customers.csv, products.csv, orders.csv, order_items.csv
                      ↓
                  dbt seed
                      ↓
           my_orders_summary.sql
                      ↓
        customer_sales_summary.sql
```
## 📁 Project Structure

```text
data_pipeline_dbt_aws_demo/
│
├── seeds/
│   ├── customers.csv
│   ├── products.csv
│   ├── orders.csv
│   └── order_items.csv
│
├── models/
│   └── example/
│       ├── my_orders_summary.sql
│       ├── customer_sales_summary.sql
│       └── check_data_counts.sql
│
├── dbt_project.yml
└── README.md
```
## 🛠️ Setup & Run

- Install dependencies

  ```shell
  python -m venv venv
  source venv/bin/activate   # or venv\Scripts\activate on Windows
  pip install dbt-sqlite
  ```

- Run `dbt seed` (to load the CSVs into SQLite)

  ```shell
  dbt seed --full-refresh
  ```

- Run models (to build the transformations)

  ```shell
  dbt run
  ```

- List all built objects

  ```shell
  dbt ls
  ```

## 📊 Models Summary

| Model | Description |
|---|---|
| `check_data_counts` | Validates that all seed tables were loaded correctly. |
| `my_orders_summary` | Combines customers, orders, order items, and products into a detailed transactional dataset with quantity, category, and total amount per item. |
| `customer_sales_summary` | Aggregates total spend, total orders, and average order value per customer. |
## ✅ Example Output

### `check_data_counts`
| table_name | record_count |
|---|---|
| customers | 3 |
| order_items | 4 |
| orders | 3 |
| products | 3 |
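A count check like this can be written as a union of per-table counts over the seeds. The following is an illustrative sketch of `check_data_counts.sql`; the repository's actual SQL may differ:

```sql
-- Illustrative sketch of check_data_counts.sql (actual model may differ).
-- Counts the rows loaded into each seed table via dbt's ref() function.
select 'customers' as table_name, count(*) as record_count from {{ ref('customers') }}
union all
select 'order_items', count(*) from {{ ref('order_items') }}
union all
select 'orders', count(*) from {{ ref('orders') }}
union all
select 'products', count(*) from {{ ref('products') }}
order by table_name
```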
### `my_orders_summary`
| order_id | customer_id | customer_name | product_id | product_name | category | price | quantity | total_amount | order_date |
|---|---|---|---|---|---|---|---|---|---|
| 1 | 1 | Alice | 1 | Laptop | Electronics | 1200 | 1 | 1200 | 2024-04-01 |
| 1 | 1 | Alice | 3 | Chair | Furniture | 150 | 2 | 300 | 2024-04-01 |
| 2 | 2 | Bob | 2 | Phone | Electronics | 800 | 1 | 800 | 2024-04-03 |
| 3 | 1 | Alice | 1 | Laptop | Electronics | 1200 | 1 | 1200 | 2024-04-04 |
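A three-way join along these lines could produce the table above. The column names come from the example output; the join keys (`order_id`, `customer_id`, `product_id`) are assumptions about the seed schemas:

```sql
-- Illustrative sketch of my_orders_summary.sql (join keys are assumed).
select
    o.order_id,
    c.customer_id,
    c.customer_name,
    p.product_id,
    p.product_name,
    p.category,
    p.price,
    oi.quantity,
    p.price * oi.quantity as total_amount,  -- line total per item
    o.order_date
from {{ ref('order_items') }} oi
join {{ ref('orders') }}    o on oi.order_id   = o.order_id
join {{ ref('customers') }} c on o.customer_id = c.customer_id
join {{ ref('products') }}  p on oi.product_id = p.product_id
```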
### `customer_sales_summary`
| customer_id | customer_name | total_orders | total_spend | avg_order_value |
|---|---|---|---|---|
| 1 | Alice | 2 | 2700 | 1350.0 |
| 2 | Bob | 1 | 800 | 800.0 |
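An aggregation over `my_orders_summary` reproduces these numbers (e.g., Alice's `avg_order_value` is 2700 / 2 = 1350.0). A minimal sketch, assuming the model selects from `my_orders_summary`:

```sql
-- Illustrative sketch of customer_sales_summary.sql.
-- avg_order_value is total spend divided by distinct orders
-- (e.g., Alice: 2700 / 2 = 1350.0).
select
    customer_id,
    customer_name,
    count(distinct order_id)                           as total_orders,
    sum(total_amount)                                  as total_spend,
    sum(total_amount) * 1.0 / count(distinct order_id) as avg_order_value
from {{ ref('my_orders_summary') }}
group by customer_id, customer_name
order by customer_id
```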
## 💡 Future Improvements

This project serves as a foundation for modern analytics pipelines. Next steps could include:

- ☁️ Migrate from SQLite to Snowflake or Redshift for production-grade scalability
- 🔄 Enhance CI/CD workflows (e.g., run tests on pull requests, nightly dbt runs)
- 🧪 Add advanced dbt tests — referential integrity, freshness, and schema-level constraints
- 📈 Integrate dashboards (Power BI, Tableau, or Streamlit) to visualize dbt model outputs
- 🪶 Add incremental models to simulate real-world batch and streaming data
- 🔐 Parameterize environments (dev/prod profiles) for multi-environment orchestration
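For the incremental-models idea above, dbt's standard pattern looks like this (a sketch only — check that your adapter, e.g. `dbt-sqlite`, supports incremental materializations before relying on it):

```sql
{{ config(materialized='incremental') }}

select * from {{ ref('my_orders_summary') }}

{% if is_incremental() %}
  -- On incremental runs, only process orders newer than what's already built
  where order_date > (select max(order_date) from {{ this }})
{% endif %}
```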
✨ Created as a hands-on data pipeline demo using dbt + SQLite. 🚀