Skip to content

Latest commit

Β 

History

History
41 lines (31 loc) Β· 2.29 KB

File metadata and controls

41 lines (31 loc) Β· 2.29 KB

Week 3: Working with DataFrames II – Cleaning & Transforming Data

πŸ“š Overview

This week you'll deepen your skills with Pandas, focusing on advanced data cleaning and transformation techniques. You'll learn to combine datasets, work with time series, and perform powerful aggregations to answer complex questions about social issues.

🎯 Learning Objectives

By the end of this week, you will be able to:

  • Join, merge, and concatenate DataFrames to combine datasets
  • Work with time series data (dates, resampling, rolling windows)
  • Group and aggregate data to summarize patterns
  • Clean and transform data for analysis (handling duplicates, outliers, reshaping)
  • Apply advanced filtering and sorting
  • Use Pandas to answer questions about trends and relationships in social impact data

πŸŽ“ Session Resources

πŸ—οΈ Mini-Deliverable

Assignment: Create a notebook that demonstrates joining, merging, and transforming real-world datasets (e.g., education + health), including time series analysis and groupby operations.

Requirements:

  1. Load two or more related datasets (e.g., education and health indicators)
  2. Merge or join DataFrames to combine information
  3. Clean and transform data (handle duplicates, missing values, reshape as needed)
  4. Work with time series (parse dates, resample, plot trends)
  5. Group and aggregate data to answer at least 3 questions
  6. Document your process and findings with clear explanations

πŸ”— Additional Resources

Pandas Documentation


Next Week: Week 4: Data Visualization I: Telling Stories with Plotly

Previous Week: Week 2: Working with DataFrames (Pandas Basics)