Skip to content

asitdave/SQL-Retail-Sales-Analysis

Repository files navigation

Retail Sales Analysis SQL Project

Project Overview

Project Title: Retail Sales Analysis
Level: Moderate
Database: p1_retail_db

This project is designed to demonstrate SQL skills and techniques typically used by data analysts to explore, clean, and analyze retail sales data. The project involves setting up a retail sales database, performing exploratory data analysis (EDA), and answering specific business questions through SQL queries. This project is ideal for those who are starting their journey in data analysis and want to build a solid foundation in SQL.

Objectives

  1. Set up a retail sales database: Create and populate a retail sales database with the provided sales data.
  2. Data Cleaning: Identify and remove any records with missing or null values.
  3. Exploratory Data Analysis (EDA): Perform basic exploratory data analysis to understand the dataset.
  4. Business Analysis: Use SQL to answer specific business questions and derive insights from the sales data.

Project Structure

1. Data Exploration & Cleaning

  • Record Count: Determine the total number of records in the dataset.
  • Customer Count: Find out how many unique customers are in the dataset.
  • Category Count: Identify all unique product categories in the dataset.
  • Null Value Check: Check for any null values in the dataset and delete records with missing data.

3. Data Analysis & Findings

The following SQL queries were developed to answer specific business questions:

  1. Retrieve all columns for sales made on '2022-11-05

  2. Retrieve all transactions where the category is 'Clothing' and the quantity sold is more than 4 in the month of Nov-2022

  3. Calculate the total sales (total_sale) for each category.

  4. Find the average age of customers who purchased items from the 'Beauty' category.

  5. Find all transactions where the total_sale is greater than 1000.

  6. Find the total number of transactions (transaction_id) made by each gender in each category.

  7. Calculate the average sale for each month. Find out best selling month in each year

  8. Find the top 5 customers based on the highest total sales

  9. Find the number of unique customers who purchased items from each category.

  10. Create each shift and number of orders (Example Morning <12, Afternoon Between 12 & 17, Evening >17)

Findings

  • Customer Demographics: The dataset includes customers from various age groups, with sales distributed across different categories such as Clothing and Beauty.
  • High-Value Transactions: Several transactions had a total sale amount greater than 1000, indicating premium purchases.
  • Sales Trends: Monthly analysis shows variations in sales, helping identify peak seasons.
  • Customer Insights: The analysis identifies the top-spending customers and the most popular product categories.

Reports

  • Sales Summary: A detailed report summarizing total sales, customer demographics, and category performance.
  • Trend Analysis: Insights into sales trends across different months and shifts.
  • Customer Insights: Reports on top customers and unique customer counts per category.

Conclusion

This project serves as a comprehensive introduction to SQL for data analysts, covering database setup, data cleaning, exploratory data analysis, and business-driven SQL queries. The findings from this project can help drive business decisions by understanding sales patterns, customer behavior, and product performance.

About

Used SQL to answer business questions from sales data

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published