This repository contains an exploratory data analysis (EDA) of user behavior, product data, and session activity in an e-commerce environment.
The goal of this project is to understand how users interact with an online store by analyzing three main datasets:
user.csvβ demographic and basic info of usersproduct.csvβ product metadata and category infosession.csvβ user browsing and session activity
The project includes:
- Data cleaning and preprocessing
- Exploratory visualizations (matplotlib, seaborn, folium)
- Encoding and feature transformation
- Geospatial analysis using
geopandasandfolium
- Python 3.x
- Jupyter Notebook
- pandas, numpy
- matplotlib, seaborn
- scikit-learn
- geopandas, folium
- Distribution and uniqueness of product, session, and user data
- Detection of missing values and basic data imputation
- Label encoding for categorical features
- Initial steps toward segmentation and predictive modeling
- Mapping user behavior geographically using Folium