Email Spam Classification using Machine Learning

📌 Overview

This project classifies emails as Spam or Not Spam using structured email campaign data. Machine learning models are trained using numerical and categorical features derived from email campaigns.

🎯 Problem Statement

Email spam impacts communication efficiency and user trust. The goal of this project is to predict whether an email is spam based on campaign-related attributes.

📊 Dataset Description

The dataset contains campaign-level features related to emails.

Input Features

Email_Type
Subject_Hotness_Score
Email_Source_Type
Customer_Location
Email_Campaign_Type
Total_Past_Communications
Time_Email_sent_Category
Word_Count
Total_Links
Total_Images

Target Variable

Email_Status
- 0 → Not Spam
- 1 → Spam

🛠️ Methodology

Data inspection and preprocessing
Handling categorical and numerical features
Feature-target separation
Train-test split
Classification model training
Model evaluation

🤖 Models Used

Logistic Regression
Naive Bayes

📈 Evaluation Metrics

Accuracy
Precision
Recall
F1-score

🔍 Key Insights

Subject hotness score impacts spam classification
Link and image count influence email status
Simple classifiers perform well on structured data

🚀 Future Enhancements

Advanced feature engineering
Try ensemble classifiers
Apply explainability (SHAP)
Deploy as a web application

🧠 Learnings

Binary classification on structured datasets
Handling categorical campaign features
Model evaluation for imbalance scenarios

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
data		data
notebook		notebook
LICENSE		LICENSE
README.md		README.md
📄 .gitignore		📄 .gitignore
📄 requirements.txt		📄 requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Email Spam Classification using Machine Learning

📌 Overview

🎯 Problem Statement

📊 Dataset Description

Input Features

Target Variable

🛠️ Methodology

🤖 Models Used

📈 Evaluation Metrics

🔍 Key Insights

🚀 Future Enhancements

🧠 Learnings

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Email Spam Classification using Machine Learning

📌 Overview

🎯 Problem Statement

📊 Dataset Description

Input Features

Target Variable

🛠️ Methodology

🤖 Models Used

📈 Evaluation Metrics

🔍 Key Insights

🚀 Future Enhancements

🧠 Learnings

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages