Skip to content

This repository contains documentation for a machine learning project focused on classifying text messages as spam or ham. The project leverages classical ML algorithms such as Logistic Regression, Random Forest, and Multinomial Naive Bayes for predictive modeling.

Notifications You must be signed in to change notification settings

Shaileshahire06/Smart-Text-Classifier

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

6 Commits
ย 
ย 
ย 
ย 

Repository files navigation

๐Ÿงญ Introduction

Smart Text Classifier is a machine learning project focused on classifying SMS text messages as spam or ham (not spam). It leverages classical ML algorithmsโ€”Logistic Regression, Random Forest, and Multinomial Naive Bayesโ€”to compare performance and identify the most effective method for spam detection. The project addresses challenges in automated text classification and showcases clear evaluation metrics for each model.

๐Ÿ› ๏ธ Project Type

Backend

๐Ÿš€ Deployed App

  • Frontend: Not applicable
  • Backend: Jupyter Notebook / Python Script
  • Database: Static CSV

๐Ÿ“ Directory Structure

smart-text-classifier/
โ”œโ”€โ”€ data/
โ”‚ โ”œโ”€โ”€ spam.csv
โ”œโ”€โ”€ notebooks/
โ”‚ โ”œโ”€โ”€ Smart Text Classifier.ipynb
โ”œโ”€โ”€ Visuals/
โ”œโ”€โ”€ README.md

๐ŸŽฅ Video Walkthrough of the project

๐ŸŽฅ Video Walkthrough of the codebase

โœจ Features

  • Spam vs. ham classification using multiple ML models
  • Data preprocessing, label encoding, and train-test splitting
  • Performance visualization (precision, recall, F1-score, confusion matrix)
  • Results comparison across algorithms

๐ŸŽฏ Design Decisions or Assumptions

  • Selected classical ML algorithms for interpretability and simplicity
  • Used default hyperparameters for baseline model performance
  • Applied standard text preprocessing (lowercasing, punctuation removal)
  • Focused on model evaluation instead of app deployment
  • Assumed labeled dataset with "spam" and "ham" categories

๐Ÿงช Installation & Getting Started

Install dependencies and launch the notebook:

git clone https://github.com/Shaileshahire06/Smart-Text-Classifier.git
cd smart-text-classifier
pip install -r requirements.txt
jupyter notebook

๐Ÿ“Œ Usage

Step-by-step:

  1. Load and preprocess the data
  2. Train models using provided notebook
  3. Evaluate and visualize model performance

๐Ÿ” Credentials

*No login or credentials

๐ŸŒ APIs Used

None โ€“ analysis is entirely local using SQL + Pandas

๐Ÿ“ฎ API Endpoints

Not applicable โ€“ this is a non-service-based analytical project

๐Ÿงฐ Technology Stack

  • Python: Data analysis with Pandas
  • NumPy / Pandas โ€” Data manipulation and preprocessing
  • Scikit-learn โ€” Machine learning algorithms and model evaluation
  • Matplotlib / Seaborn โ€” Data visualization and analysis
  • Jupyter Notebook: Code, commentary, and charts

About

This repository contains documentation for a machine learning project focused on classifying text messages as spam or ham. The project leverages classical ML algorithms such as Logistic Regression, Random Forest, and Multinomial Naive Bayes for predictive modeling.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published