Smart Text Classifier is a machine learning project focused on classifying SMS text messages as spam or ham (not spam). It leverages classical ML algorithmsโLogistic Regression, Random Forest, and Multinomial Naive Bayesโto compare performance and identify the most effective method for spam detection. The project addresses challenges in automated text classification and showcases clear evaluation metrics for each model.
Backend
- Frontend: Not applicable
- Backend: Jupyter Notebook / Python Script
- Database: Static CSV
smart-text-classifier/
โโโ data/
โ โโโ spam.csv
โโโ notebooks/
โ โโโ Smart Text Classifier.ipynb
โโโ Visuals/
โโโ README.md
- Spam vs. ham classification using multiple ML models
- Data preprocessing, label encoding, and train-test splitting
- Performance visualization (precision, recall, F1-score, confusion matrix)
- Results comparison across algorithms
- Selected classical ML algorithms for interpretability and simplicity
- Used default hyperparameters for baseline model performance
- Applied standard text preprocessing (lowercasing, punctuation removal)
- Focused on model evaluation instead of app deployment
- Assumed labeled dataset with "spam" and "ham" categories
Install dependencies and launch the notebook:
git clone https://github.com/Shaileshahire06/Smart-Text-Classifier.git
cd smart-text-classifier
pip install -r requirements.txt
jupyter notebookStep-by-step:
- Load and preprocess the data
- Train models using provided notebook
- Evaluate and visualize model performance
*No login or credentials
None โ analysis is entirely local using SQL + Pandas
Not applicable โ this is a non-service-based analytical project
- Python: Data analysis with Pandas
- NumPy / Pandas โ Data manipulation and preprocessing
- Scikit-learn โ Machine learning algorithms and model evaluation
- Matplotlib / Seaborn โ Data visualization and analysis
- Jupyter Notebook: Code, commentary, and charts