This project is a complete Machine Learning pipeline built in Google Colab. It trains a Random Forest model to classify iris flowers into 3 species (Setosa, Versicolor, and Virginica) using sepal & petal features.
The dataset relies on measuring the length and width of the flower's sepals and petals.
π Features
- Data Loading
- Exploratory Data Analysis (EDA)
- Visualization (pairplots, heatmaps)
- Train/Test Split
- Random Forest Classifier
- Accuracy Evaluation
- Confusion Matrix
- Save & Load Model (
.pkl)
π Tech Stack
- Python
- Pandas
- NumPy
- Seaborn
- Matplotlib
- Scikit-Learn
- Jupyter/Google Colab
π Dataset The Iris dataset is a built-in dataset in Seaborn containing 150 samples of the three species.
π Model Accuracy Achieved ~95β100% accuracy depending on the train/test split configuration.
π How to Run the Notebook
- Open
iris_ml_project.ipynbin Google Colab - Run all cells
- Modify parameters and try your own samples
π Author Pulkit Meena
