This repository contains all the Data Mining lab exercises and examples for UIU Data Mining Course. It demonstrates fundamental data mining concepts, preprocessing techniques, visualization, and machine learning models with hands-on Python examples.
Before running the code, ensure you have:
-
Python 3.x installed
-
Python libraries:
numpyβ Numerical operationspandasβ Handling tabular datamatplotlibβ Data visualizationscikit-learnβ Machine learning models
Install missing libraries with:
pip install numpy pandas matplotlib scikit-learnUIU-DataMining-Lab/
β
ββ README.md
ββ Social_Network_Ads.csv
ββ social_network_ads.csv
ββ tennis.csv
ββ Data1.csv
ββ DM-Code.ipynb # all code here in one file
-
Data preprocessing: Handling data before applying ML
- Selecting features
- Scaling values for consistent range
-
Slicing & indexing in pandas and numpy
-
Train/Test split to avoid overfitting
-
Random_state β Ensures reproducible splits
-
Logistic Regression β Binary classification
-
Model evaluation β Confusion matrix, classification metrics
- Missing Values Handling β Fill with mean/median/mode
- Noise Removal β Detecting and cleaning noisy data
- Clustering & K-Means β Grouping data without labels
- Decision Trees & Regression β Predicting outcomes
- Ensemble Learning & Random Forest β Combining multiple models
- Cross-Validation β Stratified K-Fold to avoid bias
- Clone this repository:
git clone https://github.com/TashinParvez/UIU-DataMining-Lab.git
cd UIU-DataMining-Lab- Place datasets in the folder (if not already present).
- Run the Python script:
python social_network_ads.py- Observe plots and console outputs for model evaluation.
Note: All original code and datasets were provided by the course faculty. The scripts in this repository include my modifications, experiments, and enhancements for learning purposes.
For any questions, suggestions, or feedback, please feel free to reach out to the repository maintainer.
Happy Learning! π