Skip to content

Hands-on Data Mining lab exercises and Python projects for UIU CSE students, including preprocessing, visualization, and classification models.

Notifications You must be signed in to change notification settings

TashinParvez/UIU-DataMining-Lab

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

2 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸ“š UIU Data Mining Lab

This repository contains all the Data Mining lab exercises and examples for UIU Data Mining Course. It demonstrates fundamental data mining concepts, preprocessing techniques, visualization, and machine learning models with hands-on Python examples.


Prerequisites

Before running the code, ensure you have:

  • Python 3.x installed

  • Python libraries:

    • numpy β†’ Numerical operations
    • pandas β†’ Handling tabular data
    • matplotlib β†’ Data visualization
    • scikit-learn β†’ Machine learning models

Install missing libraries with:

pip install numpy pandas matplotlib scikit-learn

Repository Structure

UIU-DataMining-Lab/
β”‚
β”œβ”€ README.md
β”œβ”€ Social_Network_Ads.csv
β”œβ”€ social_network_ads.csv
β”œβ”€ tennis.csv
β”œβ”€ Data1.csv
β”œβ”€ DM-Code.ipynb          # all code here in one file

Key Concepts in this Lab

  • Data preprocessing: Handling data before applying ML

    • Selecting features
    • Scaling values for consistent range
  • Slicing & indexing in pandas and numpy

  • Train/Test split to avoid overfitting

  • Random_state β†’ Ensures reproducible splits

  • Logistic Regression β†’ Binary classification

  • Model evaluation β†’ Confusion matrix, classification metrics


Other Labs (Coming / Covered)

  • Missing Values Handling β†’ Fill with mean/median/mode
  • Noise Removal β†’ Detecting and cleaning noisy data
  • Clustering & K-Means β†’ Grouping data without labels
  • Decision Trees & Regression β†’ Predicting outcomes
  • Ensemble Learning & Random Forest β†’ Combining multiple models
  • Cross-Validation β†’ Stratified K-Fold to avoid bias

How to Run

  1. Clone this repository:
git clone https://github.com/TashinParvez/UIU-DataMining-Lab.git
cd UIU-DataMining-Lab
  1. Place datasets in the folder (if not already present).
  2. Run the Python script:
python social_network_ads.py
  1. Observe plots and console outputs for model evaluation.

Note: All original code and datasets were provided by the course faculty. The scripts in this repository include my modifications, experiments, and enhancements for learning purposes.

🌐 Contact

For any questions, suggestions, or feedback, please feel free to reach out to the repository maintainer.

Tashin Parvez Email Tashin Parvez LinkedIn Tashin Parvez Facebook Tashin Parvez Hashnode

Happy Learning! πŸš€

About

Hands-on Data Mining lab exercises and Python projects for UIU CSE students, including preprocessing, visualization, and classification models.

Topics

Resources

Stars

Watchers

Forks