Skip to content

This project understands how the student's performance (test scores) is affected by other variables such as Gender, Ethnicity, Parental level of education, Lunch and Test preparation course

Notifications You must be signed in to change notification settings

shamayitadatta/Student-performance-prediction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 

Repository files navigation

Student-performance-prediction

Problem statement

  • This project understands how the student's performance (test scores) is affected by other variables such as Gender, Ethnicity, Parental level of education, Lunch and Test preparation course.

Steps followed for this Machine Learning Project

  • Understanding the Problem Statement
  • Dataset
  • Data Checks to perform
  • Exploratory Data Analysis
  • Data Pre-Processing
  • Model Training
  • Selecting the best model

Sequence of steps followed until Deployment

  • Github and Code Set Up
  • Project Structure, Logging And Exception Handling
  • Project Problem Statement, EDA And Model Training
  • Data Ingestion
  • Data Transformation using Pipelines
  • Model Training and Model Evaluating Component
  • Model Hyper Parameter Tuning
  • Created Prediction Pipeline using Flask Web App

Dataset

Dataset information

  • gender : sex of students -> (Male/female)
  • race/ethnicity : ethnicity of students -> (Group A, B,C, D,E)
  • parental level of education : parents' final education ->(bachelor's degree, some college, master's degree, associate's degree, high school)
  • lunch : having lunch before test (standard or free/reduced)
  • test preparation course : complete or not complete before test
  • math score
  • reading score
  • writing score

Data Checks Performed

  • Missing values
  • Duplicates
  • data type
  • the number of unique values of each column
  • statistics of data set
  • various categories present in the different categorical column

Exploratory Data Analysis (EDA)

Final Conclusions from EDA

  • Student's Performance is related with lunch, race, parental level education
  • Females lead in pass percentage and also are top-scorers
  • Student's Performance is not much related with test preparation course
  • Finishing preparation course is benefitial.

Models used for Training

  • Linear Regression
  • Lasso
  • Ridge
  • K-Neighbours Regressor
  • Decision Tree
  • Random Forest Regressor
  • XGB Regressor
  • CatBoosting Regressor
  • AdaBoost Regressor

Run Locally in your computer

Clone the project

Go to the project directory

bash cd my-project

After setting up environment and installing packages Run

bash python app.py

Screenshots

Screenshot 2024-01-29 at 1 59 49 PM Screenshot 2024-01-29 at 2 00 22 PM

About

This project understands how the student's performance (test scores) is affected by other variables such as Gender, Ethnicity, Parental level of education, Lunch and Test preparation course

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published