This project builds a custom Linear Regression model from scratch (no scikit-learn regressors!) and uses it to predict a student's subject scores based on their career aspiration.
It's not just a basic regression: the model also gives:
- Predicted scores for all subjects (Math, Physics, Chemistry, etc.)
- A confidence rating showing how sure the model is
- Suggested weekly study hours & average effort level for that career
- Exportable trained model for reuse
This system uses a multi-output linear regression approach implemented from scratch (supporting both Normal Equation and Gradient Descent).
The input feature is the student's career aspiration, which is one-hot encoded and used to predict multiple subject scores simultaneously.
Once trained, the model can:
- Suggest how a student might perform in each subject if they pursue a certain career path.
- Estimate how much self-study time students in that career category typically put in.
- Express confidence in its predictions (based on error variance).
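Since the career label is the only input feature, the encoding step is small. Below is a minimal sketch of the one-hot step, assuming a hypothetical career list (the names are illustrative, not taken from the project's dataset):

```python
import numpy as np

# Hypothetical career list -- illustrative, not the project's actual categories.
careers = ["Data Scientist", "Doctor", "Engineer"]

def one_hot(career: str) -> np.ndarray:
    """Encode a career label as a one-hot row vector (1 x num_careers)."""
    vec = np.zeros((1, len(careers)))
    vec[0, careers.index(career)] = 1.0
    return vec

x = one_hot("Doctor")
print(x.shape)  # (1, 3) -- one row, one column per career category
```

Each encoded row then maps to a full vector of subject scores, which is what makes the regression multi-output.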
| Feature | Description |
|---|---|
| Custom Linear Regression | Implemented from scratch using NumPy, supporting both normal equation and gradient descent. |
| Multi-output Support | Predicts multiple subject scores at once (Math, Physics, Chemistry, etc.) |
| Confidence Scoring | Estimates how reliable each prediction is based on similar data points. |
| Career Suggestion List | Displays all available career aspiration options to the user for easy selection. |
| Study Hours Prediction | Predicts average weekly self-study time and effort required for that career. |
| Model Persistence | Model can be saved and reloaded using Joblib or manual NumPy serialization. |
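The persistence row above can be sketched with plain NumPy serialization (the file name and parameter shapes here are assumptions, not the project's actual artifacts):

```python
import numpy as np
import os
import tempfile

# Hypothetical trained parameters: W maps 3 careers -> 6 subject scores.
W = np.arange(18, dtype=float).reshape(3, 6)
b = np.zeros(6)

# Save with manual NumPy serialization (no pickle required).
path = os.path.join(tempfile.gettempdir(), "career_model.npz")
np.savez(path, W=W, b=b)

# Reload later and verify the parameters round-trip exactly.
data = np.load(path)
W2, b2 = data["W"], data["b"]
assert np.allclose(W, W2) and np.allclose(b, b2)
```

`joblib.dump`/`joblib.load` would work the same way for a whole model object; `np.savez` is shown because it avoids pickle entirely.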
Input:
Career Aspiration → encoded using OneHotEncoder (categorical → numeric features)
Output:
Scores for each subject (Math, Physics, Chemistry, Biology, English, Geography, …)
Model:
Custom Linear Regression

```
Y = XW + b
```

Where:
- `X`: encoded career vector (one-hot)
- `W`: learned weight matrix (one column per subject)
- `b`: learned bias vector
- `Y`: predicted subject scores
Training:
Either via Normal Equation (closed-form) or Gradient Descent.
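A minimal sketch of both training routes, assuming a multi-output design where `W` holds one column per subject (the class and method names are hypothetical, not the project's API):

```python
import numpy as np

class MultiOutputLinearRegression:
    """Sketch of Y = XW + b fitted for several targets at once."""

    def fit_normal_equation(self, X, Y):
        # Append a bias column of ones, then solve the closed form
        # W = (X^T X)^-1 X^T Y, using pinv for numerical stability.
        Xb = np.hstack([X, np.ones((X.shape[0], 1))])
        Wb = np.linalg.pinv(Xb) @ Y
        self.W, self.b = Wb[:-1], Wb[-1]
        return self

    def fit_gradient_descent(self, X, Y, lr=0.1, epochs=1000):
        n, d = X.shape
        self.W = np.zeros((d, Y.shape[1]))
        self.b = np.zeros(Y.shape[1])
        for _ in range(epochs):
            err = self.predict(X) - Y        # (n, k) residual matrix
            self.W -= lr * X.T @ err / n     # gradient of MSE w.r.t. W
            self.b -= lr * err.mean(axis=0)  # gradient of MSE w.r.t. b
        return self

    def predict(self, X):
        return X @ self.W + self.b
```

Both routes minimize the same squared error; the normal equation is exact in one step, while gradient descent iterates and needs a learning rate, but scales better when the feature count grows.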
Confidence is computed as

```
Confidence = 1 − (Mean Absolute Error for that career) / (Max Deviation for that career)
```

This scales between 0 and 1, where:
- 1.0 → Model is very sure (low error)
- 0.5 → Moderate reliability
- 0.0 → Weak confidence (limited similar samples)
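One plausible reading of the formula, treating "Max Deviation" as the largest absolute prediction error within the selected career group (that interpretation is an assumption):

```python
import numpy as np

def confidence(y_true, y_pred):
    """Confidence = 1 - MAE / max absolute error, over students who
    share the selected career. Clipped to [0, 1]; higher means the
    errors are uniformly small relative to the worst case.
    Note: 'Max Deviation' is read here as the largest absolute error,
    an assumption since the formula above does not pin it down."""
    errors = np.abs(np.asarray(y_true, float) - np.asarray(y_pred, float))
    max_dev = errors.max()
    if max_dev == 0:
        return 1.0  # perfect predictions for this career group
    return float(np.clip(1.0 - errors.mean() / max_dev, 0.0, 1.0))
```

With few similar samples the error estimate itself is noisy, which is why low scores are described as "limited similar samples" rather than proof of a bad fit.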
Example output:

```
Selected Career: Data Scientist

Predicted Subject Scores:
  Math:      89.3
  Physics:   82.7
  Chemistry: 78.9
  English:   91.1
  Biology:   68.2
  Geography: 74.5

Model Confidence:       0.86
Avg Weekly Study Hours: 8.2 hrs/week
Suggested Effort:       High
```