🏠 House Price Prediction

📌 Project Overview

This project predicts house sale prices using machine learning techniques on the Kaggle House Prices Dataset . We implemented Linear Regression as a baseline model and then improved performance using a Random Forest Regressor.

The goal is to understand the factors affecting house prices and build a model that achieves low prediction error (measured with RMSE).

📊 Dataset

Source: Kaggle – House Prices: Advanced Regression Techniques

Size: ~1,460 rows, 80 features

Features:

Numerical: LotArea, OverallQual, YearBuilt, GrLivArea, etc.

Categorical: Neighbourhood, HouseStyle, RoofStyle, etc.

Target: SalePrice (continuous variable)

⚙️ Steps in the Project

1. Exploratory Data Analysis (EDA)

- Visualised distributions (histograms, boxplots, scatterplots).

- Checked correlations between features and target.

2. Data Preprocessing

- Handled missing values.

- One-hot encoded categorical variables.

- Split data into training & testing sets.

3. Feature Engineering

- Created new features (e.g., house age, total square footage).

- Removed/combined less useful variables.

4. Modelling

- Baseline model: Linear Regression

- Advanced model: Random Forest Regressor

5. Evaluation

- Metric: Root Mean Squared Error (RMSE)

- Compared Linear Regression vs. Random Forest.

📈 Results

	Linear Regression	Random Forest
RMSE	77699.05	28681.915963515752

Random Forest achieved lower RMSE, meaning it predicts house prices more accurately.

🛠️ Technologies Used

Python

Pandas, NumPy

Matplotlib, Seaborn

Scikit-learn

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
house-prices-advanced-regression-techniques (1)		house-prices-advanced-regression-techniques (1)
EDA_and_Visualization.ipynb		EDA_and_Visualization.ipynb
HelloWorld.ipynb		HelloWorld.ipynb
README.md		README.md
Random_Forest.ipynb		Random_Forest.ipynb
app.py		app.py
index.html		index.html
matplotlib_exercises_.ipynb		matplotlib_exercises_.ipynb
model.pkl		model.pkl
project_iti_t8.ipynb		project_iti_t8.ipynb
style.css		style.css

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🏠 House Price Prediction

📌 Project Overview

📊 Dataset

⚙️ Steps in the Project

1. Exploratory Data Analysis (EDA)

2. Data Preprocessing

3. Feature Engineering

4. Modelling

5. Evaluation

📈 Results

🛠️ Technologies Used

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🏠 House Price Prediction

📌 Project Overview

📊 Dataset

⚙️ Steps in the Project

1. Exploratory Data Analysis (EDA)

2. Data Preprocessing

3. Feature Engineering

4. Modelling

5. Evaluation

📈 Results

🛠️ Technologies Used

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages