HomeScope is a data science project focused on predicting median house prices in California using a Random Forest Regressor model. It incorporates a variety of data preprocessing techniques, machine learning models, and deployment strategies to provide an intuitive interface for house price prediction.
housing.csv: Dataset used for training and testing the model.Link.docx: Document containing a link to the deployed Streamlit app.part1.ipynb: Jupyter notebook for initial analysis and preprocessing.preprocessing.ipynb: Jupyter notebook dedicated to data preprocessing.requirements.txt: Specifies Python dependencies required for the project.rfr_info.json: JSON file with details on the Random Forest Regressor model and input features.cal_predict.py: Python script for Streamlit app deployment.deploy.ipynb: Jupyter notebook outlining deployment steps.HomeScope.py: Main script for the Streamlit app.
- Python 3.8 or higher
- Pip (Python package installer)
-
Clone the repository:
git clone https://github.com/yourusername/HomeScope.git cd HomeScope -
Install the required packages:
pip install -r requirements.txt
To start the Streamlit app, run:
streamlit run HomeScope.pyThe application will be accessible at http://localhost:8501.
- Navigate to the deployed app or launch the app locally.
- Adjust the input parameters using the sidebar options.
- Click the "Predict" button to receive the predicted median house price.
The project uses a Random Forest Regressor. The rfr_info.json file contains detailed information about the model, including input features and their respective ranges.
longitude: Longitude of the location.latitude: Latitude of the location.housing_median_age: Median age of the houses.total_rooms: Total number of rooms in the houses.total_bedrooms: Total number of bedrooms in the houses.population: Population in the area.households: Number of households.median_income: Median income of the residents.ocean_proximity: Proximity to the ocean.
Contributions are welcome! Please read the contributing guidelines first.
This project is licensed under the MIT License. See the LICENSE file for details.
- Dataset: California Housing Prices dataset.
- Streamlit for providing a platform to deploy data science apps.
- Scikit-learn for machine learning algorithms.
If you have any questions or would like to discuss further, feel free to reach out:
- Email: shubhamchandrawork@gmail.com
- LinkedIn: Shubham Chandra