This repository contains the code and resources for preprocessing the Titanic dataset. The dataset is used to predict survival on the Titanic using machine learning.
- Loaded the Titanic dataset.
- Explored and cleaned the data by handling missing values.
- Encoded categorical variables.
- Standardized numerical features.
- Removed outliers using the IQR method.
-
titanic_data_preprocessing.py: Python code for preprocessing the dataset. -
Titanic-Dataset.csv: The Titanic dataset used for analysis.
- Clone the repository:
git clone https://github.com/your-username/titanic-data-preprocessing.git