This repository presents a regression analysis performed on the College dataset from the ISLR2 package in R.
The analysis explores relationships between various institutional characteristics and student outcomes to demonstrate statistical modelling techniques using R.
- Explore and visualise the College dataset.
- Apply multiple linear regression techniques.
- Evaluate model performance and interpret significant predictors.
- Demonstrate data cleaning, exploratory analysis, and regression diagnostics.
The College dataset is included in the ISLR2 R package and contains information on U.S. colleges such as:
- Number of applications and acceptances
- Tuition and room costs
- Graduation rate
- Student-to-faculty ratio
- Type of institution (Private or Public)
- Data exploration and summary statistics
- Linear and multiple regression modelling
- Model diagnostics and residual analysis
- Variable transformations to improve fit
- Best subset selection to identify optimal predictors
- Polynomial regression and evaluation using cross-validation (Holdout, LOOCV, k-Fold)
college-regression-analysis/
│
├── figures/ # Generated plots and diagnostic visualisations
│ ├── model_diagnostics_original.png
│ ├── studentized_residuals_original.png
│ └── ...
│
├── .gitignore # Files/folders excluded from Git
│
├── LICENSE # MIT License
│
├── README.md # Project overview and instructions
│
├── college-regression-analysis.Rproj # RStudio project file for reproducibility
│
├── college_regression_analysis.R # Main R script with full analysis
│
├── college_regression_report.pdf # Written assignment report
│
└── install_packages.R # Install all required R packages
To reproduce the analysis:
-
Clone the repository
git clone https://github.com/ChokZB/college-regression-analysis.git
-
Open the R project in RStudio
college-regression-analysis.Rproj
-
Install all required packages
source("install_packages.R") -
Run the main script
source("college_regression_analysis.R")
Below is a sample output from the regression and model selection analyses.
Chok Zu Bing
GitHub: @ChokZB
This project is released under the MIT License.
