This repository hosts the code for the EMIT-ECOSTRESS project from Caltech's CS 101 course in Fall 2023. The project goal is to use machine learning to predict urban land surface temperature as measured by the NASA thermal imager ECOSTRESS by using the surface's reflectance spectra as recorded by the NASA hyperspectral imager EMIT. The repository contains code for dataset creation and data processing as well as all modeling and analysis.
-
Clone this repository.
-
Download the data from the team's Google Drive here: https://drive.google.com/drive/folders/1F0khkxABuI1tzEYNzjSlvQq6dORTq9Zq?usp=drive_link. (Alternatively, you can follow the
data_prep/data_downloadnotebook01_Finding_Concurrent_Data_UrbanHeat.ipynb, followed by thedata_prep/dataset_creationnotebooksData_Matching.ipynb,Collapsing_Dataset.ipynb, andDataset_Splitting.ipynbto build your own copy of the dataset.) -
Place the data in a directory alongside the cloned repository
-
Start a new virtual environment and install the dependencies found in the
requirements.txtfile above usingpip install -r requirements.txt. -
Follow the
modeling/Patch_to_Pixel.ipynbnotebook (or themodeling/CNN.ipynbnotebook) to train your own temperature prediction models!