SST spatial distribution prediction using deep learning
Name | Location | Role |
---|---|---|
Joseph Gum | Virtual | Project Facilitator |
Jiarui Yu | Seattle | Participant |
Paula Birocchi | Seattle | Participant |
Boris Shapkin | Seattle | Participant |
Hao Tang | Australia | Participant |
Danyang Li | Australia | Participant |
Chandrama Sarker | Australia | Participant |
Zhengxi Zhou | Australia | Participant |
Dafrosa Kataraihya | Virtual | Participant |
Alex Slonimer | Virtual | Participant |
We have a SST time series from 2000 to 2020 from ERA5 reanalysis to start to work with the model, but we are also interested in obtaining and using sattelite data from MUR (https://urs.earthdata.nasa.gov/). The satellite data is available in the S3 bucket. You can easily access this dataset using this Python code: https://github.com/oceanhackweek/ohw23_proj_sst/blob/main/access_MUR_satellite_data_through_python_S3bucket.py
Pitch + Ideation: Predict SST anomalies (upwelling, other interesting SST anomalies), generate SST spatial distribution forecast. SST prediction is very important to understand the hydrodynamics and thermodynamics processes in the ocean and also near surface atmosphere-ocean interactions.
MUR Satellite Data (2002-present): S3 bucket and NASA website: https://podaac.jpl.nasa.gov/dataset/MUR-JPL-L4-GLOB-v4.1
-
Get data, and define our bounding box/ area of interest (desired approach: time [2000-2020], lat [-5,32], lon[45,90]);
-
Split data on training, validation and testing datasets;
-
Model Architecture: a) ConvLSTM: this is going to be our first (and main) approach! b) 3D CNN c) Transformers d) Hybrid: CNN + Transformer + LSTM ( we won't have enough time to apply this model)
-
Complie and fit a) Early-stop b) * Loss function: MSE, MAE, SSIM (until now, we found better results using MSE) c) * metric
-
Visualization of result and Interpretation!
The deep learning model developed here can also be used with other type of data as input! The idea is to use this model with other parameters in the future. For now, we are interested in forecasting the next day of SST spatial distribution.
https://jamboard.google.com/d/1lOgVwnqQLvNRPAOEVEGnWXm8FSTuPYQWbteptKrslTM/viewer?f=10
ohw23_proj_sst
https://docs.google.com/presentation/d/1uUAIsuj9bxOFMVeIG_h5Bs-ZGDrRodldlz2FHfj4TbE/edit#slide=id.p
https://drive.google.com/drive/folders/1M0o_R4aoDxU9XJOtLEHe90bma-Jn-jM9
Model Architecture | Val MSE | Early Stopping | Environment |
---|---|---|---|
Transformer | 0.1902 | Triggered | CPU Friendly, <32G RAM |
ConvLSTM | 0.027 | Triggered | GPU Needed, <32G RAM |
ConvLSTM + Transformer | 0.0493 | Not Triggered | GPU Needed, <128G RAM |
Model Architecture | Val MSE | Early Stopping |
---|---|---|
3D CNN | 0.2041 | Triggered |
ConvLSTM | 0.0480 | Not Triggered |