ECML PKDD 2024 Diving Deep: Forecasting Sea Surface Temperatures and Anomalies

September 9-13 | Vilnius

About the discovery challenge

Hurricanes, mass coral bleaching, disruption of sea mammal migration patterns, and extremely hot summers or cold winters all have one thing in common: they are driven by temperature changes in our seas and oceans. Variability in sea surface temperatures (SSTs), also known as SST anomalies (SSTAs), is linked to climate oscillations and extreme events, including the El Niño‐Southern Oscillation (ENSO), the Indian Ocean Dipole (IOD), and marine heatwaves. In this challenge, participants will investigate the predictability of global SSTs and SSTAs, focusing specifically on forecasts with one-month and six-month lead times.

Predicting SST is crucial for several reasons:

  • Climate Forecasting: SST is a key indicator of climate patterns and can help predict weather phenomena such as hurricanes, droughts, and floods. By understanding SST variations, meteorologists can better anticipate and prepare for extreme weather events.
  • Ecosystem Management: SST influences marine ecosystems, including the distribution and abundance of species, migration patterns, and reproductive cycles. Predicting SST helps marine biologists and conservationists manage and protect marine biodiversity more effectively.
  • Fisheries Management: Many fish species rely on specific temperature ranges for breeding, feeding, and migration. Predicting SST helps fisheries managers make informed decisions about quotas, fishing seasons, and habitat conservation to ensure sustainable fish populations.
  • Human Activities: SST affects various human activities, such as shipping, tourism, and coastal development. Predicting SST enables stakeholders to plan and adapt infrastructure, coastal defenses, and tourism activities in response to changing ocean temperatures.
  • Climate Change Monitoring: SST is a critical indicator of climate change, with rising temperatures affecting ocean circulation, sea levels, and weather patterns. Accurate predictions of SST trends help scientists monitor and assess the impacts of climate change on the oceans and the broader environment.

Challenge

Dataset

The SSTA data and additional features (sea surface pressure and air temperature 2 metres above the surface) were sourced from ERA5, the fifth-generation reanalysis produced by the European Centre for Medium‐Range Weather Forecasts (ECMWF), covering the past nine decades globally. ERA5 offers monthly estimates of various atmospheric, land, and oceanic variables on a global scale, with a spatial resolution of 0.25°, spanning from January 1940 to the present. We prepared the SSTA training set by subtracting a climatology value for each month from the corresponding SST values, where climatology is defined as the average SST over a certain time period. Each column of the “.csv” file contains a time series of SSTA for a specific location; the coordinates for each location are contained in the “.csv”. The “...output.csv” contains SSTA values shifted 3 months in advance for the same location.
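The anomaly computation described above can be sketched as follows. This is a minimal illustration and not the organizers' preprocessing code: the function name `monthly_anomalies`, the array layout (rows are months, columns are locations), the assumption that the first row is a January, and the choice of the whole record as the climatology period are all assumptions, since the page only says the average is taken "over a certain time period".

```python
import numpy as np

def monthly_anomalies(sst, clim_slice=slice(None)):
    """Subtract a per-calendar-month climatology from monthly SST.

    sst: array of shape (n_months, n_locations); row 0 is assumed to be a January.
    clim_slice: rows used as the reference period (assumption: the whole record).
    """
    sst = np.asarray(sst, dtype=float)
    month_of_row = np.arange(sst.shape[0]) % 12  # 0 = January, ..., 11 = December
    ref = sst[clim_slice]
    ref_months = month_of_row[clim_slice]
    # Mean SST for each calendar month over the reference period, per location.
    climatology = np.stack([ref[ref_months == m].mean(axis=0) for m in range(12)])
    return sst - climatology[month_of_row]

# Toy data: two years, two locations, same seasonal cycle, second year 2 degrees warmer.
seasonal = 20.0 + 5.0 * np.sin(2 * np.pi * np.arange(12) / 12)
year_one = np.column_stack([seasonal, seasonal + 1.0])
year_two = year_one + 2.0
ssta = monthly_anomalies(np.vstack([year_one, year_two]))
```

In the toy data the seasonal cycle cancels out, leaving an anomaly of -1 for every month of the first year and +1 for every month of the second.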

Task: Predict SSTA 3 months in advance based on the previous SSTA values and two additional features: mean sea surface pressure and air temperature 2 metres above the surface.

Evaluation: The SSTA data for the evaluation is structured as follows. Each column in the “.csv” contains blocks of 12 consecutive SSTA values. The goal is to issue a 3-month-ahead SSTA forecast for each block. For example, assume that the first block had time series with time stamps [Jan 2011 … Dec 2011] then the forecast would be for April 2012. The time stamps are omitted from the dataset. The evaluation metric is the difference between the RMSE of the simple baseline and the RMSE of your forecast, averaged across all locations. The simple baseline is the persistence model: the current SSTA value is used as the 3-month-ahead forecast.
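A rough sketch of this metric is given below. It is not the official scoring script. The function names, the array shapes, and the order of operations (RMSE computed over blocks for each location, then the difference averaged over locations) are assumptions made for illustration. A positive score means the forecast beats persistence.

```python
import numpy as np

def rmse_per_location(pred, truth):
    # RMSE over forecast blocks (axis 0), giving one value per location.
    return np.sqrt(np.mean((pred - truth) ** 2, axis=0))

def skill_score(blocks, truth, pred):
    """Baseline RMSE minus forecast RMSE, averaged across locations.

    blocks: (n_blocks, 12, n_locations) input SSTA windows (assumed layout)
    truth:  (n_blocks, n_locations) observed SSTA at the forecast time
    pred:   (n_blocks, n_locations) submitted forecasts
    """
    baseline = blocks[:, -1, :]  # persistence: last observed SSTA in each block
    diff = rmse_per_location(baseline, truth) - rmse_per_location(pred, truth)
    return float(diff.mean())

# Toy example: persistence is off by 1.0 everywhere, the forecast by 0.5.
blocks = np.zeros((2, 12, 2))
blocks[:, -1, :] = 1.0
truth = np.full((2, 2), 2.0)
pred = truth + 0.5
score = skill_score(blocks, truth, pred)
```

Submitting the persistence forecast itself would score exactly zero under this definition.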

Additional support materials can be found in: Ding Ning, Varvara Vetrova, Karin R. Bryan, Yun Sing Koh (2023). Harnessing the Power of Graph Representation in Climate Forecasting: Predicting Global Monthly Mean Sea Surface Temperatures and Anomalies. Earth and Space Science, Volume 11, Issue 3. https://doi.org/10.1029/2023EA003455

Technical Details

All the code for the competition and information about the dataset will be released here: [ZIP]. You will find the sample submission and related information on the same website. Please submit a zip file containing a single CSV file named submission.csv, which should consist of a single column with the 71 predicted SSTA values.
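One way to package a submission in this format is sketched below using only the Python standard library. The helper name `write_submission`, the archive name `submission.zip`, and the absence of a header row are assumptions; the sample submission released by the organizers is the authority on the exact layout.

```python
import csv
import io
import zipfile

N_PREDICTIONS = 71  # number of predicted SSTA values stated in the instructions

def write_submission(predictions, zip_path="submission.zip"):
    """Write predictions as a one-column submission.csv inside a zip archive."""
    if len(predictions) != N_PREDICTIONS:
        raise ValueError(f"expected {N_PREDICTIONS} predictions, got {len(predictions)}")
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    for value in predictions:
        writer.writerow([float(value)])  # one value per row, no header (assumption)
    with zipfile.ZipFile(zip_path, "w", compression=zipfile.ZIP_DEFLATED) as archive:
        archive.writestr("submission.csv", buffer.getvalue())
    return zip_path
```

Calling `write_submission(my_forecasts)` with a list of 71 numbers produces the archive in the current directory. The length check catches a wrong number of predictions before upload.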

How to participate

Participants will be asked to complete a Google form [https://forms.gle/8Tcgpd6Y3ahnv4pF8] to enroll in the challenge. Participants are reminded that registering multiple times to gain an unfair advantage is strictly prohibited.

Prizes

  • Co-authorship of the challenge paper for the winners.
  • The first prize, sponsored by TAIAO (https://taiao.ai/), is €1000.
  • A free ECML conference registration for the best overall solution.

    Issues and Questions

    Please use the Forum provided by CodaBench to ask questions and report issues.
    For other matters, use the contact email with [DIVING DEEP] in the subject line.

    Terms and Conditions

    1. Eligibility
  • Participants of all backgrounds and levels of expertise are welcome to join the competition.
  • Participants must agree to abide by these terms and conditions upon registration.
  • External data cannot be used.
  • Teams must be composed of at most five people.
  • Code must be publicly released by participants so that the organizers can ensure compliance, check for irregularities or inaccuracies, and verify the results.
  • To be eligible for prizes, participants must provide a brief technical report of 4 pages and must beat the weakest baseline (the persistence model).

    2. Competition Period
  • The competition consists of two phases: Phase 1 (Development Phase) and Phase 2 (Evaluation Phase).
  • Each phase has a specific start and end date, as stated on the competition page.

    3. Intellectual Property
  • Participants retain intellectual property rights to their submissions, but grant the organizers the right to use their submissions for promotional or educational purposes.

    4. Code of Conduct
  • Participants are expected to maintain professionalism and respect towards other competitors and organizers. Any form of cheating, plagiarism, or unethical behavior will result in disqualification.

    5. Disputes and Appeals
  • In case of any disputes, the decision of the competition organizers shall be final.

    6. Changes to Terms & Conditions
  • The organizers reserve the right to make changes to the terms and conditions at any time.
  • Participants will be notified of any significant changes.

    Submission

    Code submission portal: CodaBench

    Report: Please also submit a report. The report must adhere to the outlined guidelines, including referencing the published source code (e.g., a GitHub repository). In your report, please include the SSTA value for September 2024 for the Baltic Sea. Reports should be submitted by 31st July 2024 through our conference submission system, CMT.

    Timeline for the challenge

    We anticipate that the challenge will follow the timeline below:
  • Start of competition: Friday, 17th May 2024, 11:59 PM UTC
  • Phase 1 (Development Phase) and Phase 2 (Evaluation Phase)
  • End of competition: Monday, 17th June 2024, 11:59 PM UTC
  • Written report submission deadline: Monday, 8th July 2024 11:59 PM UTC
  • Publish results: Wednesday, 10th July 2024 11:59 PM UTC
  • Camera-ready deadline: Wednesday, 31st July 2024 11:59 PM UTC
  • Winners present solutions at ECML/PKDD 2024: Monday, 9th September - Friday, 13th September 2024

    Challenge Winners

  • First place: team randomguy: Andreas Voskou, Cyprus University of Technology, Cyprus
  • Second place: team UPB-DICE: N’Dah Jean Kouagou and Arnab Sharma, Paderborn University, Germany

    Organizing Committee

    Dr Varvara Vetrova

    University of Canterbury, New Zealand

    Dr Phil Mourot

    Waikato Regional Council, New Zealand

    Ding Ning

    University of Canterbury, New Zealand

    Prof Karin Bryan

    University of Auckland, New Zealand

    Prof Yun Sing Koh

    University of Auckland, New Zealand

    Contact

    Please reach out to us with any questions. Email: divingdeepecml@googlegroups.com