Methodologies · Climate Science & AI

Data Collection and Preprocessing

Climate AI systems draw on diverse observational streams: satellite remote sensing (e.g. MODIS, ERA5 reanalysis), radiosonde profiles, ocean buoy networks, and ground-based station records. A critical preprocessing challenge is harmonising datasets with different spatial resolutions, missing-data patterns, and temporal coverage^[8]. Techniques include spatial regridding, temporal interpolation, and anomaly detection to remove instrument drift or calibration artefacts before model ingestion.

Hybrid Physics–AI Model Architectures

A dominant methodological trend is embedding known physical laws directly into neural network architectures rather than treating climate prediction as a purely data-driven problem. Physics-informed neural networks (PINNs) incorporate differential equation constraints as loss-function terms, ensuring outputs respect conservation of mass, energy, and momentum^[9]. This hybrid approach substantially reduces data requirements and improves out-of-distribution generalisation — critical when extrapolating to future climate states not present in the training record.

Deep Learning Approaches

Convolutional neural networks (CNNs) are widely applied to gridded climate fields because they can efficiently learn local spatial patterns from raster data. Recurrent architectures (LSTMs, transformers) capture temporal dependencies in climate time series, enabling medium-range weather and seasonal forecasting^[5]. Graph neural networks have more recently been applied to irregular observation networks and to learning interactions between discrete Earth system components. Rolnick et al. (2022) provide a comprehensive survey of how these architectures are deployed across mitigation and adaptation problems^[4].

Evaluation and Uncertainty Quantification

Evaluating climate AI models requires metrics that reflect physical plausibility, not only statistical accuracy. Skill scores (e.g. anomaly correlation coefficient, RMSE relative to climatological baseline) are standard, but models must also be tested for their ability to reproduce extreme-value statistics and teleconnection patterns absent from training data^[5]. Ensemble approaches and Bayesian deep learning methods are increasingly used to quantify predictive uncertainty — essential for communicating risk to policymakers.