This post distills a Swiss study from the University of Geneva that benchmarks AI-based weather forecasting against a leading physical model to assess accuracy in predicting extreme weather. The findings highlight both the promise and the current limits of machine learning in meteorology.
They point to a path forward through hybrid modeling approaches.
Study findings: AI forecasts vs physical models
The researchers compared three AI systems—GraphCast, Pangu-Weather, and Fuxi—against the European Centre for Medium-Range Weather Forecasts’ HRES physical model. While AI methods offer rapid forecasts, the study found they are “systematically wrong” for record-breaking events, tending to underestimate their intensity and frequency.
Extreme cold spells were often predicted as milder than reality, and heat waves as well as strong winds were similarly underpredicted. These errors persist even when AI models are trained on large historical datasets.
Why AI struggles with record-breaking events
The authors attribute these shortcomings to AI’s reliance on historical training data and learned patterns. Rare or unprecedented events—amplified by ongoing climate change—often lie outside the range of the data AI has seen, making them poorly represented in predictions.
The study notes a tendency to undercount the frequency of such events as well as their peak intensities. This limitation is particularly acute for extremes that have become more common in a warming world.
Why physical models outperform AI for extremes
By contrast, physical weather models simulate atmospheric processes from first principles. The ECMWF’s HRES model can explore dynamics beyond past observations, providing a sturdier framework for high-impact scenarios—even when those conditions have not been previously observed.
This fundamental difference helps explain why traditional models captured extreme events more reliably than current AI approaches in the study.
Hybrid modeling: the best path forward
Experts emphasize a hybrid approach that marries machine learning with physical constraints. Combining data-driven insights with the governing equations of atmospheric physics can improve forecast skill for dangerous weather while preserving the speed and adaptability of AI.
The study recommends integrating ML components into physical models and using AI to refine subgrid processes or post-process outputs, rather than replacing physics altogether.
Practical implications for forecasting and policy
These findings have tangible implications for weather offices, emergency managers, and climate risk planners.
While AI can enhance short-range predictions and operational efficiency, decision-makers should remain cautious about relying solely on AI for extreme-event warnings.
A hybrid system that leverages the strengths of both approaches offers the most robust forecast guidance for high-impact weather.
- Prioritize hybrid models that combine AI speed with physical realism.
- Use AI outputs for ensemble generation and uncertainty framing, constrained by physics.
- Continuously test AI forecasts against record-breaking events to quantify bias.
- Invest in diverse data sources to reduce extrapolation gaps, while acknowledging physics cannot be fully replaced.
As climate change intensifies, the ability to predict extremes quickly and accurately becomes more critical.
Here is the source article for this story: AI weather models struggle to predict extreme events: Swiss study

