Demystifying Data-Driven Probabilistic Medium-Range Weather Forecasting

The recent revolution in data-driven methods for weather forecasting has lead to a fragmented landscape of complex, bespoke architectures and training strategies, obscuring the fundamental drivers of forecast accuracy. Here, we demonstrate that state-of-the-art probabilistic skill requires neither intricate architectural constraints nor specialized training heuristics. We introduce a scalable framework for learning multi-scale atmospheric dynamics by combining a directly downsampled latent space with a history-conditioned local projector that resolves high-resolution physics.

Learning Accurate Storm-Scale Evolution from Observations

Accurate short-term prediction of clouds and precipitation is critical for severe weather warnings,
aviation safety, and renewable energy operations. Forecasts at this timescale are provided by
numerical weather models and extrapolation methods, both of which have limitations. Mesoscale
numerical weather prediction models provide skillful forecasts at these scales but require significant
modeling expertise and computational infrastructure, which limits their accessibility.
Extrapolation-based methods are computationally lightweight but degrade rapidly beyond 1-2

HealDA: Highlighting the importance of initial errors in end-to-end AI weather forecasts

Machine-learning (ML) weather models now rival leading numerical weather prediction (NWP) systems in medium-range skill. However, almost all still rely on NWP data assimilation (DA) to provide initial conditions, tying them to expensive infrastructure and limiting the practical speed and accuracy gains of ML. More recently, ML-based DA systems have been proposed, which are often trained and evaluated end-to-end with a forecast model, making it difficult to assess the quality of their analysis fields.

Alperen Degirmenci

Alperen Degirmenci is a Senior Robotics Research Engineer at NVIDIA's Seattle Robotics Lab.

Prior to joining SRL, Alperen worked on Autonomous Driving at NVIDIA, building and scaling learning-based autonomy at the intersection of research and product. He  worked on latent-space world models, end-to-end learning, multi-modality object detection and tracking for autolabeling, and VLAs.

Sim2Val: Leveraging Correlation Across Test Platforms for Variance-Reduced Metric Estimation

Learning-based robotic systems demand rigorous validation to assure reliable performance, but extensive real-world testing is often prohibitively expensive, and if conducted may still yield insufficient data for high-confidence guarantees. In this work we introduce Sim2Val, a general estimation framework that leverages paired data across test platforms, e.g., paired simulation and real-world observations, to achieve better estimates of real-world metrics via the method of control variates.