Real-time cleansing of snow and weather data for operational avalanche forecasting

October 19, 2022
In Progress
Share this project


Avalanche forecasting relies on snow, snow cover and weather data – for expert evaluation as well as for machine learning based support tools. The data need to be accessible in high quality as soon as these become available. Any measurement errors, anomalies and data gaps diminish forecast accuracy.

Avalanche forecasting, physical snow models, hydrological predictions and many other Alpine scientific activities are largely data-driven; therefore, consistent and accurate data are fundamental for high-quality outputs. With increasing data volumes and the increased need for timely and accurate forecasts, preferably automated location-based forecasts, it becomes imperative to clean these essential data on the fly.



SDSC Team:
Corinne Jones
Michele Volpi

PI | Partners:


  • Prof. Dr. Jürg Schweizer
  • Dr. Massimiliano Zappa
  • Dr. Jan Svoboda
  • Marc Ruesch
  • David Liechti
  • Dr. Frank Techel
  • Florian Lustenberger

More info



The backbone of the Swiss avalanche forecast infrastructure are the data of the Intercantonal Measurement and Information System (IMIS), which currently consists of 187 automated snow and weather stations. They are distributed throughout the Swiss Alps and in most cases are situated in the high alpine region above the tree line. The stations continuously record the snow and weather conditions, transmit data to the SLF every hour and provide the national avalanche warning service of the SLF as well as local avalanche services responsible for public safety in settlements and on roads with the crucial information for danger assessment. Public products such as new snow or snow height maps also rely on these data.

Therefore, the objective is the development of algorithms that allow real-time detection of anomalies in the time series, but also the detection of outliers, and impute missing data by applying state-of-the-art machine learning approaches.

This real-time data cleansing will solve a long-standing issue with the IMIS data that are known to be contaminated with data anomalies and has hindered automated processing. Hence, the completion of the proposed research will have a major impact, in particular for the application of numerical avalanche prediction models such as we recently developed in collaboration with the SDSC.


This project aims to perform anomaly detection, outlier detection, and imputation on IMIS station data both retrospectively and in real time. In particular, the project focuses on the measurements of snow depth, wind speed and direction, air temperature, precipitation, and a maintenance model.

Proposed approach:

The SDSC is working with SLF to apply time-series-based statistical and machine learning-based methods for anomaly detection, outlier detection, and imputation in the given context.


Numerical avalanche prediction and other models used at SLF will be more accurate with the cleaned data. The cleaned data will also be made openly available in the data portal of SLF, and will therefore benefit the numerous downstream users of the data.

Weather station at SLF



Additionnal resources


  1. Bavay, M. and Egger, T.: MeteoIO 2.4.2: a preprocessing library for meteorological data, Geosci. Model Dev., 7, 3135–3151,, 2014.
  2. Pérez-Guillén, C., Techel, F., Hendrick, M., Volpi, M., van Herwijnen, A., Olevski, T., Obozinski, G., Pérez-Cruz, F., and Schweizer, J.: Data-driven automated predictions of the avalanche danger level for dry-snow conditions in Switzerland, Nat. Hazards Earth Syst. Sci., 22, 2031–2056,, 2022.


Related Pages

More projects


In Progress
Machine Learning for the Future Circular Collider Design
Big Science Data


AI-augmented architectural design
Energy, Climate & Environment


In Progress
Extracting activity from large 4D whole-brain image datasets
Biomedical Data Science


In Progress
Deep statistical learning-based image analysis for measurement of socioeconomic development in sub-Saharan Africa using high-resolution satellite images, and geo-referenced household survey data
Energy, Climate & Environment


Latest news

PassGPT | Using language models to enhance password security
February 6, 2024

PassGPT | Using language models to enhance password security

PassGPT | Using language models to enhance password security

PassGPT is a Large Language Model for password generation trained on leaked passwords, which can outperform existing methods based on generative adversarial networks by guessing twice as many unseen passwords.
ADORE | A benchmark dataset in ecotoxicology to foster the adoption of machine learning
January 24, 2024

ADORE | A benchmark dataset in ecotoxicology to foster the adoption of machine learning

ADORE | A benchmark dataset in ecotoxicology to foster the adoption of machine learning

Applying machine learning to ecotoxicology could help reduce the number of animal tests, costs, and animals sacrificed while preserving the accuracy of the in vivo tests.
License Flowers | Art and AI at SDSC
February 21, 2024

License Flowers | Art and AI at SDSC

License Flowers | Art and AI at SDSC

An adventure to create art using AI to raise awareness on code licenses

Contact us

Let’s talk Data Science

Do you need our services or expertise?
Contact us for your next Data Science project!