DataXAS

Data science for X-ray Absorption Spectroscopy

Started
March 1, 2024
Status
In Progress
Share this project

Abstract

The overall goal of the project is to develop data-driven approaches for the enhancement and characterization of high throughput X-ray spectroscopy (XAS) analyses at PSI.

The primary objectives are to 1) develop generic machine learning tools for noise suppression from XAS data to improve the achievable time-resolution from quick scanning XAS on challenging materials of interest; 2) Blind source signal separation to deconvolute the principal components forming large time-series spectroscopic datasets; 3) develop a classification algorithm to extract the structural motif of catalytic species from operando XAS data; 4) expand the data science expertise of the catalysis community through education initiatives.

The project is expected to yield new, generalizable methods for XAS data processing and characterization that will be implemented at the SuperXAS and Debye beamlines of the SLS at PSI and within two major initiatives for sustainable catalysis in Switzerland: the ETH domain Catalysis Hub and the NCCR Catalysis.

People

Collaborators

SDSC Team:
Ilnura Usmanova
Benjamín Béjar Haro

PI | Partners:

PSI, Photon Science Division:

  • Dr. Adam H. Clark
  • Dr. Tomas Aidukas

More info

PSI, Operando Spectroscopy:

  • Dr. Maarten Nachtegaal

More info

ETH Zurich, Department of Chemistry and Applied Biosciences:

  • Prof. Christophe Copéret

More info

PSI, Science IT Infrastructure and Services Department, SCD:

  • Xingxing (Marie) Yao

More info

EPFL, Laboratory for Functional Inorganic Materials, EPFL Valais Wallis:

  • Prof. Wendy Queen

More info

description

Motivation

Investigation of catalytic active sites is challenging, because they are often made up of only a few atoms amongst a majority of atoms of the same elements that are spectator species and often requires time-resolved synchrotron-based X-ray tools to identify them. Advanced synchrotron spectroscopy techniques give direct access to geometric structural and electronic characteristics of materials under operation. Within the broad ensemble of synchrotron techniques, X-ray absorption spectroscopy (XAS) is particularly well suited to the study of catalytic active sites elucidating, uniquely with element specificity, the local atomic configuration and electronic state of catalyst actives sites in operando conditions.

Figure 1: (a) A schematic for the absorption of an X-ray photon and excitation of a core-level electron to produce a photoelectron. (b) A representation of an absorption edge measured with energy dependence. The sharp vertical step gives the so-called absorption edge and the modulation of the absorption coefficient after the edge arising from the scattering of the photoelectron wave from neighboring ions. Figure reproduced and adapted from S. Pasarelli [1].

The application of state-of-the-art data science techniques to catalysis is still in its infancy and is an important short-, medium- and long-term goal in catalysis. This project aims at providing a fundamental step towards this goal by establishing robust and self-consistent workflows for data collection, processing, and analysis, applied to high-throughput spectroscopic data. The general objective of this project is to develop a data-driven methodology for data handling and processing of XAS data.

Proposed Approach / Solution

For the first task of denoising [2], we propose an efficient denoising approach of noisy XAS spectra based on careful Gaussian Processes (GP) regression with adaptive length scales. We propose using a warping procedure to preprocess the data and make the signals more stationary, which already improves all the classical denoising methods, see Figure 2. For the data with higher levels of noise, we propose GP regression for denoising of the preliminary warped signals.

Figure 2: Different denoising procedures with and without input warping.

In the second task of blind signal separation [3], we encounter the challenge of rotational ambiguity inherent in the classical Nonnegative Matrix Factorization (NMF) formulation. In particular, like shown on Figure 3, multiple pure component sets can lead to the same quality of the NMF representation. We propose to address this issue by regularizing the NMF formulation using a database of real pure elements.

Figure 3: Rotational ambiguity of NMF formulation.

Thirdly, using the labeled dataset of real spectra measurements, we use a classification approach [4], [5], [6], [7], for X-ray absorption near-edge structure (XANES) spectra, possibly NN based, to determine the key geometrical and structural features of the investigated chemical compounds directly from the spectrum.

Impact

Impact. The combined denoising, classification, and decomposition techniques will help domain scientists determine the particular components of the time-resolved XAS spectra with better certainty and verify them by checking fewer potential candidates for the components hence facilitating to streamline the process of compound characterization.

Gallery

Annexe

Cover image source: Adobe Stock

Additional resources

Bibliography

  1. S. (ESRF) Pascarelli, ‘X-Ray Absorption Spectroscopy: Fundamentals and Simple Model of Exafs’, can be found under https://indico.ill.fr/indico/event/75/material/slides/37.pdf, 2017.
  2. Abe H, Aquilanti G, Boada R, Bunker B, Glatzel P, Nachtegaal M, Pascarelli S. Improving the quality of XAFS data. J Synchrotron Radiat. 2018
  3. A. de Juan, R. Tauler, ‘Multivariate Curve Resolution (MCR) from 2000: Progress in Concepts and Applications’, http://dx.doi.org/10.1080/10408340600970005  2007, 36, 163–176.
  4. M. R. Carbone, M. Topsakal, D. Lu, S. Yoo, ‘Machine-Learning X-Ray Absorption Spectra to Quantitative Accuracy’, Phys. Rev. Lett. 2020, 124, 156401.
  5. M. R. Carbone, S. Yoo, M. Topsakal, D. Lu, ‘Classification of local chemical environments from x-ray absorption spectra using supervised machine learning’, Phys. Rev. Mater. 2019, 3, 1–11.
  6. J. Timoshenko, D. Lu, Y. Lin, A. I. Frenkel, ‘Supervised Machine-Learning-Based Determination of Three-Dimensional Structure of Metallic Nanoparticles’, J. Phys. Chem. Lett., 2017, 8, 5091–5098.
  7. J. Timoshenko, A. I. Frenkel, ‘“inverting” X-ray Absorption Spectra of Catalysts by Machine Learning in Search for Activity Descriptors’, ACS Catal. 2019, 9, 10192–10211.

Publications

Related Pages

More projects

MAGNIFY

In Progress
Machine learning Assisted larGe scale quaNtIfication of building energy FlexibilitY
Energy, Climate & Environment

SPI-GreenFjord

In Progress
Energy, Climate & Environment

SPI-PAMIR

In Progress
Energy, Climate & Environment

TREMA

Completed
Transforming real estate management with AI
Engineering

News

Latest news

First National Calls: 50 selected projects to start in 2025
March 12, 2025

First National Calls: 50 selected projects to start in 2025

First National Calls: 50 selected projects to start in 2025

50 proposals were selected through the review processes of the SDSC's first National Calls.
AIXD | Generative AI toolbox for architects and engineers
January 22, 2025

AIXD | Generative AI toolbox for architects and engineers

AIXD | Generative AI toolbox for architects and engineers

Introducing AIXD (AI-eXtended Design), a toolbox for forward and inverse modeling for exhaustive design exploration.
Smartair | An active learning algorithm for real-time acquisition and regression of flow field data
May 1, 2024

Smartair | An active learning algorithm for real-time acquisition and regression of flow field data

Smartair | An active learning algorithm for real-time acquisition and regression of flow field data

We’ve developed a smart solution for wind tunnel testing that learns as it works, providing accurate results faster. It provides an accurate mean flow field and turbulence field reconstruction while shortening the sampling time.

Contact us

Let’s talk Data Science

Do you need our services or expertise?
Contact us for your next Data Science project!