
Find a student project

Note that some projects offered at one institution are potentially accessible to students from other institutions and vice versa.
Contrastive Language-Graph Pretraining


After finishing his Master’s in electrical engineering at the École Polytechnique Fédérale de Lausanne (EPFL), Nathanaël worked as a researcher at the Acoustic Research Institute (ARI) in Vienna. In 2013, he returned to EPFL for a PhD, where he specialized in several fields of data science: signal processing, machine learning, graph theory and optimization. He also created two open source libraries, one for optimization (UNLocBoX) and one for graph signal processing (GSPBOX). Since 2017, Nathanaël Perraudin has been a Research Data Scientist at the Swiss Data Science Center at ETH Zurich. He focuses on different aspects of deep learning in the area of generative models (VAE and GAN), recursive architectures and convolutional neural networks for irregular domains. Outside office hours, he is passionate about tango dancing, tandem bike touring, skiing and rock climbing.
Analyzing Heat Pump System Failure Patterns Using Natural Language Processing


Saurabh Bhargava joined the SDSC as a Principal Data Scientist in the Industry Cell at the Zürich office in 2022. Saurabh previously worked in the retail sector and the advertising industry in Germany. He led and built various data products for customers, applying state-of-the-art machine learning methods and industrializing them to add value for those customers. He completed his PhD at ETH Zürich in June 2017, specializing in machine learning applications on audio data. He obtained his Master’s and Bachelor’s degrees from EPFL and the Indian Institute of Technology (IIT), Roorkee, India in 2011 and 2009 respectively. His interests and expertise are in combining state-of-the-art data science and data engineering tools for building scalable data products.
Exploring the Potential of the Forward-Forward Algorithm in Deep Reinforcement Learning


After earning an MSc in Theoretical Physics at the University of Padua, Giulio graduated in Quantitative Finance from Bocconi University. Before joining the SDSC industry cell in June 2021, he spent a few years working in the financial sector, where he mainly dealt with the application of machine learning to financial risk management. When not coding, Giulio spends his free time playing bass guitar, hiking and cooking.
Forecasting the unemployment rate in the Canton of Vaud with machine learning algorithms


Alessandro joined the SDSC in March 2019 as a data scientist focused on industry collaborations. His mission is to support companies in leveraging the power of their data by adopting analytical approaches and data-centric solutions. His background is in biomedical engineering, with a PhD in neuroscience from the University of Tübingen. Before joining the center, he worked as a postdoc at the Max Planck Institute for Biological Cybernetics and at the EPFL Laboratory of Cognitive Neuroscience in Geneva, and as a data scientist for a private ecommerce company.
Copula neural processes for time series meta-learning


Simon joined the SDSC as a senior data scientist in April 2022. He conducted his doctoral studies on statistical modeling of genetic data at ETH Zürich and obtained his MSc and BSc degrees in computer science at the Technical University of Munich. Before joining the SDSC, Simon worked as a freelance statistical consultant, and as an ML scientist at an AI startup in Lugano where he built experience in topics ranging from generative modeling and Bayesian optimization to time series forecasting. Simon's research interests and expertise lie broadly in probabilistic machine and deep learning, causal inference, generative modeling, and their application in the natural sciences. Simon is an avid open-source software contributor and particularly enthusiastic about probabilistic programming languages, such as Stan.
Using Graph Neural Networks to Model 3D Surfaces with Application to the Stefan Problem


After finishing his Master’s in electrical engineering at the École Polytechnique Fédérale de Lausanne (EPFL), Nathanaël worked as a researcher at the Acoustic Research Institute (ARI) in Vienna. In 2013, he returned to EPFL for a PhD, where he specialized in several fields of data science: signal processing, machine learning, graph theory and optimization. He also created two open source libraries, one for optimization (UNLocBoX) and one for graph signal processing (GSPBOX). Since 2017, Nathanaël Perraudin has been a Research Data Scientist at the Swiss Data Science Center at ETH Zurich. He focuses on different aspects of deep learning in the area of generative models (VAE and GAN), recursive architectures and convolutional neural networks for irregular domains. Outside office hours, he is passionate about tango dancing, tandem bike touring, skiing and rock climbing.
Machine Learning for Biodiversity Monitoring Using Soundscapes


Michele received a Ph.D. in Environmental Sciences from the University of Lausanne (Switzerland) in 2013. He was then a visiting postdoc in the CALVIN group, Institute of Perception, Action and Behaviour of the School of Informatics at the University of Edinburgh, Scotland (2014-2016). He then joined the Multimodal Remote Sensing and the Geocomputation groups at the Geography department of the University of Zurich, Switzerland (2016-2017). His main research activities were at the interface of computer vision, machine and deep learning for the extraction of information from aerial photos, satellite optical images and geospatial data in general.
Deep learning methods for PET imaging with NMDA tracers in the diagnosis of Alzheimer’s disease


Anna joined SDSC as a Data Scientist focusing on industry collaborations in July 2019. She strives to demonstrate rigor and excellence in data analysis and interpretation, deliver actionable results, and thereby enhance industry products and services. She completed her PhD in Bioinformatics at the University of Luxembourg, where she analysed large-scale heterogeneous datasets and leveraged multiple disciplines: Statistics, Network Analysis, and Machine Learning. Before joining SDSC, Anna worked as a Data Scientist at Deloitte Luxembourg, with a focus on the Financial and Insurance sectors. More specifically, Anna developed a computer vision model for car damage recognition, a high-accuracy credit scoring model for mortgage loans and an insurance KPI dashboard with time-series analysis.
ML-Based Predictive Modeling for Robotic On-Site Plastering


Fernando received a PhD in Electrical Engineering from the Technical University of Madrid. He has been a member of the technical staff at Bell Labs and a Machine Learning Research Scientist at Amazon. Fernando has been a visiting professor at Princeton University under a Marie Curie Fellowship and an associate professor at University Carlos III in Madrid. He held positions at the Gatsby Unit (London), Max Planck Institute for Biological Cybernetics (Tuebingen), and BioWulf Technologies (New York). Since 2022, Fernando has been the Deputy Executive Director of the SDSC.


Luis is originally from Spain, where he completed his bachelor’s studies in electrical engineering and his M.Sc. in signal theory and communications, both at the University of Seville. During his Ph.D. he started focusing on machine learning methods, more specifically message passing techniques for channel coding and Bayesian methods for channel equalisation. He carried it out between the University of Seville and the University Carlos III in Madrid, also spending some time at EPFL, Switzerland, and Bell Labs, USA, where he worked on advanced techniques for optical channel coding. When he completed his Ph.D. in 2013, he moved to the Luxembourg Center on Systems Biomedicine, where he switched his interest to neuroscience, neuroimaging and the life sciences, and to the application of machine learning techniques to these fields. During his four and a half years there as a postdoc, he worked on many different problems as a data scientist, including microscopy image analysis, neuroimaging and single cell gene expression analysis. He joined the SDSC in April 2018.
Assessing and Thwarting Privacy Risks in Data Science Platforms


Mathias received his Ph.D. in computer and communication sciences from EPFL in 2015. He then spent two years as a post-doctoral researcher in the Center for IT-Security, Privacy, and Accountability (CISPA) at Saarland University, Germany, where he worked on genomic privacy and privacy in social networks. His current research interests lie at the intersection of privacy and machine learning, with a special application focus on biomedical data. He is currently the lead scientist for the SDSC of the PHRT project “DPPH: Data Protection in Personalized Health”. He is also co-principal investigator of a project funded by the Leenaards Foundation on evaluating and preventing privacy risks in biomedical databases.
Large language models for information retrieval in scientific literature


Luis is originally from Spain, where he completed his bachelor’s studies in electrical engineering and his M.Sc. in signal theory and communications, both at the University of Seville. During his Ph.D. he started focusing on machine learning methods, more specifically message passing techniques for channel coding and Bayesian methods for channel equalisation. He carried it out between the University of Seville and the University Carlos III in Madrid, also spending some time at EPFL, Switzerland, and Bell Labs, USA, where he worked on advanced techniques for optical channel coding. When he completed his Ph.D. in 2013, he moved to the Luxembourg Center on Systems Biomedicine, where he switched his interest to neuroscience, neuroimaging and the life sciences, and to the application of machine learning techniques to these fields. During his four and a half years there as a postdoc, he worked on many different problems as a data scientist, including microscopy image analysis, neuroimaging and single cell gene expression analysis. He joined the SDSC in April 2018.


Fernando received a PhD in Electrical Engineering from the Technical University of Madrid. He has been a member of the technical staff at Bell Labs and a Machine Learning Research Scientist at Amazon. Fernando has been a visiting professor at Princeton University under a Marie Curie Fellowship and an associate professor at University Carlos III in Madrid. He held positions at the Gatsby Unit (London), Max Planck Institute for Biological Cybernetics (Tuebingen), and BioWulf Technologies (New York). Since 2022, Fernando has been the Deputy Executive Director of the SDSC.
Breaking 2: assessing the likelihood of a sub-two-hour marathon


Raphaël graduated in 2014 with an engineering degree from l’Ecole des Mines de Paris and has held a Ph.D. in Statistics from l’Ecole Polytechnique Fédérale de Lausanne since 2018. Before joining the Swiss Data Science Center as a senior data scientist, Raphaël was a post-doctoral researcher at the Institute of Mathematics at EPFL, working on quantitative risk modelling for natural hazards using extreme value theory. His research interests lie at the boundary of statistics and environmental sciences, with a special focus on the analysis of spatio-temporal data.