MLTox

Enhancing toxicological testing through machine learning

Started
September 1, 2021
Status
In Progress

Abstract

We plan to use machine learning (ML) methods to predict the effects of chemicals on aquatic species.

Our main goal is to use a combination of data from in-vivo (whole organisms) and in-vitro (cell culture) experiments to infer the effects of chemicals on organisms for which no testing data are available (both for the chemical and for the organism).

In the literature, this kind of problem is also known as across-chemical (and across-species) extrapolation. Extrapolation across chemicals is usually performed using measures of chemical similarity, under the assumption that similar chemicals will be similarly toxic to the same species. Extrapolation across species can be based on measured chemical effects on some species together with a measure of similarity between species: phylogenetic distance, sequence or structure similarity of the chemicals' known molecular targets (where these are available), or similarity in physiological traits.
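To make the similarity assumption concrete, the following is a minimal sketch of across-chemical extrapolation by nearest neighbours. It assumes each chemical is represented by a set of "on" fingerprint bits and uses Tanimoto similarity, one common choice among many. The function names, fingerprints, and toxicity values are hypothetical and purely illustrative; they are not the project's data or method.

```python
from typing import Dict, FrozenSet, Tuple


def tanimoto(a: FrozenSet[int], b: FrozenSet[int]) -> float:
    """Tanimoto (Jaccard) similarity between two sets of 'on' fingerprint bits."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def read_across(
    query: FrozenSet[int],
    known: Dict[str, Tuple[FrozenSet[int], float]],
    k: int = 2,
) -> float:
    """Similarity-weighted mean toxicity of the k chemicals most similar to the query."""
    scored = sorted(
        ((tanimoto(query, fp), tox) for fp, tox in known.values()),
        key=lambda pair: pair[0],
        reverse=True,
    )[:k]
    total = sum(sim for sim, _ in scored)
    if total == 0:
        raise ValueError("no tested chemical is similar to the query")
    return sum(sim * tox for sim, tox in scored) / total


# Hypothetical tested chemicals: (fingerprint bits, made-up toxicity value).
known = {
    "chem_A": (frozenset({1, 2, 3, 4}), 1.0),
    "chem_B": (frozenset({1, 2, 3, 5}), 2.0),
    "chem_C": (frozenset({7, 8, 9}), 5.0),
}

# Untested chemical sharing three bits with chem_A and chem_B, none with chem_C.
query = frozenset({1, 2, 3})
prediction = read_across(query, known, k=2)
```

Here the prediction is driven entirely by the two structurally similar chemicals, which is exactly the assumption that can fail when similar structures act through different mechanisms.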

Given the enormous number of chemicals and potentially affected species, extrapolation chemical-by-chemical or species-by-species is a daunting task.

People

Collaborators

SDSC Team:
Fernando Perez-Cruz
Lilian Gasser

PI | Partners:

Department Systems Analysis, Integrated Assessment and Modelling:

  • Prof. Dr. Kristin Schirmer
  • Dr. Marco Baity Jesi
  • Dr. Christoph Schür

More info

Problem:

Ecotoxicological testing requires large investments of money, workforce, and time, and in-vivo tests additionally cause animal suffering. There are global efforts to reduce or replace animal testing in human and environmental risk assessment, driven by both ethical and feasibility concerns. Indeed, a paradigm shift is needed to ensure a toxic-free environment, as proposed, e.g., in the EU’s Green Deal.

Impact:

With the work proposed here, we will provide new means to protect the environment from toxicants. By combining ML, in-vitro tests, and pre-existing in-vivo data, these will allow us to significantly reduce or even replace experiments on animals.

Proposed approach:

  • In WP1, we will train standard ML models on fish data and will compare them to more elaborate models.
  • In WP2, we will analyze how much and under which conditions the usage of in-vitro data can improve the predictions of our ML models.
  • In WP3, we will use our models to gain a better understanding of the nonlinear relationships that connect species, chemicals, and related toxicity.
  • In WP4, we will explore methods for improving the performance of our models and we will release an open-source package with our models.
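Because the goal is to predict effects for chemicals with no testing data, one plausible way to evaluate models such as those in WP1 is to hold out whole chemicals, so that no test chemical appears in training. The sketch below illustrates that idea only; the record layout, field names, and values are hypothetical, and the source does not state which evaluation protocol the project uses.

```python
import random
from typing import Dict, List, Tuple

Record = Dict[str, object]


def split_by_chemical(
    records: List[Record], test_fraction: float = 0.25, seed: int = 0
) -> Tuple[List[Record], List[Record]]:
    """Split records so that every chemical falls entirely in train or in test."""
    chemicals = sorted({str(r["chemical"]) for r in records})
    rng = random.Random(seed)
    rng.shuffle(chemicals)
    # Hold out at least one chemical, even for very small datasets.
    n_test = max(1, round(len(chemicals) * test_fraction))
    held_out = set(chemicals[:n_test])
    train = [r for r in records if r["chemical"] not in held_out]
    test = [r for r in records if r["chemical"] in held_out]
    return train, test


# Hypothetical toy records: one row per (chemical, species) test result.
records = [
    {"chemical": f"chem_{c}", "species": f"species_{s}", "toxicity": float(c + s)}
    for c in range(4)
    for s in range(3)
]

train, test = split_by_chemical(records, test_fraction=0.25, seed=0)
train_chemicals = {r["chemical"] for r in train}
test_chemicals = {r["chemical"] for r in test}
```

The same pattern applies to across-species extrapolation by grouping on the species field instead. A purely random row-wise split would let the same chemical appear on both sides and tend to overstate performance on untested chemicals.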

Annexe

Additional resources

Bibliography

  1. Luechtefeld et al. (2018) Machine Learning of Toxicological Big Data Enables Read-Across Structure Activity Relationships (RASAR) Outperforming Animal Test Reproducibility. Toxicological Sciences, Volume 165, Issue 1, September 2018, Pages 198–212
