An Integrated Machine Learning Framework for Novel Small Molecule Drug Design
Dr. Jonathan E. Allen, Informatics Thrust Leader, Biosecurity Center at ATOM 
Consortium
Wednesday July 15, 2020, 12:00 to 1:00 pm EDT
Register for free at https://www.rosaandco.com/webinars
Abstract:
The drug discovery process is costly, slow, and failure prone. It takes an 
average of 5.5 years to get to the clinical testing stage, and in this time 
millions of molecules are tested, thousands are made, and most fail. The ATOM 
Consortium (atomscience.org), comprised of LLNL, GSK, Frederick National Lab, 
and UCSF, is working to increase efficiencies in the drug discovery process 
through improved integration of machine learning earlier in the drug design and 
discovery process by evaluating multiple properties needed to make a viable 
drug. A combination of safety, pharmacokinetic and efficacy properties are 
considered simultaneously in the early drug design phase with an aim to 
ultimately show that these molecules will have better success rates with 
subsequent pre-clinical and clinical testing.
The purpose of this webinar will be to introduce key components of the ATOM 
computational framework, highlight ongoing challenges and opportunities for 
improvement. The presentation will begin with a description of AMPL, the open 
source framework developed to build machine learning models that generate key 
safety and pharmacokinetics parameters, used for molecule evaluation and as 
input to anticipated Quantitative System Pharmacology and Toxicology models. 
The end-to-end pipeline handles data curation, feature extraction, model 
building, prediction generation, and data visualization.
Next, we'll describe how the best-performing models are integrated into an 
active learning loop (with code in the process of being open sourced) to guide 
the search for de novo compounds, with plans to integrate an in-house PBPK 
model to predict in-vivo behavior. The active learning loop includes a 
computational search through chemical space for candidate small molecules with 
opportunities for proposed molecules to be evaluated experimentally for model 
validation and re-training. Discussion of the active learning pipeline will 
include an examination of the utility of machine learning model uncertainty 
estimates needed to guide active learning and challenges in designing and 
bounding the chemical search space. We will conclude with an examination of an 
early test of one round of the active learning loop applied to the design of a 
selective kinase inhibitor.


Reply via email to