Graph neural networks for efficient yield prediction of chemical reactions

About

This repo contains supplementary code for our paper Graph neural networks for efficient yield prediction of chemical reactions (to be published soon) where we propose to use a graph neural network to predict actual yield of chemical reactions. We evaluated the model performance on three chemical reactions datasets: 2 small scale public single reaction datasets
-- Buchwald-Hartwig (from Predicting reaction performance in C–N cross-coupling using machine learning and Suzuki-Miyaura reactions (from A platform for automated nanomole-scale reaction screening and micromole-scale synthesis in flow -- as well as one proprietary multiple reaction class dataset, provided by Enamine. We conducted a detailed analysis of model's errors on the commercial dataset and provided a chemically viable explanation for the most common of them.

We provide preprocessing and atom mapping code for open datasets as well as scripts used for training. Graph neural network code in chemprop/ dir is taken from https://github.com/chemprop/chemprop with some very minor modifications. For example, we added a possibility to apply dimensionality reduction(t-sne) to the middle graph representations and the final mixed (graph and rdkit descriptors) representations learned by graph neural net. Example visualizations can be found in clustering/ .

Installation

Required packages are listed in environment.yml. Just run conda env create -f environment.yml

To reproduce

For single reaction class datasets:

Run single_reaction_class_data_preprocessing.ipynb
Run bash train_k_fold.sh

Authors

Dzvenymyra Yarish [email protected]
Sofiya Garkot [email protected]
Oleksandr Grygorenko [email protected]
Yurii Moroz [email protected]
Dmytro Radchenko [email protected]
Oleksandr Gurbych [email protected]

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
chemprop		chemprop
clustering		clustering
data		data
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml
hyperparameter_optimization.py		hyperparameter_optimization.py
predict.py		predict.py
single_reaction_class_data_preprocessing.ipynb		single_reaction_class_data_preprocessing.ipynb
train.py		train.py
train_k_fold.sh		train_k_fold.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Graph neural networks for efficient yield prediction of chemical reactions

About

Installation

To reproduce

Authors

About

Releases

Packages

Contributors 2

Languages

License

SoftServeInc/yield-paper

Folders and files

Latest commit

History

Repository files navigation

Graph neural networks for efficient yield prediction of chemical reactions

About

Installation

To reproduce

Authors

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages