Skip to content

Latest commit

 

History

History
121 lines (99 loc) · 4.53 KB

README.md

File metadata and controls

121 lines (99 loc) · 4.53 KB

MolNexTR

This is the official code of the following paper, "MolNexTR: A Generalized Deep Learning Model for Molecular Image Recognition".

Highlights

In this work, We propose MolNexTR, a novel graph generation model. The model follows the encoder-decoder architecture, takes three-channel molecular images as input, outputs molecular graph structure prediction, and can be easily converted to SMILES. We aim to enhance the robustness and generalization of the molecular structure recognition model by enhancing the feature extraction ability of the model and the augmentation strategy, to deal with any molecular images that may appear in the real literature.

visualization

Overview of our MolNexTR model.

Using the code and the model

Using the code

Clone the following repositories:

git clone https://github.com/CYF2000127/MolNexTR

Example usage of the model

  1. Install requirements
pip install -r requirements.txt
  1. Download our model checkpoint from Our Hugging Face Repo and put in your own path

  2. Run the following code to predict molecular images:

import torch
from MolNexTR import molnextr
IMAGE = './examples/1.png'
MODEL = './checkpoints/molnextr_best.pth'
device = torch.device('cpu')
model = molnextr(MODEL, device)
predictions = model.predict_final_results(IMAGE, return_atoms_bonds=True)
print(predictions)

or use prediction.ipynb. You can also change the image and model path to your own images and models.

The input is a molecular image visualization

Example input molecular image.
The output dictionary includes the atom sets, bond sets, predicted MolFile, and predicted SMILES:
{
    'atom_sets':  [
                  {'atom_number': '0', 'symbol': 'Ph', 'coords': (0.143, 0.349)},
                  {'atom_number': '1', 'symbol': 'C', 'coords': (0.286, 0.413)},
                  {'atom_number': '2', 'symbol': 'C', 'coords': (0.429, 0.349)}, ... 
                  ],
    'bonds_sets': [
                  {'atom_number': '0', 'bond_type': 'single', 'endpoints': (0, 1)},
                  {'atom_number': '1', 'bond_type': 'double', 'endpoints': (1, 2)}, 
                  {'atom_number': '1', 'bond_type': 'single', 'endpoints': (1, 5)}, 
                  {'atom_number': '2', 'bond_type': 'single', 'endpoints': (2, 3)}, ...
                  ],
    'predicted_molfile': '2D\n\n 11 12  0  0  0  0  0  0  0  0999 V2000 ...',
    'predicted_smiles': 'COC1CCCc2oc(-c3ccccc3)cc21'
}   

Experiments

Requirement

pip install -r requirements.txt

Data preparation

For training and inference, please download the following datasets to your own path.

Training datasets

  1. Synthetic: PubChem
  2. Realistic: USPTO

Testing datasets

  1. Synthetic: Indigo, ChemDraw
  2. Realistic: CLEF, UOB, USPTO, JPO, Staker, ACS
  3. Perturbed by IMG transform: CLEF, UOB, USPTO, JPO, Staker, ACS
  4. Perturbed by curved arrows: CLEF, UOB, USPTO, JPO, Staker, ACS

Train

Run the following command:

sh ./exps/train.sh

The default batch size was set to 256. And it takes about 20 hours to train with 10 NVIDIA RTX 3090 GPUs.

Inference

Run the following command:

sh ./exps/eval.sh

The default batch size was set to 32 with a single NVIDIA RTX 3090 GPU. The outputs include the main metrics we used, such as SMILES exact matching accuracy and Graph exact matching accuracy.

Prediction

Run the following command:

python prediction.py --model_path your_model_path --image_path your_image_path

Visualization

Use visualization.ipynb to visualize the ground truths and the predictions.

We also show some qualitative results of our method below:

visualization

Qualitative results of our method on ACS.

visualization Qualitative results of our method on some hand-drawn molecular images.