LEP-AD: Language Embedding of Proteins and Attention to Drugs Predicts Drug Target Interactions

Explore our research paper: LEP-AD on BioRxiv

Welcome to our GitHub repository dedicated to our paper titled "LEP-AD: Language Embedding of Proteins and Attention to Drugs Predicts Drug Target Interactions." This research work delves into the repurposing of ESM Pretrained Models for Drug-Target Interaction (DTI) and was presented at the Machine Learning for Drug Discovery workshop (MLDD) during ICLR'23.

Setup ESM-2 Repository

Begin by cloning the ESM-2 repository:

git clone https://github.com/facebookresearch/esm.git

After cloning, navigate to the esm directory. Here, you'll need to create a directory for data storage:

mkdir data

Next, download the required datasets from the provided link and ensure they are stored in the data directory you just created: Download Data

Environment Setup

For optimal performance, it's recommended to utilize CUDA 11.4. To set up the ESM environment, execute the following commands:

conda env create -f environment.yml

conda activate esm2

Protein Representation with ESM

To derive protein representations from ESM, utilize the provided notebook. This will help in extracting unique proteins and making inferences using the ESM-2 model:

Execute the data_protein_esm.ipynb notebook to generate protein representations from ESM-2.

LEP-AD for Drug-Target Interaction

With the protein representations from ESM in place, you're set to use LEP-AD for Drug-Target Interaction. To ensure there's no interference with the previous environment, we'll establish a new one:

conda env create -f environment_LEP_AD.yml

conda activate LEP-AD

Automated Setup Script

To reproduce the results for each dataset, run the LEP-AD.ipynb notebook. Alternatively, the following command line can be executed:

chmod +x setup_and_run.sh

./setup_and_run.sh

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
data		data
LEP-AD.ipynb		LEP-AD.ipynb
LEP-AD.py		LEP-AD.py
README.md		README.md
data_protein_esm.ipynb		data_protein_esm.ipynb
environment.yml		environment.yml
environment_LEP_AD.yml		environment_LEP_AD.yml
esm.py		esm.py
hubconf.py		hubconf.py
model.py		model.py
preprocess_data.py		preprocess_data.py
pyproject.toml		pyproject.toml
setup.py		setup.py
setup_and_run.sh		setup_and_run.sh
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LEP-AD: Language Embedding of Proteins and Attention to Drugs Predicts Drug Target Interactions

Table of Contents

Setup ESM-2 Repository

Environment Setup

Protein Representation with ESM

LEP-AD for Drug-Target Interaction

Automated Setup Script

About

Releases

Packages

Languages

adaga06/LEP-AD

Folders and files

Latest commit

History

Repository files navigation

LEP-AD: Language Embedding of Proteins and Attention to Drugs Predicts Drug Target Interactions

Table of Contents

Setup ESM-2 Repository

Environment Setup

Protein Representation with ESM

LEP-AD for Drug-Target Interaction

Automated Setup Script

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages