diff --git a/data/combinatorial_library/README.md b/data/combinatorial_library/README.md index c5f6198a..c0912c8e 100644 --- a/data/combinatorial_library/README.md +++ b/data/combinatorial_library/README.md @@ -15,7 +15,6 @@ In order to run the analysis notebooks, please download this dataset to this fol Data extracted from `combinatorial_library_deduplicated.json`, performed in `notebooks/4_1_combinatorial_library_data_preparation.ipynb` at https://github.com/volkamerlab/KinFragLib. -- `metadata.json`: Bond and fragment IDs for each recombined ligand - `n_atoms.csv`: Number of atoms for each recombined ligand - `ro5.csv`: Number of ligands that fulfill Lipinski's rule of five (Ro5) and its individual criteria; number of ligands in total - `subpockets.csv`: Number of ligands per subpocket combination @@ -23,3 +22,4 @@ Data extracted from `combinatorial_library_deduplicated.json`, performed in `not - `original_substructure.json`: Ligands with substructure matches in original ligands, i.e. KLIFS ligands that were used for the fragmentation - `chembl_exact.json`: Ligands with exact matches in ChEMBL - `chembl_most_similar.json`: Most similar ligand in ChEMBL for each recombined ligand +- `chembl_highly_similar.json`: Most similar ligand in ChEMBL for each recombined ligand with similarity greater than 0.9.