Prepare the pickle file using a subset of PDB files? #11

BinhongLiu · 2023-03-03T13:25:49Z

Hi!
Could I prepare the pickle file using a subset of PDB files? And so that I could search for a functional site against the subset of the PDB database. I tried the embed_pdb_dataset.py script, but only the LMDB format file was produced. Thanks!

The text was updated successfully, but these errors were encountered:

BinhongLiu · 2023-03-27T10:56:30Z

Hi,
Sorry to bother you again. I'd like to prepare a pickle file containing residues embedding database with data from a pocket database (http://bioinfo-pharma.u-strasbg.fr/scPDB/) and then annotate my protein structures with this pocket database using annotate_pdb.py. It seems to be that I need to prepare the pickle file and background_stats.tar.gz, right?
Would you help me with this? Thanks

awfderry · 2023-04-10T22:08:47Z

Hi, you can use the script lmdb_to_pkl.py to convert the LMDB format to pickle format. For large datasets, it may help to process this in multiple splits and combine the resulting pickle files (using the --split_id and --num_splits arguments).

awfderry · 2023-04-10T22:13:13Z

For a custom database such as scPDB, you can create the dataset using scripts/functional_database.py (you may have to update the dataset class in line 23 from SiteDataset to accommodate your specific data format. You can either use the pre-computed background embeddings from PDB100 (recommended as a starting point) or you can compute your own background distributions using a dataset of PDB files using scripts/compute_background.py.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Prepare the pickle file using a subset of PDB files? #11

Prepare the pickle file using a subset of PDB files? #11

BinhongLiu commented Mar 3, 2023

BinhongLiu commented Mar 27, 2023 •

edited

Loading

awfderry commented Apr 10, 2023

awfderry commented Apr 10, 2023

Prepare the pickle file using a subset of PDB files? #11

Prepare the pickle file using a subset of PDB files? #11

Comments

BinhongLiu commented Mar 3, 2023

BinhongLiu commented Mar 27, 2023 • edited Loading

awfderry commented Apr 10, 2023

awfderry commented Apr 10, 2023

BinhongLiu commented Mar 27, 2023 •

edited

Loading