Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Foldseek clustering stuck for days with errors #390

Open
YFeriel opened this issue Dec 2, 2024 · 0 comments
Open

Foldseek clustering stuck for days with errors #390

YFeriel opened this issue Dec 2, 2024 · 0 comments

Comments

@YFeriel
Copy link

YFeriel commented Dec 2, 2024

Hello,

I am using a protein structure database with Foldseek for clustering, but I encountered an issue with the following command:
foldseek cluster /data/foldseek/concat_db /data/db_clusters /data/cluster_tmp_dir

I received this error:
structurerescorediagonal /data/foldseek/concat_db /data/foldseek/concat_db /data/cluster_tmp_dir/4804289747088079168/pref /data/cluster_tmp_dir/4804289747088079168/pref_rescore1 --exact-tmscore 0 --tmsc>[=
Can not write to data file p
Can not write to data file /data/cluster_tmp_dir/4804289747088079168/pref_rescore1.29
...
Error: Rescore with hamming distance step died

Initially, I suspected it could be due to disk space or memory issues, so I added the --remove-tmp-files 1 option. While this resolved potential disk space concerns, the runtime increased significantly. The clustering process has now been running for over seven days and remains stuck at the same step:

structurerescorediagonal /data/foldseek/concat_db /data/foldseek/concat_db /data/cluster_tmp_dir/4804289747088079168/pref /data/cluster_tmp_dir/4804289747088079168/pref_rescore1 --exact-tmscore 0 --tmsc>[=

Could this error be related to disk space or memory limitations, or might it indicate a different issue? Also, are there any optimizations or alternative approaches you would recommend to reduce the runtime and avoid prolonged processing times like this?

Thank you for your help!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant