Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DeepMased - stalled out on finding features #8

Open
TealFurnholm opened this issue Nov 12, 2020 · 12 comments
Open

DeepMased - stalled out on finding features #8

TealFurnholm opened this issue Nov 12, 2020 · 12 comments

Comments

@TealFurnholm
Copy link

Greetings,

I am trying to see if my merge-overlap.pl script creates chimeras when I combine output from different assemblers.

As you can see, nothing has happened in days:
Using TensorFlow backend.
2020-11-10 21:33:21,376 - Indexing file: MERGED_CONTIGS_NOSPLIT_VDD.fa
2020-11-10 21:33:25,745 - Cannot find MERGED_CONTIGS_NOSPLIT_VDD_sorted.bam.bai; creating...
2020-11-10 21:35:08,541 - Processing: MERGED_CONTIGS_NOSPLIT_VDD_sorted.bam
2020-11-10 21:35:10,378 - Number of contigs in the bam file: 1682485

Checking "top" - it is still running, though I wonder it is only using 1 core (100%).
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
40639 txxf 20 0 11.542g 9.143g 145724 R 100.0 1.2 2262:04 python3

There's nothing in the output either.
-rw-r--r-- 1 txxf gmb 0 Nov 10 21:35 MERGED_CONTIGS_NOSPLIT_VDD_sorted_feats.tsv

Any ideas why it is not working?
How long should this normally take?

Best,
Teal

@nick-youngblut
Copy link
Collaborator

Hi Teal. Can you post the command that you're running?

@TealFurnholm
Copy link
Author

Hi!

I ran this command: python3 -m DeepMAsED features bam_fasta_table.tab

@nick-youngblut
Copy link
Collaborator

Maybe it's an issue with tensorflow freezing/stalling? I did have some issues with that for certain versions. Which version are you running? Does running another DL model with that version of TF work with your setup? Check out this issue for some potential solutions: tensorflow/tensorflow#32017

@TealFurnholm
Copy link
Author

TealFurnholm commented Nov 13, 2020 via email

@nick-youngblut
Copy link
Collaborator

Our system admin had to do some voodoo to get deepmased to work:

You were not able to install DeepMAsED via bioconda?

@TealFurnholm
Copy link
Author

TealFurnholm commented Nov 16, 2020 via email

@TealFurnholm
Copy link
Author

TealFurnholm commented Nov 30, 2020 via email

@nick-youngblut
Copy link
Collaborator

not sure exactly what the script is running during find features, gene prediction? MetaQuast?

DeepMAsED features generates feature tables from the bam files via pysam. bam files are processing parallel if --procs is >1.

I also notice it is running only one core

Your command doesn't include --procs, so DeepMAsED features defaults to 1 core.

It still stalls out in the find features.

It may be an issue with pysam. DeepMAsED features is really just pysam to extract features from the bam files. It's therefore almost definitely something to do with pysam. You could try installing a different version of pysam or send me some example files so that I can try and reproduce the issue.

@TealFurnholm
Copy link
Author

TealFurnholm commented Dec 1, 2020 via email

@nick-youngblut
Copy link
Collaborator

The option --procs is not on the github and not in the deepmased -h command line output - you may want to update that.

The UI is a standard command-subcommand format (like git). Use DeepMAsED features -h

allow full multimap? keep 1 random?

We used default bowtie2 params for mapping. You can assess different mapping approaches on accuracy if you'd like by running the DeepMAsED-SM pipeline.

@TealFurnholm
Copy link
Author

TealFurnholm commented Dec 2, 2020 via email

@nick-youngblut
Copy link
Collaborator

I'm glad that you got DeepMAsED features to run. For many large bam files, I highly recommend using many threads to speed up the job.

OSError: Unable to open file (unable to lock file, errno = 37, error_message = 'No locks available')

This appears to be an open issue with h5py: h5py/h5py#1101

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants