You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi,
I have compiled a DIAMOND database from the current RefSeq database, but apparently the script DIAMOND_analysis_counter.py get stuck at one line.
Do you have any idea if I need to do any preprocessing of the database before starting DIAMOND_analysis_counter.py
Traceback (most recent call last):
File "samsa2/python_scripts/DIAMOND_analysis_counter.py", line 151, in <module>
if split_db_org[1] == "sp.":
IndexError: list index out of range
line 162, in <module>
db_org = split_db_org[1] + " " + split_db_org[2]
IndexError: list index out of range
Best, Michael
The text was updated successfully, but these errors were encountered:
Ah, yes, the parsing script doesn't do well when there are multiple instances of square brackets in the line. I've noticed that the majority of brackets are used in the function, rather than the organism name, so this section (lines 146-162) are parsing out the organism name by assuming that this is what's in the last set of brackets.
The issue is actually with line 147, where it's selecting 22] as the organism name, as this is what's inside the last set of brackets.
You could try running a command on your database to replace this line with one that uses parentheses instead of brackets, if this is the only database entry where you hit this error - otherwise, this may take some regex work that will be a bit tougher for me to work out. Did you find other lines causing issues?
Hi,
I have compiled a DIAMOND database from the current RefSeq database, but apparently the script DIAMOND_analysis_counter.py get stuck at one line.
Do you have any idea if I need to do any preprocessing of the database before starting DIAMOND_analysis_counter.py
Best, Michael
The text was updated successfully, but these errors were encountered: