-
Notifications
You must be signed in to change notification settings - Fork 17
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error when running CountPeaks #48
Comments
Hi @idupanloup, Apologies for the slow response. This error occurs when there are no UMI counts identified in your data for the set of provided peaks and a 'NULL' matrix is returned. As you are only inputting 7 peaks, based on your previous post, I'd say that one of your samples simply has no coverage over these peak coordinates, which is why the error is being returned. Cheers, |
Hello, sorry to piggyback an older ticket but I have a similar error occurring:
The BAM file used is the extracted chromosome 5 of a bigger BAM I was hoping to use, I thought this would exclude a memory issue. Also the appropriate flags should be present, as the original BAM file was generated with this STARsolo command:
Is there any other possible reason this error occurs? Thank you very much in advance! |
Hi, @rbarbieri86, Maybe your whitelist is wrong. Although you have called out many peak sites, but none of their barcodes can match the whitelist so you get an empty peak x cell matirx after running CountPeaks function. |
Hi Bridream, Ok, that would be weird for the Sierra whitelist as I used the barcodes.tsv files as indicated. However I could double check the STARsolo whitelist as that one was downloaded following the manual's indications if I remember correctly. I will look into that and come back to you |
Hi Bridream, I have indeed used the wrong whitelist when aligning with STARsolo, substituted with the correct one (3M-february-2018.txt.gz).
I am working on publicly available data which I could analyze with another tool (MAAPER). Any idea of why this is still happening? |
Do you find the whitelist somewhere online? Maybe you can try to generate your whitelist using your raw data, so in this way your whitelist must be right. If you are working on publicly available data, you cac just extract the barcodes whitelist from the processed data, for example, the gene-by-cell matrix. |
The whitelist is provided on the 10X website and also linked in the STARsolo guide on GitHub. I can try using the barcodes.tsv as whitelists after removing the "-1" at the end I think. I have also noticed some discrepancy between the data and their description too. Apparently the data is labeled 10X 3'v3 version but the barcodes are shorter (26 instead of 28), which seems to indicate a v2 kit. However I was using the v2 whitelist beforehand and still had the same issue. Looking into STARsolo logs, it seems there is indeed a problem there as a lot of cells do not have a valid barcode. I will try a few runs more with different parameters and get back to you. Thanks for your assistance by the way. |
Hello, just thought of giving an update. There were indeed issues with the STARsolo step as the Fastq files used were partially corrupted. At the moment, after a few re-downloads and confirming that the kit used was v2 I have tried once more to run CountPeaks which resulted in the same error as above. I am relatively sure that the input should be fine now as the STARsolo logs seem OK, so I will try my luck with MAAPER once more. |
I get an error when using the CountPeaks function for one of my samples:
Do you know what can be the problem ? (i saw a similar issue posted by another user, but did not find the solution)
Thanks !
The text was updated successfully, but these errors were encountered: