Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

how to get the site list for indels? #92

Open
breezetown opened this issue Jan 17, 2022 · 3 comments
Open

how to get the site list for indels? #92

breezetown opened this issue Jan 17, 2022 · 3 comments

Comments

@breezetown
Copy link

Hi,we want to run bam-readcount to obtain a file of readcounts for our indels. But we don't know how to define the site list for indels, or how to get the site list from the vcf file from the same sample.

@chrisamiller
Copy link
Collaborator

Indels will be reported at the appropriate base as an additional column (for example A:xxxx C:xxxx G:xxxx T:xxxx +A:xxxx). it's straightforward to add these. If you want to add readcounts to your VCF directly, consider looking at the scripts in VAtools as outlined here: https://pvactools.readthedocs.io/en/latest/pvacseq/input_file_prep/readcounts.html

Hope this helps!

@breezetown
Copy link
Author

Thanks, but we can not solve this, if we have a vcf file like this:
CHROM POS REF ALT
chr1 1168012 CCTG C
chr1 1356341 TTCC T
chr1 1534913 CGCG C
chr1 1684347 CCCT C
chr1 1684347 CCCT CCCTCCT
how to get the site list from this vcf file? Because some indels are compliacted.

@chrisamiller
Copy link
Collaborator

Complex indels (like the last one) are not currently well-supported by bam-readcount. The others should be fine, though. if in doubt, pick one, run bam readcount on a small interval around the event and look for the indel to match up as as sanity check.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants