-
Notifications
You must be signed in to change notification settings - Fork 32
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
partition_into_regions returns empty region for all contigs in header #1200
Comments
Disabling doesn't break any tests as it was |
Possible explanation for #1169 ? |
Not sure. It seems it can be removed though. |
Digging deeper, it's not quite as simple as this. Turn out that all three of CSI indexed BCF, CSI indexed VCF and tabix indexed VCF need to be treated slightly differently in these cases in which there are multiple contigs defined in the header 🤮 |
Closing in favour of #1202 (more precisely specified) |
This line adds an empty region for every contig that's in the VCFs header in
partition_into_regions
. However, it's common for VCFs to have contigs declared in the header that are not in the file at all. For example, recent 1000 Genomes data declares all contigs in all VCF files (there are thousands). This generates a lot of noise in the returned region strings.What was the rationale for returning the empty regions @tomwhite?
The text was updated successfully, but these errors were encountered: