You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I've been trying to generate pairwise relatedness plots using ANGSD as described in the bioarxiv paper, but I am encountering problems in that my values display the same general distribution as in figure S4, but the values deviate substantially. Roughly, R0 = 0 - 3; R1 = 0.1 - 0.4; King = -0.6 - 0.125.
This is a non-model organism, and we built a de novo reference genome. I used a SNP set that we made from only high depth tissue samples and added some low depth individuals in this run, but based the run on the preexisting SNP set. In general I am trying to be pretty strict with the filtration here, and I've tried I've tried playing around with different settings to no avail.
Do you have any advice? Here is my code below.
Generate Genotype likelihoods for each scaffold for 13 individuals
Glad you are using the program. thanks for including a detailed accounting of the commands you ran.
My first guess (not knowing anything about the species or sampling scheme), would be that you have samples from different populations, these pairs would appear as less related (higher R0, lower kinship, lower R1) than a pair of samples from the same population. Does the pattern here make sense in that light? Differential admixture into the two samples can also affect these statistics, often affecting the stats in the same direction
If that doesn't seem to make sense, a few ideas:
you can try removing the -C flag. From my understanding this is helpful when aligning to a distantly related genome, but can be problematic if you are aligning to a closely related one, such as here.
you can try the SFS based method, using the alleles in the reference. This should work as long as you expect this reference allele to be segregating at each site.
In the data I tested on, an upper depth filter didn't seem to matter, but it might here. maybe try -setMaxDepth
Do you see a bias / difference based on depth of the sample?
Hi rwaples,
I've been trying to generate pairwise relatedness plots using ANGSD as described in the bioarxiv paper, but I am encountering problems in that my values display the same general distribution as in figure S4, but the values deviate substantially. Roughly, R0 = 0 - 3; R1 = 0.1 - 0.4; King = -0.6 - 0.125.
This is a non-model organism, and we built a de novo reference genome. I used a SNP set that we made from only high depth tissue samples and added some low depth individuals in this run, but based the run on the preexisting SNP set. In general I am trying to be pretty strict with the filtration here, and I've tried I've tried playing around with different settings to no avail.
Do you have any advice? Here is my code below.
angsd -bam BamList.txt -P 4 -ref ReferenceGenome.fna -out GLF_scaffold1 -uniqueOnly 1 -remove_bads 1 -only_proper_pairs 1 -trim 0 -C 50 -baq 1 -minQ 30 -minMapQ 30 -maxDepth 100 -GL 2 -doCounts 1 -sites SNPsVCF.txt -minind 13 -r Scaffold1 -doGlf 1 -doMajorMinor 1 -doMaf 1 -rmTriallelic 0.000002
Concatenate GLFs for each scaffold
cat *.glf.gz > complete.glf.gz
Generate ibspair file
/apps/ANGSD/0.916/misc/ibs -glf complete.glf.gz -nInd 13 -allpairs 1 -o complete
Process with read_IBS.R
Complete = 'complete.ibspair'
do_derived_stats(read_ibspair_model0(Complete))
ggplot(Complete, aes(x=R1, y=R0)) + geom_point
ggplot(Complete, aes(x=R1, y=Kin)) + geom_point
The text was updated successfully, but these errors were encountered: