-
Notifications
You must be signed in to change notification settings - Fork 92
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
positive-containing DFE from Zhen et al 2021 (GammaPos_Z21, HomSap and DroMel) #1656
base: main
Are you sure you want to change the base?
Conversation
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## main #1656 +/- ##
=======================================
Coverage 99.84% 99.84%
=======================================
Files 132 132
Lines 4584 4616 +32
Branches 467 467
=======================================
+ Hits 4577 4609 +32
Misses 3 3
Partials 4 4 ☔ View full report in Codecov by Sentry. |
@@ -36,20 +36,83 @@ def _HuberDFE(): | |||
distribution_type="g", # gamma distribution | |||
distribution_args=[gamma_mean, gamma_shape], | |||
) | |||
# p. 2 in supplement says that the total sequence length of synonymous sites LS | |||
# related to the total sequence length LNS by LNS = 2.85 * LS; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
# related to the total sequence length LNS by LNS = 2.85 * LS; | |
# related to the total sequence length of nonsynonymous sites LNS by LNS = 2.85 * LS; |
distribution_args=[gamma_mean, gamma_shape], | ||
) | ||
# p. 2 in supplement says that the total sequence length of synonymous sites LS | ||
# related to the total sequence length LNS by LNS = 2.85 * LS |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
# related to the total sequence length LNS by LNS = 2.85 * LS | |
# related to the total sequence length of nonsynonymous sites LNS by LNS = 2.85 * LS |
distribution_args=[gamma_mean, gamma_shape], | ||
) | ||
# p. 2 in supplement says that the total sequence length of synonymous sites LS | ||
# related to the total sequence length LNS by LNS = 2.31 * LS |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
# related to the total sequence length LNS by LNS = 2.31 * LS | |
# related to the total sequence length of nonsynonymous sites LNS by LNS = 2.31 * LS |
This is as recommended by @klohmueller over in #1469. It's really just Huber et al's Gamma_H17 except a proportion
p
of the changes are beneficial with coefficients
; the paper shows thatp
ands
are highly confounded (andps
can be estimated better); but these are the MLE values. These values are using divergence to chimp.Kirk said:
What I've implemented is this, and I've verified all those parameters above - so, I figure this is actually mostly QC'ed (since Kirk pulled out those numbers and I verified them). So, I'm tempted to just put in the QC code also - but that wouldn't catch coding errors.
Edit: I've also added the estimated one for DroMelfrom Table S6 as above.