HyPhy message board
http://www.hyphy.org/cgi-bin/hyphy_forums/YaBB.pl
Methodology Questions >> How to >> positive selection test for organism with GC->AT mutation bias
http://www.hyphy.org/cgi-bin/hyphy_forums/YaBB.pl?num=1379344200

Message started by Pär on Sep 16th, 2013 at 8:10am

Title: positive selection test for organism with GC->AT mutation bias
Post by Pär on Sep 16th, 2013 at 8:10am
I'd like to test the possibility of positive selection at the tips of a tree for an organism with is a strong GC->AT mutation bias. I'm afraid, however, that this situation might represent a violation of the model assumptions so that I could run a risk of obtaining a false positive result. I was wondering if anyone knows how robust currently used models/methods are to this situation.

Title: Re: positive selection test for organism with GC->AT mutation bias
Post by noob on Sep 16th, 2013 at 10:33am
Hi,

Are all organisms in your phylogeny evolving under a GC->AT bias? If yes, then this should be alright, since your underlying nucleotide substitution matrix can accommodate this with its equilibrium frequencies. If the GC->AT bias emerges at some point in your tree, then this might be cause for concern, and you may need to set up a custom model to account for this.

Title: Re: positive selection test for organism with GC->AT mutation bias
Post by Pär on Sep 17th, 2013 at 1:04am
Thanks noob for your reply!

Yes, I believe that all organisms - which are extremely closely related - are likely accumulating A+T nucleotides. The organisms differ at the tips by only one or a couple of SNPs from each other in their entire genomes. So those are the mutations that I would like to look at.

Sorry for being slow, but if there is an overall and pronounced accumulation of A+T nucleotides (in an organism already rich in A+T), is this not a violation of the equlibrium (stationary?) nucleotide frequency assumption? But maybe it does not matter?

Title: Re: positive selection test for organism with GC->AT mutation bias
Post by noob on Sep 17th, 2013 at 9:34am
Hi again,

The way the standard nucleotide models obtain an instantaneous substitution rate matrix is by multiplying each column of a symmetric exchangability matrix by the equilibrium frequency for that column. This means that if, at equilibrium, A was 4 times more frequent than C, then the C->A entry in the instantaneous substitution rate matrix would be 4 times higher than the A->C one. Whether this setup can adequately capture the details of your process depends on how far from equilibrium it is. You note that the organism is already AT enriched, which suggests that it might be at equilibrium, or at least might be well approximated locally (you mentioned low divergence) by a process at equilibrium.


Title: Re: positive selection test for organism with GC->AT mutation bias
Post by konrad on Sep 17th, 2013 at 4:32pm
Agreed with noob. In principle if the process is away from equilibrium at the root of the tree (but how would you know?) this could have an effect that would be fixable by adding branches that are not AT enriched, so as to allow inference of the root state of the AT-rich subtree, and inferring frequency parameters separately for the subtree (ML inference rather than taking the usual empirical shortcut).

At the moment we don't have code for doing this, and I expect it won't make a noticeable difference. Also it would require having homologous non-AT-rich sequences and a willingness to assume you know when the mutation bias kicked in. I'd ignore it in practice but yes, it is a potential caveat in principle.

Title: Re: positive selection test for organism with GC->AT mutation bias
Post by Pär on Sep 18th, 2013 at 12:14am
Thanks Konrad and noob for your input.

HyPhy message board » Powered by YaBB 2.5.2!
YaBB Forum Software © 2000-2024. All Rights Reserved.