Welcome, Guest. Please Login
YaBB - Yet another Bulletin Board
 
  HomeHelpSearchLogin  
 
Model comparison issue (Read 2810 times)
Miguel Lacerda
YaBB Newbies
*
Offline


Feed your monkey!

Posts: 36
Natl Univ of Ireland, Galway
Gender: male
Model comparison issue
Nov 6th, 2010 at 6:51am
 
Hi Sergei,

I am trying to compare to simple models but am getting some strange results. Was wondering if you could suggest why...

The null model is a standard GY94 model - all parameters except omega are estimated once-off over the entire alignment and omega is optimised separately at each site, holding the other parameters fixed. The alternative model defines omega = omega1*f[i] + omega2*(1-f[i]), where f[i] is a weight specific to branch i, and optimises omega1 and omega2 separately at each site. For each site, I compute the null and alternative likelihoods and perform a LRT.

I would expect omega (under the null model) to lie somewhere in between omega1 and omega2 (under the alt model); however, I am not getting this. I typically get omega < omega1 < omega2. (e.g. site 9) Surely a larger omega would then produce a better likelihood under the null model? I've tried larger starting values for omega, but seem to always end up with the same estimate.

I also get some sites where the likelihood actually gets worse under the alt model! At site 45, for example, I start the optimisation with omega1=omega2=0.7515262 (the value of omega under the null model) which has a likelihood of -88.87692. After optimising the alt model, I get omega1=408.816, omega2=2596.717 and a likelihood of -1e+100. 

Any suggestions? I've attached my code and data.

Thanks!
Miguel


Back to top
 
Multimedia File Viewing and Clickable Links are available for Registered Members only!!  You need to Login Login (113 KB | )
 
IP Logged
 
Sergei
YaBB Administrator
*****
Offline


Datamonkeys are forever...

Posts: 1658
UCSD
Gender: male
Re: Model comparison issue
Reply #1 - Nov 7th, 2010 at 9:51pm
 
Hi Miguel,

There seems to be an issue with negative values in the rate matrix. If you look at the 'f' vector in the alt.bf example, some values exceed one. However, these values are then used as convex combination coefficients with omega1 and omega2. For example, in model q358 there are (1-2.x)*omega2 values in the rate matrix:

Code:
Q384={61,61};
Q384[0][1]:=(2.68515*omega1+(1-2.68515)*omega2)*t;
 



Hyphy will silently enforce the zero lower bound on most parameter values, but it does not check the lower bound condition for rate matrices. Consequently, the odd results may be due to poorly formed rate matrices.

Sergei
Back to top
« Last Edit: Nov 8th, 2010 at 6:18am by Sergei »  

Associate Professor
Division of Infectious Diseases
Division of Biomedical Informatics
School of Medicine
University of California San Diego
WWW WWW  
IP Logged
 
Miguel Lacerda
YaBB Newbies
*
Offline


Feed your monkey!

Posts: 36
Natl Univ of Ireland, Galway
Gender: male
Re: Model comparison issue
Reply #2 - Nov 9th, 2010 at 12:05am
 
Thanks Sergei! I should have picked that up Smiley
Back to top
 
 
IP Logged