Codon Freqs Models corresp. btw HYPHY and PAML (Read 2746 times)
avilella
Codon Freqs Models corresp. btw HYPHY and PAML
Aug 4th, 2005 at 8:20am
 
Dear all,

I'm trying to compare the per-branch dN and dS estimates I get from the
same data with the local (free-ratios) model in HYPHY and PAML, under the
different codon frequency models available in each program. I
have tried:

HYPHY
-----
MG94CUSTOM(HKY85)
GY94
GY94W9
MG94
MG94W9

PAML
----
CodonFreq = 0 # 1/61 each
CodonFreq = 1 # F1x4
CodonFreq = 2 # F3x4
CodonFreq = 3 # Fcodon
CodonFreq = 4 # F1x4MG
CodonFreq = 5 # F3x4MG

I would like to confirm which models in HYPHY correspond to which
ones in PAML. For example, are these models comparable?

GY94   = F1x4 
GY94W9 = F3x4 
MG94   = F1x4MG
MG94W9 = F3x4MG

Which would be similar to Fcodon? And "1/61 each"?

I understand that the same model under both programs should give me
the same or similar results on the same dataset, while different
models should give results that differ more between the two
programs.

Right now, it seems that the results of F3x4MG are much more similar
to the results of MG94CUSTOM, rather than to those of MG94W9...

Looking forward to your reply,

Best,

    Albert.
 
Sergei
Re: Codon Freqs Models corresp. btw HYPHY and PAML
Reply #1 - Aug 4th, 2005 at 10:54am
 
Dear Albert,

In Ziheng's PAML notation:

MG94CUSTOM uses the F3x4 estimator (see this paper for complete details: http://www.hyphy.org/sergei/2gamma.pdf), as do MG94W9 and GY94W9.

MG94 and GY94 use F1x4.
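
To make the difference between the two estimators concrete, here is a minimal Python sketch. It assumes the universal genetic code and the simple product-of-nucleotide-frequencies definition (renormalized over the 61 sense codons); the function names are hypothetical, and this is not the code either program actually runs.

```python
from collections import Counter
from itertools import product

NUCS = "ACGT"
STOPS = {"TAA", "TAG", "TGA"}  # assumption: universal genetic code
SENSE = ["".join(c) for c in product(NUCS, repeat=3)
         if "".join(c) not in STOPS]


def _normalize(raw):
    """Rescale the 61 sense-codon products so they sum to 1."""
    z = sum(raw.values())
    return {codon: value / z for codon, value in raw.items()}


def f1x4(codons):
    """One set of 4 nucleotide frequencies pooled over all positions."""
    counts = Counter(n for c in codons for n in c)
    total = sum(counts.values())
    f = {n: counts[n] / total for n in NUCS}
    return _normalize({c: f[c[0]] * f[c[1]] * f[c[2]] for c in SENSE})


def f3x4(codons):
    """Three sets of 4 nucleotide frequencies, one per codon position."""
    pos = []
    for i in range(3):
        counts = Counter(c[i] for c in codons)
        total = sum(counts.values())
        pos.append({n: counts[n] / total for n in NUCS})
    return _normalize(
        {c: pos[0][c[0]] * pos[1][c[1]] * pos[2][c[2]] for c in SENSE}
    )


if __name__ == "__main__":
    sample = ["ATG", "GCC", "AAA", "TTT", "GCA", "ATG"]  # toy data
    print(f1x4(sample)["ATG"], f3x4(sample)["ATG"])
```

F1x4 needs 3 free parameters and F3x4 needs 9, which is where the "W9" in MG94W9 and GY94W9 comes from.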

Fcodon and 1/61 each are not implemented. Fcodon will almost surely bias the estimation for small data sets, where some codons will have very small counts, while for large data sets Fcodon and F3x4 tend to yield similar estimates; 1/61 each is a very unrealistic model. If you are interested in having them, I could easily add them, at least for the GY94 models (the rate matrix parameterization makes it trickier for MG94). Let me know if you want these options for your comparisons, and I'll post a file which implements them.

You are right that F3x4MG is similar to MG94CUSTOM (and not to MG94W9): PAML always corrects for transition/transversion biases (as does MG94customx010010), whereas MG94, as originally published, did not.
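
For bookkeeping when running the comparisons, the correspondence described in this reply can be written down as a small lookup table. This is only a sketch of what is stated above; the dictionary and helper names are hypothetical and not part of HyPhy or PAML.

```python
# Frequency estimator (in PAML notation) used by each HyPhy model,
# as described in this reply.
HYPHY_ESTIMATOR = {
    "MG94CUSTOM": "F3x4",
    "MG94W9": "F3x4",
    "GY94W9": "F3x4",
    "MG94": "F1x4",
    "GY94": "F1x4",
}

# PAML options with no HyPhy counterpart at the time of this thread.
NOT_IMPLEMENTED_IN_HYPHY = ("Fcodon", "1/61 each")


def hyphy_models_for(estimator):
    """Return the HyPhy models that use the given frequency estimator."""
    return sorted(m for m, e in HYPHY_ESTIMATOR.items() if e == estimator)


if __name__ == "__main__":
    for est in ("F1x4", "F3x4"):
        print(est, "->", ", ".join(hyphy_models_for(est)))
```

The table records only the frequency estimator. It does not capture the transition/transversion correction, which is why F3x4MG tracks MG94CUSTOM more closely than MG94W9 even though both use F3x4.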

HyPhy and PAML also differ in how they treat gaps and ambiguous characters, which could contribute to small differences in log likelihoods.

HTH,
Sergei

Associate Professor
Division of Infectious Diseases
Division of Biomedical Informatics
School of Medicine
University of California San Diego