HyPhy message board | |
http://www.hyphy.org/cgi-bin/hyphy_forums/YaBB.pl
Methodology Questions >> How to >> very simple statistic, but I'm not sure how/where! http://www.hyphy.org/cgi-bin/hyphy_forums/YaBB.pl?num=1319836146 Message started by rbjmax on Oct 28th, 2011 at 2:09pm |
Title: very simple statistic, but I'm not sure how/where! Post by rbjmax on Oct 28th, 2011 at 2:09pm
I need to provide a colleague with the maximum % nucleotide and amino acid diversity in my set of sequences (overall and within subpopulations). everywhere I look I see mean measures! and I feel goofy for not being able to find this!
thanks in advance, Rachel |
Title: Re: very simple statistic, but I'm not sure how/where! Post by Sergei on Oct 31st, 2011 at 7:22am
Hi Rachel,
F_ST (a standard under Compartmentalization) is probably your best bet. This will compute mean pairwise diversity (no phylogenetic correction). Sergei |
Title: Re: very simple statistic, but I'm not sure how/where! Post by rbjmax on Nov 1st, 2011 at 2:27pm
thanks, Sergei. I'm after maximum measures of diversity, not mean, in order to compare to past published data. at this point I'm considering just a crude pairwise matrix, but the number of taxa involved makes me cringe with sore eyes just to imagine it. I'll keep looking.
Rachel |
Title: Re: very simple statistic, but I'm not sure how/where! Post by Sergei on Nov 1st, 2011 at 7:35pm
Hi Rachel,
If your sequences are aligned, I have a really fast command line utility that can compute pairwise TN93 distances on 50,000 sequences in <5 minutes and output the maximum as well as all distances above a fixed threshold. Let me know if you are interested. Sergei |
Title: Re: very simple statistic, but I'm not sure how/where! Post by rbjmax on Nov 2nd, 2011 at 10:32am
that would be super, Sergei! thank you!
Rachel |
Title: Re: very simple statistic, but I'm not sure how/where! Post by Sergei on Nov 3rd, 2011 at 3:21pm
Hi Rachel,
Here you go. Do the usual Linux source install: ./configure make make install You should now have the TN93dist binary which takes a .fasta file and a number of arguments (hopefully the help line is somewhat meaningful) and runs the pairwise distance calculation. Maximum distance is spooled to the console; only those distances exceeding the threshold you put on the command line are output to a file. Sergei http://www.hyphy.org/cgi-bin/hyphy_forums/YaBB.pl?action=downloadfile;file=tn93dist-1_00_tar.gz (87 KB | )
|
Title: Re: very simple statistic, but I'm not sure how/where! Post by Sergei on Nov 8th, 2011 at 3:58pm |
HyPhy message board » Powered by YaBB 2.5.2! YaBB Forum Software © 2000-2024. All Rights Reserved. |