HyPhy message board
http://www.hyphy.org/cgi-bin/hyphy_forums/YaBB.pl
Methodology Questions >> How to >> very simple statistic, but I'm not sure how/where!
http://www.hyphy.org/cgi-bin/hyphy_forums/YaBB.pl?num=1319836146

Message started by rbjmax on Oct 28th, 2011 at 2:09pm

Title: very simple statistic, but I'm not sure how/where!
Post by rbjmax on Oct 28th, 2011 at 2:09pm
I need to provide a colleague with the maximum % nucleotide and amino acid diversity in my set of sequences (overall and within subpopulations). everywhere I look I see mean measures! and I feel goofy for not being able to find this!

thanks in advance,
Rachel

Title: Re: very simple statistic, but I'm not sure how/where!
Post by Sergei on Oct 31st, 2011 at 7:22am
Hi Rachel,

F_ST (a standard under Compartmentalization) is probably your best bet. This will compute mean pairwise diversity (no phylogenetic correction).

Sergei

Title: Re: very simple statistic, but I'm not sure how/where!
Post by rbjmax on Nov 1st, 2011 at 2:27pm
thanks, Sergei. I'm after maximum measures of diversity, not mean, in order to compare to past published data. at this point I'm considering just a crude pairwise matrix, but the number of taxa involved makes me cringe with sore eyes just to imagine it. I'll keep looking.

Rachel

Title: Re: very simple statistic, but I'm not sure how/where!
Post by Sergei on Nov 1st, 2011 at 7:35pm
Hi Rachel,

If your sequences are aligned, I have a really fast command line utility that can compute pairwise TN93 distances on 50,000 sequences in <5 minutes and output the maximum as well as all distances above a fixed threshold. Let me know if you are interested.

Sergei

Title: Re: very simple statistic, but I'm not sure how/where!
Post by rbjmax on Nov 2nd, 2011 at 10:32am
that would be super, Sergei! thank you!

Rachel

Title: Re: very simple statistic, but I'm not sure how/where!
Post by Sergei on Nov 3rd, 2011 at 3:21pm
Hi Rachel,

Here you go. Do the usual Linux source install:

./configure
make
make install

You should now have the TN93dist binary which takes a .fasta file and a number of arguments (hopefully the help line is somewhat meaningful) and runs the pairwise distance calculation. Maximum distance is spooled to the console; only those distances exceeding the threshold you put on the command line are output to a file.

Sergei


http://www.hyphy.org/cgi-bin/hyphy_forums/YaBB.pl?action=downloadfile;file=tn93dist-1_00_tar.gz (87 KB | )

Title: Re: very simple statistic, but I'm not sure how/where!
Post by Sergei on Nov 8th, 2011 at 3:58pm
Hi Rachel,

A more documented version:

Multimedia File Viewing and Clickable Links are available for Registered Members only!!  You need to Login Login

Sergei

HyPhy message board » Powered by YaBB 2.5.2!
YaBB Forum Software © 2000-2024. All Rights Reserved.