HyPhy message board
http://www.hyphy.org/cgi-bin/hyphy_forums/YaBB.pl
Datamonkey Server >> Datamonkey feedback >> SBP and GARD
http://www.hyphy.org/cgi-bin/hyphy_forums/YaBB.pl?num=1335809036

Message started by Ana on Apr 30th, 2012 at 11:03am

Title: SBP and GARD
Post by Ana on Apr 30th, 2012 at 11:03am
Hi,

A couple of naive questions…

I’m trying to run SBP and/or GARD on a huge dataset of two paralogous genes (34 sequences, 4905bp) and due to its size I’m always getting issues with the analysis being stopped before convergence, because the CPU time limit per job. I don’t want to reduce the number of sequences, so is it correct to cut in half the size of the sequences and run each independently?

The other question is, when performing positive selection analyses on each gene if I run first GARD, will the subsequent output of REL/SLAC/FEL/MEME be already corrected? I am assuming they will be. I also think most of the breakpoints that I obtain result from rate variations and not from recombination. The only way of checking this is analyzing the resultant trees from the fragments defined by the breakpoints, right?

I hope this makes any sense…
Thank you!
Cheers

Title: Re: SBP and GARD
Post by Sergei on Apr 30th, 2012 at 1:56pm
Hi Ana,

Regarding the large GARD analysis, there are several things you can do.

1). If you send me the alignment, I could run it through GARD on the cluster without a time limit.
2). You could use the preliminary GARD results (I assume it is showing some evidence of recombination), to partition the alignment by sites, i.e. if GARD finds 1 breakpoint, you can independently run the section left of it, and then the one right of it.
3). You could randomly split your data set by sequences (e.g. 1/2 or 1/4 of sequences) and run all of them through to see if the results are consistent between splits. If they are not, then the likely cause is that one or more of the sequences present in one part of the data set but not in others are driving recombination signal.

In terms of correcting selection analyses, you should select 'GARD Trees' from the Use this tree set option list. You could see whether recombination is affecting results by comparing the results of a single partition analysis, vs the GARD-inferred set analysis. Take a look at the first exercise in Multimedia File Viewing and Clickable Links are available for Registered Members only!!  You need to Login Login for an example.

Sergei

Title: Re: SBP and GARD
Post by Ana on May 3rd, 2012 at 9:32am
Hi Sergei,

Thank you so much for your reply.

Regarding the large data set to GARD analysis, I followed the suggestion of randomly splitting the sequences, but in fact the results are not consistent between analysis and they still don't run till the end.
So, if you don't mind I would like to know if it's possible to try your first suggestion and send you the alignment. Please, just let me know.

Thank you so much!
Best

ana

Title: Re: SBP and GARD
Post by Sergei on May 3rd, 2012 at 9:43am
Hi Ana,

Sure, send me the alignment. spond at ucsd dot edu

Sergei

HyPhy message board » Powered by YaBB 2.5.2!
YaBB Forum Software © 2000-2024. All Rights Reserved.