MCB 5472 : Inferring Phylogenies (I)

Please send your answers per email to gogarten@uconn.edu, or hand in a hardcopy

Please let me know, how far you got during the lab. If most students didn't finish, we may continue this next week!

 

1) ML on protein sequences using PhyML

a) For a dataset of your choice (or here or here (for the latter, the phylip formated file to be used for tree-puzzle is here) -NOTE: your sequences need to be aligned before you analyze them) use PhyML (on bbcsrv3, after qlogin, enter "phyml" at the command line) and calculate the tree with the highest likelihood using a model for Among Site Rate Variation (ASRV) that has a proportion of invariant site estimated from the data, and that describes the remaining sites with 4 rate categories that are a discrete approximation of a continuous Gamma distribution whose shape parameter is estimated from the data.
Repeat the analysis using a model that does not include invariant sites. Perform a maximum likelihood ratio test (LRT) to determine if the more complex model (the one with an estimated percent invariant sites) leads to a significant improvement in likelihood.


Notes:

2) Using the same dataset and the same model in TREE-PUZZLE

Invoke TREE-PUZZLE from the command line by typing "puzzle"

Use the tree from (1) as usertree (option k). Take you time in selecting the correct model ! (four rate categories plus invariant sites).

 

2B) Strict Molecular Clock

Repeat the analyses from above, but estimate if a strict molecular clock is compatible with the data (option z).

Select the pinvar and alpha from the previous analysis.

For a large data set, this might take some time (about 20 minutes for archaea_euk.phy). If you want control over the commanline, you can send the process (running puzzle) into background.To do so, stop the process in foreground by pressing down <ctrl> and <z> simultaneously. Then restart the process in background by typing
bg %1 ***

 

======================== END Assignment================

Work on your student project !