Assignments for Today
- Read through the wikipedia entries fro UPGMA and Neighbor Joining
- Due to date confusion, takehome exam 5 will only be due next Wednesday
- (Remember the reading assignment for today: Read excerpts of Chapters 5 and 6 from Li's "Molecular Evolution)
Assignments for Wednesday
- Take-home exam #5 is due
- Read the discussion below, who is right? (A Hennigian comb is a comb like phylogeny named after Willi Hennig, the father of cladistics.)
- Read through:
Closing Plaidoyers for Introns Late / Introns Early
PPT Slides for today
From:<http://dml.cmnh.org/2002Jul/msg00351.html> ----- Original Message ----- > > --+--+-----------A This _is_ a Hennigian comb, because in a cladogram, _only_ topology counts. --+--F ... what a side branch is lies completely in the hand of the presentator.
|
Intro
to phylogenetic reconstruction
|
Compilation of sequence dataset |
Alignment |
Determination of substitution model |
Tree building |
Tree evaluation |
Why phylogenetic reconstruction of molecular evolution? A) Systematic classification of organisms e.g.:
Who were the first angiosperms? (i.e. where are the first angiosperms located
relative Where in the tree of life is the last common ancestor located? B) Evolution of molecules e.g.: domain shuffling, reassignment of function, gene duplications, horizontal gene transfer, drug targets, detection of genes that drive evolution of a species/population (e.g. influenca virus, see here for more examples) C) Identification of organisms
How: 1) Obtain sequencesSequencing Databank Searches -> ncbi a) entrez, b) BLAST, c) blast of pre-release data Friends
2) Determine homology (see notes for earlier classes for practical implementation)Reminder on Definitions: 3) Align sequences
4) Reconstruct evolutionary history
(e.g.: smallest error between distance matrix and distances in tree, or use ii) algorithmic approaches (UPGMA or neighbor joining)
find that tree that explains sequence data with minimum number of substitutions (tree includes hypothesis of sequence at each of the nodes)
given a model for sequence evolution, find the tree that has the highest probability under this model. This approach can also be used to successively refine the model. Bayesian statistics use ML analyses to calculate posterior probabilities for trees, clades and evolutionary parameters. Especially MCMC approaches have become very popular in the last year, because they allow to estimate evolutionary parameters (e.g., which site in a virus protein is under positive selection), without assuming that one actually knows the "true" phylogeny.
D
- ...) Else:
5) Interpret the result.
|
Goals class 17: