Phylogenetic Trees

Tutorial 6. Phylogenetic Trees. Measuring distance. Bottom-up algorithm (Neighbor Joining). Distance-based algorithm. Relative-distance-based algorithm.

Presentation Transcript


  1. Tutorial 6 Phylogenetic Trees

  2. Tutorial 6 Phylogenetic Trees • Measuring distance • Bottom-up algorithm (Neighbor Joining) • Distance based algorithm • Relative distance based

  3. Measuring Distance • Problem: the fraction of differing sites between unrelated sequences approaches the level expected by chance → the distance measure converges. • Jukes-Cantor
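The slide names Jukes-Cantor without showing a formula. A minimal sketch, assuming the standard Jukes-Cantor correction for nucleotides, d = -(3/4) ln(1 - (4/3) p), where p is the observed fraction of differing sites. The function names and the choice to skip gapped positions are assumptions made for illustration, not part of the tutorial.

```python
import math


def p_distance(seq_a, seq_b):
    """Observed fraction of differing sites between two aligned sequences.

    Assumption: positions where either sequence has a gap ('-') are skipped.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    pairs = [(x, y) for x, y in zip(seq_a, seq_b) if x != "-" and y != "-"]
    if not pairs:
        raise ValueError("no ungapped positions to compare")
    return sum(x != y for x, y in pairs) / len(pairs)


def jukes_cantor(p):
    """Jukes-Cantor corrected distance for an observed difference fraction p."""
    if p >= 0.75:
        # 0.75 is the difference expected by chance for four equally
        # frequent bases; the correction is undefined at or beyond it.
        return math.inf
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


p = p_distance("ACGTACGTAC", "ACGTACGTAA")
print(p, jukes_cantor(p))
```

The corrected distance is always at least as large as the observed fraction and grows without bound as p approaches 0.75, which is the saturation problem the slide describes.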

  4. Measuring Distance (cont) • Euclidean distance: given a multiple sequence alignment, calculate the square root of the sum of the scores at every position between two sequences • The score increases in proportion to the dissimilarity between residues
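The description above can be sketched as follows. The slide does not say which per-position score is used, so `mismatch_score` (0 for identical residues, 1 otherwise) is a hypothetical stand-in, and the function names are illustrative.

```python
import math


def mismatch_score(a, b):
    # Hypothetical score: 0 for identical residues, 1 for any difference.
    # A real analysis would plug in a dissimilarity score of its own.
    return 0.0 if a == b else 1.0


def euclidean_distance(seq_a, seq_b, score=mismatch_score):
    """Square root of the summed per-position scores of two aligned sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    return math.sqrt(sum(score(a, b) for a, b in zip(seq_a, seq_b)))


print(euclidean_distance("ACGT", "ACGA"))
print(euclidean_distance("AAAA", "TTTT"))
```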

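  5. Star Structure • Assumption: divergence of sequences is assumed to occur at a constant rate → the distance to the root is equal for all sequences [Figure: star diagram of a, b, c, d; rooted tree with leaves b, c, a, d and internal nodes e, f]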

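  6. Star Structure • Assumption: divergence of sequences is assumed to occur at a constant rate → the distance to the root is equal for all sequences • Unweighted Pair Group Method using Arithmetic Averages (UPGMA) [Figure: star diagram of a, b, c, d; rooted tree with leaves b, c, a, d and internal nodes e, f]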

  7. Basic Algorithm: UPGMA • Constructs a rooted tree. [Figure: distance matrix and initial star diagram of a, b, c, d]

  8. UPGMA: Selection step • Choose the nodes with the shortest distance and fuse them. [Figure: star diagram of a, b, c, d]

  9. UPGMA: Distance recalculation step • Even distance between ce and be • ea, ed by average distance from c and b [Figure: nodes c and b, the fused pair a,d as node e, and node f]
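The selection and recalculation steps can be sketched in a few lines. This is an illustrative implementation under stated assumptions, not the tutorial's code: the function name `upgma`, the tuple-based tree representation, and the distance values in the example are all made up for the demonstration. Distances to a fused cluster are the size-weighted average of the distances to its two members.

```python
from itertools import combinations


def upgma(names, dist):
    """Build a rooted tree by repeatedly fusing the closest pair of clusters.

    dist maps pairs of leaf labels, e.g. ("a", "d"), to distances.
    Returns (root, height): root is a nested tuple of labels and height maps
    every cluster to its distance from the leaves beneath it.
    """
    d = {frozenset(pair): value for pair, value in dist.items()}
    size = {n: 1 for n in names}
    height = {n: 0.0 for n in names}
    active = list(names)
    while len(active) > 1:
        # Selection step: the pair of clusters with the shortest distance.
        a, b = min(combinations(active, 2), key=lambda p: d[frozenset(p)])
        new = (a, b)
        height[new] = d[frozenset((a, b))] / 2.0
        size[new] = size[a] + size[b]
        active = [c for c in active if c not in (a, b)]
        # Recalculation step: average distance from the fused cluster.
        for c in active:
            d[frozenset((new, c))] = (
                size[a] * d[frozenset((a, c))] + size[b] * d[frozenset((b, c))]
            ) / size[new]
        active.append(new)
    return active[0], height


# Illustrative distances only; they are not taken from the slides.
example = {
    ("a", "d"): 2.0, ("b", "c"): 4.0,
    ("a", "b"): 6.0, ("a", "c"): 6.0, ("d", "b"): 6.0, ("d", "c"): 6.0,
}
root, height = upgma(["a", "b", "c", "d"], example)
print(root, height[root])
```

Because every leaf ends up at the same distance from the root, the result is only trustworthy when the constant-rate assumption holds.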

  10. [Figure: UPGMA steps 1 to 4 on leaves a, b, c, d, with fused pairs a,d and c,b, internal nodes e and f, and distance label Dbf]

  11. Neighbor Joining Algorithm • Constructs an unrooted tree.

  12. Neighbor Joining’ (merging close sequences, not the actual algorithm) • Step by step summary: • Calculate all pairwise distances. • Pick two nodes (i and j) for which the distance is minimal. • Define a new node (x) and re-calculate the distances from the free nodes to the new node. • Calculate Dix and Djx, the distances of the chosen nodes i and j to the new node x, as well as the distance from x to all other nodes. • Continue until two nodes remain, then connect them with an edge.

  13. Pick two nodes for which the distance is minimal (i,j)

  14. Nodes 5 and 6 are fused; node 10 is the new node.

  15. Re-calculate the distances from the new node • i, j: the fused nodes (5, 6) • x: the newly added node (node 10) • m: the remaining nodes in the star
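The update formula shown on this slide did not survive in the transcript. A minimal sketch, assuming the standard neighbor-joining update Dxm = (Dim + Djm - Dij)/2; the function name and the distance values in the example are hypothetical, and only the node labels 5, 6 and 10 come from the slides.

```python
def distances_from_new_node(d, i, j, x, remaining):
    """Distances from the new node x to every remaining node m.

    d maps frozenset({u, v}) to the distance between nodes u and v.
    Assumed update rule: Dxm = (Dim + Djm - Dij) / 2.
    """
    d_ij = d[frozenset((i, j))]
    return {
        frozenset((x, m)): (d[frozenset((i, m))] + d[frozenset((j, m))] - d_ij) / 2.0
        for m in remaining
    }


# Hypothetical distances: nodes 5 and 6 are fused into node 10, node 1 remains.
d = {
    frozenset((5, 6)): 0.06,
    frozenset((5, 1)): 0.30,
    frozenset((6, 1)): 0.32,
}
print(distances_from_new_node(d, 5, 6, 10, [1]))
```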

  16. Calculate Dix and Djx • r: roughly the average distance to the other nodes • L: number of leaves left in the tree (leaves are nodes representing taxa, sequences, etc.)

  17. Calculate Dix and Djx • r5 = ΣD5k/(L-2) = 3.22406/(9-2) = 0.46058 • r6 = ΣD6k/(L-2) = 3.22758/(9-2) = 0.461083

  18. Calculate Dix and Djx • D10,5 = (D5,6 + r5 - r6)/2 = (0.06088 + 0.46058 - 0.461083)/2 = 0.0301886 • D10,6 = D5,6 - D10,5 = 0.06088 - 0.0301886 = 0.0306914
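The arithmetic on slides 17 and 18 can be re-run directly. The input values below are the ones quoted on those slides; the variable names are illustrative.

```python
# Values quoted on slides 17 and 18.
L = 9             # number of leaves left in the tree
sum_d5 = 3.22406  # sum of distances from node 5 to the other nodes
sum_d6 = 3.22758  # sum of distances from node 6 to the other nodes
d_5_6 = 0.06088   # distance between nodes 5 and 6

r5 = sum_d5 / (L - 2)
r6 = sum_d6 / (L - 2)

# Branch lengths from the fused nodes 5 and 6 to the new node 10.
d_10_5 = (d_5_6 + r5 - r6) / 2
d_10_6 = d_5_6 - d_10_5

print(f"r5 = {r5:.6f}, r6 = {r6:.6f}")
print(f"D10,5 = {d_10_5:.7f}, D10,6 = {d_10_6:.7f}")
```

The two branch lengths always sum to D5,6, so the second follows from the first without a separate formula.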

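  19. [Figure: tree with branch lengths 0.0301886 (node 5 to node 10) and 0.0306914 (node 6 to node 10)]

  20. Step 2 [Figure: tree with new branch lengths 0.080375 and 0.044625]

  21. Step 3 [Figure: tree with new branch lengths 0.069258 and 0.040447]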

  22. Step 4

  23. Step 5

  24. Step 6

  25. Step 7

  26. Problems [Figure: unrooted tree with leaves 1, 2, 3, 4 and branch lengths 0.1, 0.1, 0.1, 0.4, 0.4]

  27. Neighbor Joining (not assuming equal divergence) • Step by step summary: • Calculate all pairwise distances. • Pick two nodes (i and j) for which the relative distance is minimal (lowest). • Define a new node (x) and re-calculate the distances from the free nodes to the new node. • Calculate Dix and Djx, the distances of the chosen nodes i and j to the new node x, as well as the distance from x to all other nodes. • Continue until two nodes remain, then connect them with an edge.
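The steps above can be sketched end to end. This is an illustrative implementation, not the tutorial's code. It assumes the relative distance is Mij = Dij - ri - rj with ri = ΣDik/(L-2), uses the branch-length formulas from slide 18, and assumes the update Dxm = (Dim + Djm - Dij)/2. The function name, the generated node names `x0`, `x1`, ... and the example tree (leaves 1 and 2 joined by branches of 0.1 and 0.4, likewise leaves 3 and 4, internal branch 0.1) are assumptions.

```python
from itertools import combinations


def neighbor_joining(names, dist):
    """Return the unrooted tree as a list of (node, node, branch_length) edges.

    dist maps pairs of leaf labels to distances. Leaf labels must not
    collide with the generated internal names 'x0', 'x1', ...
    """
    d = {frozenset(pair): value for pair, value in dist.items()}
    active = list(names)
    edges = []
    next_id = 0
    while len(active) > 2:
        L = len(active)
        r = {
            i: sum(d[frozenset((i, k))] for k in active if k != i) / (L - 2)
            for i in active
        }
        # Pick the pair with the lowest relative distance Mij.
        i, j = min(
            combinations(active, 2),
            key=lambda p: d[frozenset(p)] - r[p[0]] - r[p[1]],
        )
        x = f"x{next_id}"
        next_id += 1
        d_ij = d[frozenset((i, j))]
        d_ix = (d_ij + r[i] - r[j]) / 2.0
        edges.append((i, x, d_ix))
        edges.append((j, x, d_ij - d_ix))
        active = [m for m in active if m not in (i, j)]
        for m in active:
            d[frozenset((x, m))] = (
                d[frozenset((i, m))] + d[frozenset((j, m))] - d_ij
            ) / 2.0
        active.append(x)
    a, b = active
    edges.append((a, b, d[frozenset((a, b))]))
    return edges


# Pairwise distances read off the ASSUMED four-leaf tree described above.
example = {
    ("1", "2"): 0.5, ("1", "3"): 0.3, ("1", "4"): 0.6,
    ("2", "3"): 0.6, ("2", "4"): 0.9, ("3", "4"): 0.5,
}
for edge in neighbor_joining(["1", "2", "3", "4"], example):
    print(edge)
```

On this example the recovered branch lengths match the assumed tree even though leaves 1 and 3 have the smallest raw distance, which is the case the relative distance is designed to handle.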

  28. Step 2. Pick two nodes (i and j) for which the relative distance is minimal (lowest).

  29. [Figure: nodes i, j, x and m] • Negative values • As the average distance from the common ancestor to the rest of the nodes increases, Mij has a lower value. • Select the pair that produces the lowest value • Re-evaluate M with every iteration
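The formula for Mij is not preserved in the transcript. A minimal sketch, assuming Mij = Dij - ri - rj with ri = ΣDik/(L-2) as on slide 17, applied to an assumed four-leaf tree in which leaves 1 and 2 hang off one internal node by branches of 0.1 and 0.4, leaves 3 and 4 hang off the other by 0.1 and 0.4, and the internal branch is 0.1. The function name and that topology are assumptions; the slides give only the branch lengths.

```python
from itertools import combinations


def relative_distances(names, dist):
    """Mij = Dij - ri - rj for every pair, with ri = sum_k Dik / (L - 2)."""
    L = len(names)
    r = {
        i: sum(dist[frozenset((i, k))] for k in names if k != i) / (L - 2)
        for i in names
    }
    return {
        (i, j): dist[frozenset((i, j))] - r[i] - r[j]
        for i, j in combinations(names, 2)
    }


names = ["1", "2", "3", "4"]
# Path lengths between leaves in the ASSUMED tree.
dist = {
    frozenset(pair): value
    for pair, value in {
        ("1", "2"): 0.5, ("1", "3"): 0.3, ("1", "4"): 0.6,
        ("2", "3"): 0.6, ("2", "4"): 0.9, ("3", "4"): 0.5,
    }.items()
}

m = relative_distances(names, dist)
closest_raw = min(combinations(names, 2), key=lambda p: dist[frozenset(p)])
print("smallest raw distance:", closest_raw)
for pair, value in sorted(m.items(), key=lambda item: item[1]):
    print(pair, round(value, 3))
```

Every Mij here is negative, and the true neighbor pairs (1, 2) and (3, 4) score lower than the pair (1, 3) that has the smallest raw distance.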

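  30. [Figure: unrooted tree with leaves 1, 2, 3, 4 and branch lengths 0.1, 0.1, 0.1, 0.4, 0.4]

  31. [Figure: unrooted tree with leaves 1, 2, 3, 4 and branch lengths 0.1, 0.1, 0.1, 0.4, 0.4]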

  32. Re-calculate the distances from the new node [Figure: nodes i, j, x and m]

  33. Example • Original distance matrix • Relative distance matrix (Mij) • The Mij table is used only to choose the closest pairs, not to calculate the distances

  34. Problems with phylogenetic trees [Figure: trees over the same taxa (Bacillus, Burkholderias, Aeromonas, Pseudomonas, Lechevaliera, E.coli, Salmonella) shown with different leaf orders]

  35. Software • PHYLIP: http://evolution.gs.washington.edu/phylip.html • PAUP: http://paup.csit.fsu.edu/ • MEGA3: http://www.megasoftware.net/ • More: http://evolution.genetics.washington.edu/phylip/software.html
