1 / 27

Linear Sequence Alignment

Linear Sequence Alignment. Travis Hillenbrand. Dot Matrix Dynamic Programming Algorithm Greedy X-drop Approach Linear Alignment. Methods of Comparison. Dot Matrix Method. http://arbl.cvmbs.colostate.edu/molkit/dnadot/index.html. Match …C… …C…. Mismatch …C… …G…. Indel …C… …-…. |.

alpha
Download Presentation

Linear Sequence Alignment

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Linear Sequence Alignment Travis Hillenbrand

  2. Dot Matrix Dynamic Programming Algorithm Greedy X-drop Approach Linear Alignment Methods of Comparison

  3. Dot Matrix Method http://arbl.cvmbs.colostate.edu/molkit/dnadot/index.html

  4. Match …C… …C… Mismatch …C… …G… Indel …C… …-… | Sequence Alignment ATCGATACG, ATGGATTACG 3 possibilities

  5. || ||| ||| Matches: +1 +1 +1 +1 +1 +1 +1 +1 = +8 Mismatches: -1 = -1 Gaps: -2 = -2 Total score = +5 Global Pairwise Alignment ATCGAT-ACG ATGGATTACG ATCGATACG, ATGGATTACG

  6. Dynamic Programming Global alignment (Needleman-Wunsch) algorithm

  7. Dynamic Programming Global alignment (Needleman-Wunsch) algorithm

  8. Dynamic Programming Global alignment (Needleman-Wunsch) algorithm

  9. +1 Max= 1 Dynamic Programming Global alignment (Needleman-Wunsch) algorithm

  10. Dynamic Programming Global alignment (Needleman-Wunsch) algorithm

  11. Dynamic Programming Global alignment (Needleman-Wunsch algorithm) GATC GA-C || |

  12. Greedy X-drop Alignment • Aligns sequences that differ by sequencing errors • Works with measure of difference • Restricts indel penalty Zhang et al. 2000

  13. Greedy X-drop Alignment Zhang et al. 2000

  14. Greedy X-drop Alignment

  15. Greedy X-drop Alignment • X-drop condition saves computation

  16. Linear Alignment

  17. ATCGATACG ATGGATTACG ATCGATACG ATGGATTACG ATCGATACG ATGGATTACG ATCGATACG ATGGATTACG ATCGATACG ATGGATTACG | || ||| Linear Alignment • Index of coincidence • Maximum number of matches between two sequences • Ungapped alignment …

  18. ATCGATACG ATGGATTACG -ATCGATACG ATGGATTACG ATCGATACG -ATGGATTACG ATCGATACG ATGGATTACG || ||| |||| | || ||| Linear Alignment • Attempt to increase similarity Window score: 2 -3 -3

  19. Comparison of alignments • 9 human/mouse homologous gene cds pairs retrieved (Jareborg et al. 1999) • Greedy alignment run first mat=10, mis=-6, X=2200 (indel=-11) • Dynamic Programming and Linear alignment using truncated seqs

  20. Comparison of alignments • Similarity scores

  21. Comparison of alignments • Similarity percentage

  22. Comparison of alignments

  23. Comparison of alignments

  24. Comparison of alignments

  25. Comparison of alignments Maximum coincidence alignment: Offset -72 yielded 1642 matches of 2175 possible (75.4943% similarity), score 6611 ACAGTACTGCTACTTCTCGCCGACTGGGTGCTGCTCCGGACCGCGCTGCCCCGCATATTCTCCCTGCTGGTGCCCACCGCGCTGCCACTGCTCCGGGT | | || | | | ||||||| | | | | | | || | || | | ||| | ATGGCTGCGCACGTCTGGCTGGCGGCCGCCCTGCTCCTTCTGGTGGACTGGCTGCTGCTGCGGCCCATGCTCCCGGGAATCTTCTCCCTGTTGGTTCC ACGGGCCGCCTCACTGACTGGATTCTACAAGATGGCTCAGCCGATACCTTCACTCGAAACTTAACTCTCATGTCCATTCTCACCATAGCCAGTGCAGT ||||||||| |||||||||||||||| || ||| ||| || |||||| || ||| || |||||||||||||||||||||||||| ||| ACGGGCCGCATCACTGACTGGATTCTTCAGGATAAGACAGTTCCTAGCTTCACCCGCAACATATGGCTCATGTCCATTCTCACCATAGCCAGCACAGC Decreasing the gap penalty allows similar regions to be aligned without using IOC

  26. Comparison of alignments References Needleman, S. B. & Wunsch, C. D. 1970. A general method applicable to the search for similarities in the amino acid sequence of two proteins. Journal of Molecular Biology. 48: 443-453. Setubal, J. and Meidanis, J. 1997. Introduction to Computational Molecular Biology. Pacific Grove, California: Brooks/Cole. Zhang, Z.; Schwartz, S.; Wagner, L.; and Miller, W. 2000. A greedy algorithm for aligning DNA sequences. Journal of Computational Biology 7:203-214.

  27. Linear Sequence Alignment Travis Hillenbrand

More Related