- 66 Views
- Uploaded on
- Presentation posted in: General

Sequence Alignment

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.

- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -

Sequence Alignment

Oct 9, 2002

Joon Lee

Genomics & Computational Biology

- Optimization problems: find the best decision one after another
- Subproblems are not independent
- Subproblems share subsubproblems
- Solve subproblem, save its answer in a table

Genomics & Computational Biology

- Characterize the structure of an optimal solution
- Recursively define the value of an optimal solution
- Compute the value of an optimal solution in a bottom-up fashion
- Construct an optimal solution from computed information

Genomics & Computational Biology

Sequence 1: G A A T T C A G T T A

Sequence 2: G G A T C G A

Genomics & Computational Biology

G A A T T C A G T T A

| | | | | |

G G A _ T C _ G _ _ A

G _ A A T T C A G T T A

| | | | | |

G G _ A _ T C _ G _ _ A

Genomics & Computational Biology

- Initialization: gap penalty
- Scoring: matrix fill
- Alignment: trace back

Genomics & Computational Biology

Genomics & Computational Biology

- A = a1a2…an, B = b1b2…bm
- Sij : score at (i,j)
- s(aibj) : matching score between ai andbj
- w : gap penalty

figure source

Genomics & Computational Biology

- Match: +2
- Mismatch: -1
- Gap: -2

Genomics & Computational Biology

0 + 2 = 2

-2 + (-2) = -4

-2 + (-2) = -4

Genomics & Computational Biology

-2 + (-1) = -3

-4 + (-2) = -6

2 + (-2) = 0

Genomics & Computational Biology

-2 + 2 = 0

2 + (-2) = 0

-4 + (-2) = -6

Genomics & Computational Biology

Genomics & Computational Biology

Genomics & Computational Biology

G A A T T C A G T T A

G G A _ T C _ G _ _ A

G A A T T C A G T T A

G G A T _ C _ G _ _ A

Genomics & Computational Biology

- Match: +2
- Mismatch: -1
- Gap: -2

Genomics & Computational Biology

- Match: +2
- Mismatch: -1
- Gap: -2
G C A T C C G

G A T C G

G A T C G

G A T C G

Genomics & Computational Biology

- Match/mismatch → Substitution matrix

Genomics & Computational Biology

- Global: Needlman-Wunsch Algorithm
- Local: Smith-Waterman Algorithm

From Mount Bioinformatics Chap 3

Genomics & Computational Biology

- Sequence alignment with Java applet
- http://linneus20.ethz.ch:8080/5_4_5.html

Genomics & Computational Biology