Class 13: Global and Semi-global Alignments

PAM30 scoring matrix (columns A to W; columns Y to * continue in the second block below)

    A   R   N   D   C   Q   E   G   H   I   L   K   M   F   P   S   T   W
A   6  -7  -4  -3  -6  -4  -2  -2  -7  -5  -6  -7  -5  -8  -2   0  -1 -13
R  -7   8  -6 -10  -8  -2  -9  -9  -2  -5  -8   0  -4  -9  -4  -3  -6  -2
N  -4  -6   8   2 -11  -3  -2  -3   0  -5  -7  -1  -9  -9  -6   0  -2  -8
D  -3 -10   2   8 -14  -2   2  -3  -4  -7 -12  -4 -11 -15  -8  -4  -5 -15
C  -6  -8 -11 -14  10 -14 -14  -9  -7  -6 -15 -14 -13 -13  -8  -3  -8 -15
Q  -4  -2  -3  -2 -14   8   1  -7   1  -8  -5  -3  -4 -13  -3  -5  -5 -13
E  -2  -9  -2   2 -14   1   8  -4  -5  -5  -9  -4  -7 -14  -5  -4  -6 -17
G  -2  -9  -3  -3  -9  -7  -4   6  -9 -11 -10  -7  -8  -9  -6  -2  -6 -15
H  -7  -2   0  -4  -7   1  -5  -9   9  -9  -6  -6 -10  -6  -4  -6  -7  -7
I  -5  -5  -5  -7  -6  -8  -5 -11  -9   8  -1  -6  -1  -2  -8  -7  -2 -14
L  -6  -8  -7 -12 -15  -5  -9 -10  -6  -1   7  -8   1  -3  -7  -8  -7  -6
K  -7   0  -1  -4 -14  -3  -4  -7  -6  -6  -8   7  -2 -14  -6  -4  -3 -12
M  -5  -4  -9 -11 -13  -4  -7  -8 -10  -1   1  -2  11  -4  -8  -5  -4 -13
F  -8  -9  -9 -15 -13 -13 -14  -9  -6  -2  -3 -14  -4   9 -10  -6  -9  -4
P  -2  -4  -6  -8  -8  -3  -5  -6  -4  -8  -7  -6  -8 -10   8  -2  -4 -14
S   0  -3   0  -4  -3  -5  -4  -2  -6  -7  -8  -4  -5  -6  -2   6   0  -5
T  -1  -6  -2  -5  -8  -5  -6  -6  -7  -2  -7  -3  -4  -9  -4   0   7 -13
W -13  -2  -8 -15 -15 -13 -17 -15  -7 -14  -6 -12 -13  -4 -14  -5 -13  13
Y  -8 -10  -4 -11  -4 -12  -8 -14  -3  -6  -7  -9 -11   2 -13  -7  -6  -5
V  -2  -8  -8  -8  -6  -7  -6  -5  -6   2  -2  -9  -1  -8  -6  -6  -3 -15
B  -3  -7   6   6 -12  -3   1  -3  -1  -6  -9  -2 -10 -10  -7  -1  -3 -10
J  -6  -7  -6 -10  -9  -5  -7 -10  -7   5   6  -7   0  -2  -7  -8  -5  -7
Z  -3  -4  -3   1 -14   6   6  -5  -1  -6  -7  -4  -5 -13  -4  -5  -6 -14
X  -1  -1  -1  -1  -1  -1  -1  -1  -1  -1  -1  -1  -1  -1  -1  -1  -1  -1
* -17 -17 -17 -17 -17 -17 -17 -17 -17 -17 -17 -17 -17 -17 -17 -17 -17 -17
    Y   V   B   J   Z   X   *
A  -8  -2  -3  -6  -3  -1 -17
R -10  -8  -7  -7  -4  -1 -17
N  -4  -8   6  -6  -3  -1 -17
D -11  -8   6 -10   1  -1 -17
C  -4  -6 -12  -9 -14  -1 -17
Q -12  -7  -3  -5   6  -1 -17
E  -8  -6   1  -7   6  -1 -17
G -14  -5  -3 -10  -5  -1 -17
H  -3  -6  -1  -7  -1  -1 -17
I  -6   2  -6   5  -6  -1 -17
L  -7  -2  -9   6  -7  -1 -17
K  -9  -9  -2  -7  -4  -1 -17
M -11  -1 -10   0  -5  -1 -17
F   2  -8 -10  -2 -13  -1 -17
P -13  -6  -7  -7  -4  -1 -17
S  -7  -6  -1  -8  -5  -1 -17
T  -6  -3  -3  -5  -6  -1 -17
W  -5 -15 -10  -7 -14  -1 -17
Y  10  -7  -6  -7  -9  -1 -17
V  -7   7  -8   0  -6  -1 -17
B  -6  -8   6  -8   0  -1 -17
J  -7   0  -8   6  -6  -1 -17
Z  -9  -6   0  -6   6  -1 -17
X  -1  -1  -1  -1  -1  -1 -17
* -17 -17 -17 -17 -17 -17   1

In the previous class…

We can find edit distance efficiently

Today we are going to do it backwards

Definitions

What we look for

Single-letter scores

Starting from the end

Score of the alignment

Recursive Evaluation

Boundary conditions

Before the sequences there are gaps

Gaps at the borders

Global vs. semi-global alignment

Where is the score?
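
The recursion, the boundary conditions and the place where the final score is read can be put together in a short Python sketch. It is only an illustration under stated assumptions: the function name alignment_score, the +1/-1 DNA scores and the linear gap penalty of -2 are not taken from the slides, and "semi-global" is taken here to mean that gaps before and after either sequence are free.

```python
def alignment_score(a, b, score, gap=-2, semi_global=False):
    """Best alignment score of strings a and b by dynamic programming.

    score(x, y) gives the single-letter score; gap is a linear penalty
    per gap position (an assumed value, not one from the slides).
    """
    n, m = len(a), len(b)
    # S[i][j] = best score of aligning a[:i] with b[:j]
    S = [[0] * (m + 1) for _ in range(n + 1)]

    # Boundary conditions: before the sequences there are only gaps.
    # Global: those gaps are penalized. Semi-global: they are free (0).
    if not semi_global:
        for i in range(1, n + 1):
            S[i][0] = i * gap
        for j in range(1, m + 1):
            S[0][j] = j * gap

    # Recursive evaluation: the last column of the alignment is either
    # a pair of letters, or a letter of one sequence against a gap.
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            S[i][j] = max(
                S[i - 1][j - 1] + score(a[i - 1], b[j - 1]),
                S[i - 1][j] + gap,
                S[i][j - 1] + gap,
            )

    if semi_global:
        # Trailing gaps are free, so the score is the best value
        # found in the last row or the last column.
        return max(max(S[n]), max(row[m] for row in S))
    # Global: both sequences are used completely, so the score is
    # in the bottom-right corner.
    return S[n][m]


def dna_score(x, y):
    """Assumed DNA single-letter score: +1 match, -1 mismatch."""
    return 1 if x == y else -1


print(alignment_score("CG", "ACGT", dna_score))                    # -2
print(alignment_score("CG", "ACGT", dna_score, semi_global=True))  # 2
```

With these assumed values, aligning CG against ACGT globally pays for the two border gaps (2 matches, 2 gaps: -2), while the semi-global version ignores them and keeps only the two matches (2).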

Scoring matrices

DNA scoring matrix

Amino-acid scoring matrices

PAM matrices

PAM30 matrix
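
As a small illustration of how single-letter scores from a matrix like this are used, the sketch below copies a handful of entries from the matrix printed at the top of this document and adds them up along an alignment without gaps. The names PAM30_SUBSET, pam_score and ungapped_score are hypothetical, and only the few pairs listed are available.

```python
# A few entries copied from the PAM30 table in this document.
# The matrix is symmetric, so each pair is stored only once.
PAM30_SUBSET = {
    ("A", "A"): 6,
    ("D", "D"): 8,
    ("D", "E"): 2,
    ("W", "W"): 13,
    ("L", "I"): -1,
    ("C", "W"): -15,
}


def pam_score(x, y, table=PAM30_SUBSET):
    """Look up the score of the pair (x, y) in either order."""
    if (x, y) in table:
        return table[(x, y)]
    return table[(y, x)]  # raises KeyError if the pair is not in the subset


def ungapped_score(a, b, score=pam_score):
    """Sum of single-letter scores for two sequences of equal length."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(score(x, y) for x, y in zip(a, b))


print(ungapped_score("ADW", "AEW"))  # 6 + 2 + 13 = 21
```

Identical letters score high (13 for W with W), similar amino acids get a small positive score (2 for D with E), and unlikely substitutions are strongly negative (-15 for C with W).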

Gap penalty

One long gap is more likely than several short ones

Affine gaps

Linear vs. affine
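
The difference between the two penalties can be seen with a short Python sketch. The function names and the values (-2 per position for the linear penalty, -5 to open and -1 to extend for the affine one) are assumptions chosen for illustration. Conventions also differ on whether the first gap position pays the extension cost, and here it does not.

```python
def linear_gap(k, gap=-2):
    """Linear penalty: every gap position costs the same."""
    return k * gap


def affine_gap(k, gap_open=-5, gap_extend=-1):
    """Affine penalty: opening a gap is expensive, extending it is cheap.

    Assumed convention: the first position costs gap_open and each
    additional position costs gap_extend.
    """
    if k == 0:
        return 0
    return gap_open + (k - 1) * gap_extend


# One gap of length 5 versus five separate gaps of length 1
print(linear_gap(5), 5 * linear_gap(1))  # -10 -10
print(affine_gap(5), 5 * affine_gap(1))  # -9 -25
```

Under the linear penalty, one gap of length 5 and five gaps of length 1 cost the same. Under the affine penalty the single long gap is much cheaper, so alignments with few long gaps are preferred.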

Homework