Class 9

April 19th, 2016

Welcome

Which are all the entities?
What are their relationships? That is, how are they related?
What are their attributes? Remember that entities and relationships can have attributes
What are the identifiers?
What are the cardinalities?

Entity-Relationship Diagram of GEO

How do you determine the origin of replication in a bacterial chromosome? Can you do the same in Eukarya? Why?
How do you determine the binding sites of a transcription factor?
What is a Motif? What is a Position Specific Score Matrix (also known as Position Weight Matrix)?

Input: list of vector of characters named CDS
Output: list of vector of characters
you can use the function translate() that transform a single CDS into the corresponding protein

Write first a function score.pos() to evaluate the score of a fixed position. The inputs are
- pos: position in the genome
- genome: vector of chars
- mat: a position specific score matrix
Then write the code to evaluate score.pos() on each position of a genome. The final output is a vector of numbers, each one representing the score of each position.

DNA replication is not perfect. Some bases can change

If these changes are lethal, the cell dies (by definition)

Therefore we only see changes that are compatible with cell life

This is one component of natural selection

Naturally, if two genes have the same sequence, they encode the same protein

If they differ in a few bases, the proteins will also differ a little, or less (why?)

So if two proteins are very similar, they probably do the same function

A few changes will probably not change they way it folds

Same shape, same function.

Comparing sequences