Blog of Andrés Aravena
CMB2:

Exercises for Computing in Molecular Biology

18 February 2016

These exercises are not graded but help you to understand the subject and learn effectively. They can be solved individually or in group. Remember that learning is a team activity.

You can always ask your questions (in any language) to the email or in the webpage https://groups.google.com/d/forum/iu-cmb. You can also give answers.

  1. Structured documents

  2. Structured data: Entity-Relationship models and GEO.

  3. Finding Structures: Clustering

  4. Reading structured genomic data. Also the last part of clustering. Slides, Homework

  5. Statistical properties of genomic sequences Slides, Homework

  6. Genetic code. Genes and protein sequences Slides

  7. ORF, CDS, Transcription Factor Binding Sites Slides

  8. Identify and write down the new words or concepts presented in each classes and Google them. Learn them.

  9. Describe with your own words what are Surjective and Injective relationships.

  10. Identify the entities and relationships on the following cases. Draw Entitiy-Relationship (E-R) diagrams for each of them

    • You as student of Istanbul University
    • Human cells
    • Streets of Istanbul
    • Your class schedule
  11. Look for the concept of Desired difficulty at Bjork “Learning & Forgetting” Lab webpage

  12. Practice the usage of GEOquery. Read the series GSE3541

    • use RStudio Environment pane to examine the components of the resulting object
    • which are the samples?
    • which is the platform?
    • do you need to download the platform?
    • which are the genes?
  13. If the mean is a single value to summarize a vector, how can we choose 2 values to have a better model? Or 3? Or any k number of models?

  14. Can you guess which question will be asked in the exam about this subjects?

Codon Adaptation Index

deadline: midterm

Hint: The function syncodons() may be useful

  1. How do we determine the function of a CDS?

Hint: Needleman–Wunsch, Smith–Waterman