The homework of this week aims to replicate the tables and graphics of the website Comparative Genometrics, which has precomputed statistics for the DNA sequences of several thousands of Bacteria.
Please take a look at the page of E.coli K-12. in the Comparative Genometrics. You can see that the graphics are made based on the table CP009685.txt.
Please write the R code to read the genome of E.coli and
produce a table equivalent to CP009685.txt.
You may see that the step size is 1000 nt, the column
pos is the average of
end, the columns
nT are the output of
very easy to calculate.
You have to research and understand how to make the columns
cTAsk. The function
cumsum() may be useful, but you can do the same with a
PS. Can you make a function to produce the reverse complement of a DNA sequence?