To estimate the number of contigs in an assembly:

For a genome of length \(G\), with \(N\) reads of average length \(L\) and an overlap threshold of \(T\) (the minimum overlap length needed to join two reads, in the same units as \(L\)), the expected number of contigs in an assembly is \[N\exp\left(-\frac{(L-T)N}{G}\right).\] More precisely, this is the expected number of gaps between contigs.
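
As a quick numerical check, the formula can be evaluated directly. The following is a minimal sketch, not a reference implementation: the function name `expected_contigs` and the example values are hypothetical, chosen only for illustration, and all lengths are assumed to be in the same units (e.g. bases).

```python
import math


def expected_contigs(genome_length, num_reads, read_length, overlap_threshold):
    """Evaluate N * exp(-(L - T) * N / G) for the given parameters.

    genome_length     -- G, length of the genome
    num_reads         -- N, number of reads
    read_length       -- L, average read length
    overlap_threshold -- T, overlap threshold (same units as L)
    """
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    if overlap_threshold > read_length:
        raise ValueError("overlap_threshold must not exceed read_length")
    # (L - T) * N / G is the exponent in the formula above.
    exponent = (read_length - overlap_threshold) * num_reads / genome_length
    return num_reads * math.exp(-exponent)


if __name__ == "__main__":
    # Hypothetical example values, not taken from any real data set.
    G, N, L, T = 1_000_000, 50_000, 100, 20
    print(f"Expected contigs: {expected_contigs(G, N, L, T):.1f}")
```

With these illustrative values the exponent is \((L-T)N/G = 4\), so the estimate is \(50000\,e^{-4}\), or roughly 916 contigs. Increasing \(N\) further makes the exponential factor dominate, so the expected count eventually falls as coverage rises.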