Tag Archives: simulation

Small project on genome shotgun: comparing results

As some of you correctly understood the script with no name and no instructions takes as input a single sequence and simulate a whole genome shotgun.

It takes three parameters: filename of the sequence, desired coverage and read length.

I used Yeast chromosome 2 as sequence for tests, and tried both with 20X and 50X coverage with increasing read length from 50bp (SOLiD like) to 400bp (454 like) and 800bp (Sanger like).

All the datasets were independently assembled with Newbler. We know that we started from a single sequence  573,563 bp long.

"de novo" assembly with Newbler: results

 

Tagged , ,