High throughput sequencing
Dye terminator sequencing has long been the main method for providing sequence data, but it has the disadvantage of being time consuming and expensive when a massive amount of data needs to be analysed. A revolution in the field of sequencing began at the turn of the 21st century, with the introduction of sequence by synthesis methods 1) 2) and today (2011) there are many different platforms available for high throughput sequencing. What these methods have in common is that they parallelize the sequencing process, typically producing thousands of short sequencing reads at once. The Wikipedia page on DNA sequencing provides a rich historical review of the subject, and many scientific articles describe the differences among the technologies 3) 4) 5) and compare the expected results 6) 7). Other names for high throughput sequencing methods are next generation sequencing, second generation sequencing, third generation sequencing or massively parallel sequencing.
High throughput sequencing for genetic diversity
Genetic diversity studies form the basis of many aspects of biodiversity science. High throughput sequencing has the potential to dramatically change how genetic diversity studies are planned and analyzed. Still, although the ratio of the number of reads produced in a single run is truly cost-effective, the relatively high cost of a single run has prevented many academic laboratories from using these innovative technologies. To overcome this limitation, barcoding systems were developed 8), where different oligonucleotides (8 to 10 bp in length) are incorporated in the different DNA samples to be sequenced. After these samples are labelled with the barcodes, they can all be multiplexed and sequenced together in a single sequencing run. Each sample is then sorted using bioinformatics methods, by recognition of its barcode. When coupled with laboratory methods for genome complexity reduction, high throughput sequencing can be a very efficient strategy for providing a massive amount of sequence data from different samples, in a short time and at reasonable costs.
High throughput sequencing at the QCBS
A few QCBS members have used one of the high throughput sequencing methods currently available (as of 2011). One example, using a AFLP-like and a pyrosequencing step with a Genome Sequencer FLX (GS-FLX) System is detailed in the specific page High throughput sequencing at the QCBS of this wiki.