We sequenced the genomes from a monozygotic twin discordant for schizophrenia and a tumor-normal pair of an ovarian cancer patient. Using whole-... Show More
We sequenced the genomes from a monozygotic twin discordant for schizophrenia and a tumor-normal pair of an ovarian cancer patient. Using whole-genome twin data to discriminate between correctly identified single nucleotide variants (SNVs) and errors a strategy for the accurate detection of SNVs was developed. By applying stringent sequencing quality measures, excluding error-prone regions and selecting SNVs identified by different mapping and variation calling algorithms, error rates were ~37-fold reduced. This enabled us to identify the first discordant SNVs in monozygotic twins using whole-genome sequencing. In addition, by showing that novel SNVs are highly enriched in errors, accurate estimates of the number of novel and rare SNVs occurring in unrelated Caucasian individuals were obtained. Finally, somatic mutations in coding and regulatory sequences were reliably identified in the highly rearranged ovarian tumor. Overall, our data demonstrate that strategies to reduce error rates in whole-genomes are required for disease gene discovery.
Alternative Stable ID
Whole Genome Sequencing
This study includes 2 datasets:
Click on a Dataset Accession in the table below to learn more, and to find out who to contact about access to these data