This is an archive of papers published by the staff and faculty of Fox Chase Cancer Center. For questions about content, please contact Talbot Research Library
Last updated on
Buetow KH , Edmonson MN , Cassidy AB
Reliable identification of large numbers of candidate SNPs from public EST data
Nature Genetics. 1999 Mar;21(3) :323-325
AbstractHigh-resolution genetic analysis of the human genome promises to provide insight into common disease susceptibility. To perform such analysis will require a collection of high- throughput, high-density analysis reagents. We have developed a polymorphism detection system that uses public-domain sequence data. This detection system is called the single nucleotide polyrmorphism pipeline (SNPpipeline). The analytic core of the SNPpipeline is composed of three components: PHRED, PHRAP and DEMIGLACE. PHRED and PHRAP are components of a sequence analysis suite developed to perform the semi-automated analysis required for large-scale genomes(1,2) (provided courtesy of P. Green). Using these informatics tools, which examine redundant raw expressed sequence tag (EST) data, we have identified more than 3,000 candidate single-nucleotide polyrmorphisms (SNPs). Empiric validation studies of a set of 192 candidates indicate that 82% identify variation in a sample of ten Centre d'Etudes Polymorphism Humain (CEPH) individuals. Our results suggest that existing sequence resources may serve as a valuable source for identifying genetic variation.
NotesTimes Cited: 86 English Article 173JR NAT GENET