FCCC LOGO Faculty Publications
Wei Q , Wang L , Wang Q , Kruger WD , Dunbrack RL Jr
Testing computational prediction of missense mutation phenotypes: functional characterization of 204 mutations of human cystathionine beta synthase
Proteins. 2010 Jul;78(9) :2058-74
PMID: 20455263   
Back to previous list
Abstract
Predicting the phenotypes of missense mutations uncovered by large-scale sequencing projects is an important goal in computational biology. High-confidence predictions can be an aid in focusing experimental and association studies on those mutations most likely to be associated with causative relationships between mutation and disease. As an aid in developing these methods further, we have derived a set of random mutations of the enzymatic domains of human cystathionine beta synthase. This enzyme is a dimeric protein that catalyzes the condensation of serine and homocysteine to produce cystathionine. Yeast missing this enzyme cannot grow on medium lacking a source of cysteine, while transfection of functional human CBS into yeast strains missing endogenous enzyme can successfully complement for the missing gene. We used PCR mutagenesis with error-prone Taq polymerase to produce 948 colonies and compared cell growth in the presence or absence of a cysteine source as a measure of CBS function. We were able to infer the phenotypes of 204 single-site mutants, 79 of them deleterious and 125 neutral. This set was used to test the accuracy of six publicly available prediction methods for phenotype prediction of missense mutations: SIFT, PolyPhen, PMut, SNPs3D, PhD-SNP, and nsSNPAnalyzer. The top methods are PolyPhen, SIFT, and nsSNPAnalyzer, which have similar performance. Using kernel discriminant functions, we found that the difference in position-specific scoring matrix values is more predictive than the wild-type PSSM score alone, and that the relative surface area in the biologically relevant complex is more predictive than that of the monomeric proteins.
Notes
Wei, Qiong Wang, Liqun Wang, Qiang Kruger, Warren D Dunbrack, Roland L Jr R01 GM73784/GM/NIGMS NIH HHS/United States R01 HL057299/HL/NHLBI NIH HHS/United States Research Support, N.I.H., Extramural United States Proteins Proteins. 2010 Jul;78(9):2058-74.