AI Chat Paper
Note: Please note that the following content is generated by AMiner AI. SciOpen does not take any responsibility related to this content.
{{lang === 'zh_CN' ? '文章概述' : 'Summary'}}
{{lang === 'en_US' ? '中' : 'Eng'}}
Chat more with AI
PDF (4.2 MB)
Collect
Submit Manuscript AI Chat Paper
Show Outline
Outline
Show full outline
Hide outline
Outline
Show full outline
Hide outline
Open Access

A New Hidden Markov Model for Protein Quality Assessment Using Compatibility Between Protein Sequence and Structure

Department of Computer Science and Christopher S. Bond Life Sciences Center, University of Missouri, MO 65211, USA.
Christopher S. Bond Life Sciences Center, University of Missouri, MO 65211, USA
Department of Computer Science, City University of Hong Kong, Hong Kong, China.
Show Author Information

Abstract

Protein structure Quality Assessment (QA) is an essential component in protein structure prediction and analysis. The relationship between protein sequence and structure often serves as a basis for protein structure QA. In this work, we developed a new Hidden Markov Model (HMM) to assess the compatibility of protein sequence and structure for capturing their complex relationship. More specifically, the emission of the HMM consists of protein local structures in angular space, secondary structures, and sequence profiles. This model has two capabilities: (1) encoding local structure of each position by jointly considering sequence and structure information, and (2) assigning a global score to estimate the overall quality of a predicted structure, as well as local scores to assess the quality of specific regions of a structure, which provides useful guidance for targeted structure refinement. We compared the HMM model to state-of-art single structure quality assessment methods OPUSCA, DFIRE, GOAP, and RW in protein structure selection. Computational results showed our new score HMM.Z can achieve better overall selection performance on the benchmark datasets.

References

[1]
J. Moult, K. Fidelis, A. Kryshtafovych, T. Schwede, and A. Tramontano, Critical assessment of methods of protein structure prediction (CASP) round x, Proteins: Structure, Function, and Bioinformatics, vol. 82, no. S2, pp. 1-6, 2014.
[2]
Y. Zhang, Progress and challenges in protein structure prediction, Current Opinion in Structural Biology, vol. 18, no. 3, pp. 342-348, 2008.
[3]
Y. Xu, D. Xu, and E. C. Uberbacher, An efficient computational method for globally optimal threading, J. Comput. Biol., vol. 5, no. 3, pp. 597-614, 1998.
[4]
Y. Xu and D. Xu, Protein threading using PROSPECT: Design and evaluation, Proteins, vol. 40, no. 3, pp. 343-354, 2000.
[5]
J. Peng and J. Xu, Boosting protein threading accuracy, Res. Comput. Mol. Biol., vol. 5541, pp. 31-45, 2009.
[6]
J. Soding, Protein homology detection by HMM-HMM comparison, Bioinformatics, vol. 21, no. 7, pp. 951-960, 2005.
[7]
K. Lin, A. C. May, and W. R. Taylor, Threading using neural network (TUNE): The measure of protein sequence-structure compatibility, Bioinformatics, vol. 18, no. 10, pp. 1350-1357, 2002.
[8]
J. U. Bowie, K. Zhang, M. Wilmanns, and D. Eisenberg, Three-dimensional profiles for measuring compatibility of amino acid sequence with three-dimensional structure, Methods Enzymol., vol. 266, pp. 598-616, 1996.
[9]
S. Sunyaev, E. Kuznetsov, I. Rodchenkov, and V. Tumanyan, Protein sequence-structure compatibility criteria in terms of statistical hypothesis testing, Protein Eng., vol. 10, no. 6, pp. 635-646, 1997.
[10]
H. Sumikawa, K. Fukuhara, E. Suzuki, Y. Matsuo, and K. Nishikawa, Tertiary structural models for human interleukin-6 and evaluation by a sequence-structure compatibility method and NMR experimental information, FEBS Lett., vol. 404, nos. 2-3, pp. 234-240, 1997.
[11]
Y. Matsuo and K. Nishikawa, Protein structural similarities predicted by a sequence-structure compatibility method, Protein Sci., vol. 3, no. 11, pp. 2055-2063, 1994.
[12]
A. C. Camproux and P. Tuffery, Hidden Markov model-derived structural alphabet for proteins: The learning of protein local shapes captures sequence specificity, Biochim. Biophys. Acta, vol. 1724, no. 3, pp. 394-403, 2005.
[13]
A. C. Camproux, P. Tuffery, J. P. Chevrolat, J. F. Boisvieux, and S. Hazout, Hidden Markov model approach for identifying the modular framework of the protein backbone, Protein Eng., vol. 12, no. 12, pp. 1063-1073, 1999.
[14]
C. Bystroff, V. Thorsson, and D. Baker, HMMSTR: A hidden Markov model for local sequence-structure correlations in proteins, J. Mol. Biol., vol. 301, no. 1, pp. 173-190, 2000.
[15]
Y. Wu, M. Lu, M. Chen, J. Li, and J. Ma, OPUS-Ca: A knowledge-based potential function requiring only Calpha positions, Protein Sci., vol. 16, no. 7, pp. 1449-1463, 2007.
[16]
H. Zhou and Y. Zhou, Distance-scaled, finite ideal-gas reference state improves structure-derived potentials of mean force for structure selection and stability prediction, Protein Sci., vol. 11, no. 11, pp. 2714-2726, 2002.
[17]
H. Zhou and J. Skolnick, GOAP: A generalized orientation-dependent, all-atom statistical potential for protein structure prediction, Biophysical Journal, vol. 101, no. 8, pp. 2043-2052, 2011.
[18]
J. Zhang and Y. Zhang, A novel side-chain orientation dependent potential derived from random-walk reference state for protein fold selection and structure prediction, PLoS One, vol. 5, no. 10, p. e15386, 2010.
[19]
S. F. Altschul, T. L. Madden, A. A. Schaffer, J. Zhang, Z. Zhang, W. Miller, and D. J. Lipman, Gapped BLAST and PSI-BLAST: A new generation of protein database search programs, Nucleic Acids Res., vol. 25, no. 17, pp. 3389-3402, 1997.
[20]
W. Zheng and X. Liu, A protein structural alphabet and its substitution matrix CLESUM, Transactions on Computational Systems Biology II, vol. 3680, pp. 59-67, 2005.
[21]
J. L. Sussman, D. Lin, J. Jiang, N. O. Manning, J. Prilusky, O. Ritter, and E. E. Abola, Protein Data Bank (PDB): Database of three-dimensional structural information of biological macromolecules, Acta Crystallogr. D Biol. Crystallogr., vol. 54, pp. 1078-1084, 1998.
[22]
W. Li and A. Godzik, Cd-hit: A fast program for clustering and comparing large sets of protein or nucleotide sequences, Bioinformatics, vol. 22, no. 13, pp. 1658-1659, 2006.
[23]
A. Zemla, LGA: A method for finding 3D similarities in protein structures, Nucleic Acids Res., vol. 31, no. 13, pp. 3370-3374, 2003.
[24]
Y. Zhang, A. K. Arakaki, and J. Skolnick, TASSER: An automated method for the prediction of protein tertiary structures in CASP 6, Proteins, vol. 61, no. Supp. l7, pp. 91-98, 2005.
[25]
S. Wu, J. Skolnick, and Y. Zhang, Ab initio modeling of small proteins by iterative TASSER simulations, BMC Biol., vol. 5, p. 17, 2007.
[26]
A. Roy, A. Kucukural, and Y. Zhang, I-TASSER: A unified platform for automated protein structure and function prediction, Nat. Protoc., vol. 5, no. 4, pp. 725-738, 2010.
[27]
A. Sali and T. L. Blundell, Comparative protein modelling by satisfaction of spatial restraints, J. Mol. Biol., vol. 234, no. 3, pp. 779-815, 1993.
[28]
R. Samudrala and M. Levitt, Decoys R Us: A database of incorrect conformations to improve protein structure prediction, Protein Science, vol. 9, no. 7, pp. 1399-1401, 2000.
Tsinghua Science and Technology
Pages 559-567
Cite this article:
He Z, Ma W, Zhang J, et al. A New Hidden Markov Model for Protein Quality Assessment Using Compatibility Between Protein Sequence and Structure. Tsinghua Science and Technology, 2014, 19(6): 559-567. https://doi.org/10.1109/TST.2014.6961026

628

Views

15

Downloads

2

Crossref

N/A

Web of Science

4

Scopus

0

CSCD

Altmetrics

Received: 18 June 2014
Accepted: 25 June 2014
Published: 20 November 2014
The Author(s)
Return