• 沒有找到結果。

Using PSI-BLAST profiles as feature vectors in this study, we have proposed six approaches, which are Fuzzy K-NN, Modified Fuzzy K-NN, QuickRBF, Linear Combination Fusion 1, Linear Combination Fusion 2, and Reliability Index Fusion, to predict relative solvent accessibility of RS126 data set.

In the future study, we can apply dimensionality reduction technique [54] to reflect the structure existent in the data set. Then we can find more reliable distance metrics faithfully from PSSM table to improve the classification accuracy of our fuzzy k-NN method. Besides, we can apply our method on a larger data set, like CB513. Data set growth can give an indirect advantage to our method. And our better modified fuzzy k-NN approach can be selected as a promising approach for various protein applications.

References

[1] B. Rost and C. Sander, “Conservation and prediction of solvent accessibility in protein families,” Proteins, vol. 20, pp. 216–226, 1994.

[2] M.J. Thompson and R.A. Goldstein, “Predicting solvent accessibility: higher accuracy using Bayesian statistics and optimized residue substitution classes,”

Proteins, vol. 25, pp. 38-47, 1996.

[3] K. Ginalski and L. Rychlewski, “Protein structure prediction of CASP5 comparative modeling and fold recognition targets using consensus alignment approach and 3D assessment,” Proteins, vol. 53, pp. 410–417, 2003.

[4] B. John and A. Sali, “Detection of homologous proteins by an intermediate sequence search.” Protein Sci., vol. 13, pp. 54–62, 2004.

[5] J. Moult et al., “Critical assessment of methods of protein structure prediction (CASP)-round V.” Proteins, vol. 53, pp. 334–339, 2003.

[6] C. Sander and R. Schneider, “Database of homology-derived protein structures and the structural meaning of sequence alignment.” Proteins, vol. 9, pp. 56–68, 1991.

[7] J.A. Cuff and G.J. Barton, “Application of multiple sequence alignment profiles to improve protein secondary structure prediction.” Proteins, vol.40, pp.

502–511, 2000.

[8] D. Frishman and P. Argos, “Seventy-five percent accuracy in protein secondary structure prediction.” Proteins, vol. 27, pp. 329–335, 1997.

[9] D.T. Jones, “Protein secondary structure prediction based on position specific scoring matrices.” J. Mol. Biol., vol. 292, pp. 195–202, 1999.

[10] D. Przybylski and B. Rost, “Alignments grow, secondary structure prediction improves.” Proteins, vol. 46, pp. 197–205, 2002.

[11] G. Wohlfahrt et al., “Positioning of anchor groups in protein loop prediction: the importance of solvent accessibility and secondary structure elements.” Proteins, vol. 47, pp. 370–378, 2002.

[12] E. Eyal et al., “Importance of solvent accessibility and contact surfaces in modeling side-chain conformations in proteins.” J. Comput. Chem., vol. 25, pp.

712–724, 2004.

[13] S.J. Russell et al., “Stability of cyclic beta-hairpins: asymmetric contributions from side chains of a hydrogen-bonded cross-strand residue pair.” J. Am. Chem.

Soc., vol. 125, pp. 388–395, 2003.

[14] M. Totrov, “Accurate and efficient generalized born model based on solvent accessibility: derivation and application for LogP octanol/water prediction and flexiblepeptide docking.” J. Comput. Chem., vol. 25, pp. 609–619, 2004.

[15] B. Rost et al., “Protein fold recognition by prediction-based threading.” J. Mol.

Biol., vol. 270, pp. 471–480, 1997.

[16] G. Gianese

et al.,

“Improvement in prediction of solvent accessibility by probability profiles.” Protein Eng., vol. 16, pp. 987–992, 2003.

[17] J. Pei and N.V. Grishin, “Combining evolutionary and structural information for local protein structure prediction.” Proteins, vol. 56, pp.782–794, 2004.

[18] J.M. Keller et al., “A fuzzy k-nearest neighbor algorithm.” IEE Trans. Syst. Man

Cybern., vol. 15, pp. 580–585, 1985.

[19] R. Adamczak et al., “Accurate prediction of solvent accessibility using neural networks-based regression.” Proteins, vol. 56, pp. 753–767, 2004.

[20] S. Ahmad and M.M. Gromiha, “NETASA: neural network based prediction of

[21] S. Ahmad et al., “Real value prediction of solvent accessibility from amino acid sequence.” Proteins, vol. 50, pp. 629–635, 2003.

[22] G. Pollastri et al., “Prediction of coordination number and relative solvent accessibility in proteins.” Proteins, vol. 47, pp. 142–153, 2002.

[23] H. Kim and H. Park, “Prediction of protein relative solvent accessibility with support vector machines and long-range interaction 3D local descriptor.”

Proteins, vol. 54, pp. 557–562, 2004.

[24] Z. Yuan et al., “Prediction of protein solvent accessibility using support vector machines.” Proteins, vol. 48, pp. 566–570, 2002.

[25] Z. Yuan and B. Huang, “Prediction of protein accessible surface areas by support vector regression.” Proteins, vol. 57, pp.558–564, 2004.

[26] A.A. Salamov and V.V. Solovyev, “Protein secondary structure prediction using local alignments.” J. Mol. Biol., vol. 268, pp. 31–36, 1997.

[27] S. Salzberg and S. Cost, “Predicting protein secondary structure with a nearest-neighbor algorithm.” J. Mol. Biol., vol. 227, pp. 371–374, 1992.

[28] T.M. Yi and E.S. Lander, “Protein secondary structure prediction using nearest-neighbor methods.” J. Mol. Biol., vol. 232, pp. 1117–1129, 1993.

[29] S.F. Altschul et al., “Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.” Nucleic Acids Res., vol. 25, pp. 3389–3402, 1997.

[30] M.N. Nguyen and J.C. Rajapakse, “Prediction of protein relative solvent accessibility with a two-stage SVM approach,” Proteins, vol. 59, pp. 30–37, 2005.

[31] J. Sim, S.Y. Kim, and J. Lee, “Prediction of protein solvent accessibility using fuzzy k-nearest neighbor method,” Bioinformatics, vol. 21, pp. 2844–2849, 2005.

of static accessibility,” J. Mol. Biol., vol. 55, pp. 379–400, 1971.

[33] C. Chothia, “The nature of the accessible and buried surfaces in proteins,” J. Mol.

Biol., vol. 105, pp. 1–12, 1976.

[34] M.L. Connolly, “Solvent-accessible surfaces of proteins and nucleic acids,”

Science, vol. 221, pp. 709–713, 1983.

[35] W. Kabsch and C. Sander, “Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features,” Biopolymers, vol. 22, pp. 2577–2637, 1983.

[36] J. Janin, “Surface and inside volumes in globular proteins,” Nature, vol. 277, pp.

491–492, 1979.

[37] G.D. Rose, A.R. Geselowitz, G.J. Lesser, R.H. Lee, and M.H. Zehfus,

“Hydrophobicity of amino acid residues in globular proteins,” Science, vol. 229, pp. 834–838, 1985.

[38] C. Sander, M. Scharf, and R. Schneider, “Design of protein structures in: Protein Engineering: A practical Approach,” Oxford University press, pp. 89–115, 1992.

[39] H. Hirakawa and S. Kuhara, “Prediction of Hydrophobic Cores of Proteins Using Wavelet Analysis,” Genome Inform Ser Workshop Genome Information, vol. 8, pp. 61–70, 1997.

[40] B. Rost and C. Sander, “Combining evolutionary information and neural networks to predict protein secondary structure,” Proteins, vol. 19, pp. 55–72, 1994.

[41] G.J. Barton, “Protein secondary structure prediction,” Curr. Opin. Struct. Biol., vol. 5, pp. 372–376, 1995.

[42] P. Baldi, S. Brunak, P. Frasconi, G. Pollastri, and G. Soda, “Exploiting the past and the future in protein secondary structure prediction,” Bioinformatics, vol. 15,

[43] S. Henikoff and J.G. Henikoff, “Amino acid substitution matrices from protein blocks,” Proc. Natl. Acad. Sci., vol. 89, pp. 10915–10919, 1992.

[44] J.C. Bezdek et al., “Review of MR image segmentation techniques using pattern recognition.” Med. Phys., vol. 20, pp. 1033–1048, 1993.

[45] D. Cabello et al., “Fuzzy k-nearest neighbor classifiers for ventricular arrhythmia detection.” Int. J. Biomed. Comput., vol. 27, pp. 77–93, 1991.

[46] Y. Huang and Y. Li, “Prediction of protein subcellular locations using fuzzy k-NN method.” Bioinformatics, vol. 20, pp. 21–28, 2004.

[47] K. Leszczynski et al., “Application of a fuzzy pattern classifier to decision making in portal verification of radiotherapy.” Phys. Med. Biol., vol. 44, pp.

253–269, 1999.

[48] H. Seker et al., “A fuzzy logic based-method for prognostic decision making in breast and prostate cancers.” IEEE Trans. Inf. Technol. Biomed., vol. 7, pp.

114–122, 2003.

[49] B. Sokolowska et al., “A fuzzy-classifier system to distinguish respiratory patterns evolving after diaphragm paralysis in the cat.” Jpn. J. Physiol., vol. 53, pp. 301–307, 2003.

[50] S. Hua and Z. Sun, “A novel method of protein secondary structure prediction with high segment overlap measure: support vector machine approach.” J. Mol.

Biol., vol. 308, pp. 397–407, 2001.

[51] B. Rost and C. Sander, “Prediction of protein secondary structure at better than 70% accuracy.” J. Mol. Biol., vol. 232, pp. 584–599, 1993.

[52] Y.Y. Ou, “QuickRBF is an Efficient Construction of Radial Basis Function Networks with the Cholesky Decomposition.” http://csie.org/~yien/quickrbf/ . [53] S.K. Riis and A. Krogh, “Improving prediction of protein secondary structure

Biol., vol.3, pp. 163–183, 1996.

[54] S.T. Roweis et al., “Nonlinear Dimensionality Reduction by Locally Linear Embedding.” Science, vol. 290, pp. 2323, 2000.

相關文件