Chapter 4 Conclusion
4.2 Major contributions and future works
Recently, there are rapidly increasing number of reliable PPIs which are recorded in many databases, such as IntAct, DIP, MIPS, MINT, and BioGRID. We identified the PPI family which consists of a group of homologous PPIs to provide a description of the functional and evolutionary relationships of PPIs. The new PPI family, which combines the PPI families and SB-PPI families, may be more and more complete and reliable while more reliable PPIs and 3D-structure are identified. Additionally, The PPI family could help biologists to study the comparison of biochemical networks across multiple species. The cross-species network comparison can be used to identify the corresponding pathways from one organism to another.
For systems biology, the comparison of networks provides clues for understanding some important issue, such as evolution of networks.
Some important issues will be discussed in the future. The PPI family could combine with the SB-PPI family for verifying the reliability of the members in a family in older to building the reliable PPI family. Moreover, we also want to know whether this reliable PPI family may have hierarchical relationship as the protein family (e.g., SCOP). Figure 16 shows the hypothesis of the hierarchical relationship in PPI families. A PPI superfamily may consist of more than one PPI family and some PPI families further include many core families. This relationship implies that the evolution may be involved in a protein interacting with another protein. Our major assumption, which considered that a group of homologous PPIs may be from a common ancestry, could be further proved when co-evolution occurs in homologous PPIs of the reliable PPI family.
45
Figure 16. The hypothesis of the hierarchical relationship between PPI families.
SuperFamily
Family 1
Core family 1
Core family 2
Family 2
Core family 1
Family n
…
Figure 16. The hypothesis of the hierarchical relationship between PPI families.
Based on the methods of PPI family and SB-PPI family, we also could verify the PPI in a family, especially these PPIs from large-scale experiments. Each PPI of a family may be scored through a new scoring function in the future. This scoring function may be composed of several scores. For example, the reliability score may consider the existence of crystal structure and the PPI identified by small-scale experiments or many different large-scale experiments. The interface (or interacting domain pair) is highly conserved and important for protein-protein interacting, and the similarity of interface could provide a clue to evaluate a PPI. In addition, the conserved biological properties, such as GO terms, also could be considered biological meaning score according to their highly conservation in our studies. It is useful to construct the reliable PPI family and offer biologists to realize evolutions of homologous PPIs and PPI families.
46
References
1 Finn, R. D. et al. The Pfam protein families database. Nucleic Acids Research 38, D211-222 (2010).
2 Wu, C. H. et al. PIRSF: family classification system at the Protein Information Resource. Nucleic Acids Research 32, D112-114 (2004).
3 Lo Conte, L. et al. SCOP: a structural classification of proteins database. Nucleic Acids Research 28, 257-259 (2000).
4 Greene, L. H. et al. The CATH domain structure database: new protocols and classification levels give a more comprehensive resource for exploring evolution.
Nucleic Acids Research 35, D291-297 (2007).
5 Edwards, A. M. et al. Bridging structural biology and genomics: assessing protein interaction data with known complexes. Trends in Genetics 18, 529-536 (2002).
6 Kemmeren, P. et al. Protein interaction verification and functional annotation by integrated analysis of genome-scale data. Molecular Cell 9, 1133-1143 (2002).
7 Pellegrini, M., Marcotte, E. M., Thompson, M. J., Eisenberg, D. & Yeates, T. O.
Assigning protein functions by comparative genome analysis: protein phylogenetic profiles. Proceedings of the National Academy of Sciences of the United States of America 96, 4285-4288 (1999).
8 Chen, Y. C., Lo, Y. S., Hsu, W. C. & Yang, J. M. 3D-partner: a web server to infer interacting partners and binding models. Nucleic Acids Research 35, W561-567 (2007).
9 Yu, H. et al. Annotation transfer between genomes: protein-protein interologs and protein-DNA regulogs. Genome Research 14, 1107-1118 (2004).
10 Shoemaker, B. A. & Panchenko, A. R. Deciphering protein-protein interactions. Part I.
Experimental techniques and databases. PLoS Computational Biology 3, e42 (2007).
11 Kerrien, S. et al. IntAct--open source resource for molecular interaction data. Nucleic Acids Research 35, D561-565 (2007).
12 Stark, C. et al. BioGRID: a general repository for interaction datasets. Nucleic Acids Research 34, D535-539 (2006).
13 Salwinski, L. et al. The Database of Interacting Proteins: 2004 update. Nucleic Acids Research 32, D449-451 (2004).
47
14 Mewes, H. W. et al. MIPS: analysis and annotation of genome information in 2007.
Nucleic Acids Research 36, D196-201 (2008).
15 Ceol, A. et al. MINT, the molecular interaction database: 2009 update. Nucleic Acids Research 38, D532-539 (2010).
16 Patil, A. & Nakamura, H. Filtering high-throughput protein-protein interaction data using a combination of genomic features. BMC Bioinformatics 6, 100 (2005).
17 Saeed, R. & Deane, C. An assessment of the uses of homologous interactions.
Bioinformatics 24, 689-695 (2008).
18 Chen, C. C., Lin, C. Y., Lo, Y. S. & Yang, J. M. PPISearch: a web server for searching homologous protein-protein interactions across multiple species. Nucleic Acids Research 37, W369-375 (2009).
19 Aloy, P. & Russell, R. B. Interrogating protein interaction networks through structural biology. Proceedings of the National Academy of Sciences of the United States of America 99, 5896-5901 (2002).
20 Lu, L., Lu, H. & Skolnick, J. MULTIPROSPECTOR: an algorithm for the prediction of protein-protein interactions by multimeric threading. Proteins 49, 350-364 (2002).
21 Stein, A., Panjkovich, A. & Aloy, P. 3did Update: domain-domain and peptide-mediated interactions of known 3D structure. Nucleic Acids Research 37, D300-304 (2009).
22 Finn, R. D., Marshall, M. & Bateman, A. iPfam: visualization of protein-protein interactions in PDB at domain and amino acid resolutions. Bioinformatics 21, 410-412 (2005).
23 Chen, Y. C., Chen, H. C. & Yang, J. M. DAPID: a 3D-domain annotated protein-protein interaction database. Genome Informatics 17, 206-215 (2006).
24 Deshpande, N. et al. The RCSB Protein Data Bank: a redesigned query system and relational database based on the mmCIF schema. Nucleic Acids Research 33, D233-237 (2005).
25 Kersey, P. et al. Integr8 and Genome Reviews: integrated views of complete genomes and proteomes. Nucleic Acids Research 33, D297-302 (2005).
26 Ashburner, M. et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nature Genetics 25, 25-29 (2000).
27 Bork, P. et al. Predicting function: from genes to genomes and back. Journal of Molecular Biology 283, 707-725 (1998).
48
28 Marcotte, E. M., Pellegrini, M., Thompson, M. J., Yeates, T. O. & Eisenberg, D. A
combined algorithm for genome-wide prediction of protein function. Nature 402, 83-86 (1999).
29 Matthews, L. R. et al. Identification of potential interaction networks using sequence-based searches for conserved protein-protein interactions or "interologs".
Genome Research 11, 2120-2126 (2001).
30 Kriventseva, E. V., Fleischmann, W., Zdobnov, E. M. & Apweiler, R. CluSTr: a database of clusters of SWISS-PROT+TrEMBL proteins. Nucleic Acids Research 29, 33-36 (2001).
31 Wu, X., Zhu, L., Guo, J., Zhang, D. Y. & Lin, K. Prediction of yeast protein-protein interaction network: insights from the Gene Ontology and annotations. Nucleic Acids Research 34, 2137-2150 (2006).
32 Gavin, A. C. et al. Proteome survey reveals modularity of the yeast cell machinery.
Nature 440, 631-636 (2006).
33 Bonifacino, J. S. & Traub, L. M. Signals for sorting of transmembrane proteins to endosomes and lysosomes. Annual Review of Biochemistry 72, 395-447 (2003).
34 Heldwein, E. E. et al. Crystal structure of the clathrin adaptor protein 1 core.
Proceedings of the National Academy of Sciences of the United States of America 101, 14108-14113 (2004).
35 Tillier, E. R. & Charlebois, R. L. The human protein coevolution network. Genome Research 19, 1861-1871 (2009).
36 Hart, P. J. et al. Crystal structure of the human TbetaR2 ectodomain--TGF-beta3 complex. Nature Structural Biology 9, 203-208 (2002).
37 Rubin, S. M., Gall, A. L., Zheng, N. & Pavletich, N. P. Structure of the Rb C-terminal domain bound to E2F1-DP1: a mechanism for phosphorylation-induced E2F release.
Cell 123, 1093-1106 (2005).
38 Massague, J., Blain, S. W. & Lo, R. S. TGFbeta signaling in growth control, cancer, and heritable disorders. Cell 103, 295-309 (2000).
39 Massague, J. TGF-beta signal transduction. Annual Review of Biochemistry 67, 753-791 (1998).
40 Bisgrove, B. W., Essner, J. J. & Yost, H. J. Regulation of midline development by antagonism of lefty and nodal signaling. Development 126, 3253-3262 (1999).
41 Pawson, T. & Nash, P. Assembly of cell regulatory systems through protein interaction domains. Science 300, 445-452 (2003).
49
42 Bornberg-Bauer, E., Beaussart, F., Kummerfeld, S. K., Teichmann, S. A. & Weiner, J.,
3rd. The evolution of domain arrangements in proteins and interaction networks.
Cellular and Molecular Life Sciences 62, 435-445 (2005).
43 Aloy, P. et al. Structure-based assembly of protein complexes in yeast. Science 303, 2026-2029 (2004).
44 Davis, F. P. et al. Protein complex compositions predicted by structural similarity.
Nucleic Acids Research 34, 2943-2952 (2006).
45 Huang, L. S. et al. 3-nitropropionic acid is a suicide inhibitor of mitochondrial respiration that, upon oxidation by complex II, forms a covalent adduct with a catalytic base arginine in the active site of the enzyme. Journal of Biological Chemistry 281, 5965-5972 (2006).
50
51
Appendix A
List of publications
Journal papers
1. Chen, C.-C., Lin, C.-Y., Lo, Y.-S. and Yang, J.-M. PPISearch: a web server for searching homologous protein-protein interactions across multiple species. Nucleic Acids Research 37:W376-W383 (2009). (Impact factor: 6.878)
2. Lo, Y.-S., Lin, C.-Y. and Yang, J.-M. (2010) PCFamily: a web server for searching homologous protein complexes. Nucleic Acids Research (Accepted)