Taxonomic diversity-based domain interaction prediction
Abstract
Identification of protein domain-domain
interactions (DDIs) is an essential step in understanding proteins’ functional
and structural roles. MirrorTree is a DDI prediction method that is based on
the principle of interacting proteins’ co-evolution. However, this method is
sensitive to taxonomic diversity and evolutionary span within the two protein
homolog sets compared to predict DDI. In this work, we propose a new
MirrorTree-based DDI prediction method, namely Taxonomic Diversity-based Domain
Interaction Prediction (TAXDIP). TAXDIP improves the MirrorTree method by
adding a sampling step that favors representation of higher-level taxonomic
ranks (e.g. family over species) in two protein homolog sets prior to their
comparison. This additional step ensures increased evolutionary span within
protein homolog sets. TAXDIP is first assessed using a set containing 6,514
positive (interacting) domain pairs and a negative (non-interacting) set of
equal size containing randomly generated domain pairs with no known
interactions. TAXDIP achieved 71.0% sensitivity and 63.0% specificity on this
set. Next, a benchmark-set containing
500 interacting and 500 non-interacting domain pairs is used to compare the
performance of TAXDIP against DDI prediction methods ME and RDFF. TAXDIP showed better sensitivity and
specificity than RDFF. While TAXDIP’s sensitivity is better than ME, its
specificity remained below ME. In conclusion, TAXDIP, with its performance, is
a viable alternative to existing prediction methods. Furthermore, given
TAXDIP’s true predictions are overlapping with, and furthermore, complementing
other DDI prediction methods, TAXDIP has a strong position in becoming part of
a meta-DDI prediction method that combines multiple methods to build a
consensus prediction.
Keywords
References
- Sprinzak E, Altuvia Y, Margalit H. “Characterization and prediction of protein-protein interactions within and between complexes”. Proceedings of the National Academy of Sciences of the United States of America, 103(40), 14718-14723, 2006.
- Deng M, Mehta S, Sun F, Chen T. “Inferring domain-domain interactions from protein-protein interactions”. Genome Research, 12(10), 1540-8, 2002.
- Jothi R, Cherukuri PF, Tasneem A, Przytycka TM. “Co-evolutionary analysis of domains in interacting proteins reveals insights into domain-domain interactions mediating protein-protein interactions”. Journal of Molecular Biology, 362(4), 861-875, 2006.
- Gonzalez AJ, Liao L. “Predicting domain-domain interaction based on domain profiles with feature selection and support vector machines”. BMC Bioinformatics, 11(537), 2-14, 2010.
- Chen XW, Liu M. “Prediction of protein-protein interactions using random decision forest framework”. Bioinformatics, 21(24), 4394-4400, 2005.
- Pagel P, Wong P, Frishman D. “A domain interaction map based on phylogenetic profiling”. Journal of Molecular Biology, 344(5), 1331-46, 2004.
- Gomez SM, Rzhetsky A. “Towards the prediction of complete protein-protein interaction networks”. Pacific Symposium on Biocomputing Pacific Symposium on Biocomputing, Kauai, Hawaii, 3-7 January 2002.
- Nye TM, Berzuini C, Gilks WR, Babu MM, Teichmann SA. “Statistical analysis of domains in interacting protein pairs”. Bioinformatics, 21(7), 993-1001, 2005.
Details
Primary Language
English
Subjects
Engineering
Journal Section
Research Article
Publication Date
April 22, 2019
Submission Date
April 19, 2018
Acceptance Date
-
Published in Issue
Year 2019 Volume: 25 Number: 2