Please use this identifier to cite or link to this item:
http://acervodigital.unesp.br/handle/11449/116863
- Title:
- An improved interolog mapping-based computational prediction of protein protein interactions with increased network coverage
- Universidade Federal de Minas Gerais (UFMG)
- Universidade Estadual Paulista (UNESP)
- Inst Integrat Om & Appl Biotechnol
- Fed Univ Para
- 1757-9694
- CENAPAD-MG
- Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
- Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
- Fundação de Amparo à Pesquisa do Estado de Minas Gerais (FAPEMIG)
- Automated and efficient methods that map ortholog interactions from several organisms and public databases (pDB) are needed to identify new interactions in an organism of interest (interolog mapping). When computational methods are applied to predict interactions, it is important that these methods be validated and their efficiency proven. In this study, we compare six Blast+ metrics over three datasets to identify the best metric for protein protein interaction predictions. Using Blast+ to align the protein pairs, the ortholog interactions from DIP were mapped to String, Intact and Psibase pDBs. For each interaction mapped to each pDBs, we retrieved the alignment score, e-value, bitscore, similarity, identity and coverage. We evaluated these Blast+ values, and combinations thereof, with the Receiver Operating Characteristic (ROC) curves and computed the Area Under Curve (AUC). To validate these predictions, we used a subset of the Database of Interacting Proteins (DIP) composed of experimental interactions curated by the International Molecular Exchange (IMEx). The cut-off point for each metric/pDB was computed aiming to identify the best one that separates the true and false predicted interactions. In contrast to other methods that only compute the first Blast hit, we considered the first 20 hits, thus increasing the number of predicted interaction pairs. In addition, we identified the contribution of each individual pDB, as well as their combined contribution to the prediction. The best metric had an AUC of 0.96 for a single pDB and AUC of 0.93 for combined pDBs. Compared to other studies, with a cut-off point of 0.70 representing a specificity of 0.95 and a sensitivity of 0.90 for individual pDB, our method efficiently predicts protein protein interactions.
- 1-Nov-2014
- Integrative Biology. Cambridge: Royal Soc Chemistry, v. 6, n. 11, p. 1080-1087, 2014.
- 1080-1087
- Royal Soc Chemistry
- http://dx.doi.org/10.1039/c4ib00136b
- Acesso restrito
- outro
- http://repositorio.unesp.br/handle/11449/116863
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.