Function inference for interacting proteins through protein signatures
Year
: 2007
Abstract: Abstract
Signatures are functional sub-units of proteins’ primary structures. They include domains, motifs, and binding sites. Signatures play an important role in determining the interactions web of a particular protein and thereby, general or specific function(s) of the protein can be inferred. Commonality of signatures among proteins was used as the key information to predict the web of interactions of all proteins in S. cerevisiae, C. elegans, and H. sapiens and then to infer function for interacting proteins. Known signatures in the three studied organisms were retrieved from Pfam database. Next, each protein was represented by a feature vector, called signature profile, indicating the presence or absence of known signatures in the amino acid sequence of that protein. Then, the similarity between signature profiles was calculated based on the Binary Similarity Function (BSF) on a pair-wise fashion. The BSF, scored the protein pairs and if the score was higher than a threshold the two proteins in the pair were predicted to be interacting. The threshold was specified in association with a confidence level. In cases where one protein in the pair has known function, the interacting partner was assigned a similar function. As proteins interact with more than one protein in cellular systems, the partner with the highest BSF score was considered as the most likely, and the function was inferred accordingly. Based on this methodology, 1024 new interactions were identified in C. elegans genome, as an example, and the unknown proteins involved in these interactions were categorized into 25 general functions.
Signatures are functional sub-units of proteins’ primary structures. They include domains, motifs, and binding sites. Signatures play an important role in determining the interactions web of a particular protein and thereby, general or specific function(s) of the protein can be inferred. Commonality of signatures among proteins was used as the key information to predict the web of interactions of all proteins in S. cerevisiae, C. elegans, and H. sapiens and then to infer function for interacting proteins. Known signatures in the three studied organisms were retrieved from Pfam database. Next, each protein was represented by a feature vector, called signature profile, indicating the presence or absence of known signatures in the amino acid sequence of that protein. Then, the similarity between signature profiles was calculated based on the Binary Similarity Function (BSF) on a pair-wise fashion. The BSF, scored the protein pairs and if the score was higher than a threshold the two proteins in the pair were predicted to be interacting. The threshold was specified in association with a confidence level. In cases where one protein in the pair has known function, the interacting partner was assigned a similar function. As proteins interact with more than one protein in cellular systems, the partner with the highest BSF score was considered as the most likely, and the function was inferred accordingly. Based on this methodology, 1024 new interactions were identified in C. elegans genome, as an example, and the unknown proteins involved in these interactions were categorized into 25 general functions.
Keyword(s): Protein signatures
function inference
Collections
:
-
Statistics
Function inference for interacting proteins through protein signatures
Show full item record
contributor author | محمود اخوان مهدوی | en |
contributor author | Yen-Han Lin | en |
contributor author | Mahmood Akhavan Mahdavi | fa |
date accessioned | 2020-06-06T13:51:04Z | |
date available | 2020-06-06T13:51:04Z | |
date copyright | 10/28/2007 | |
date issued | 2007 | |
identifier uri | https://libsearch.um.ac.ir:443/fum/handle/fum/3371822?locale-attribute=en | |
description abstract | Abstract Signatures are functional sub-units of proteins’ primary structures. They include domains, motifs, and binding sites. Signatures play an important role in determining the interactions web of a particular protein and thereby, general or specific function(s) of the protein can be inferred. Commonality of signatures among proteins was used as the key information to predict the web of interactions of all proteins in S. cerevisiae, C. elegans, and H. sapiens and then to infer function for interacting proteins. Known signatures in the three studied organisms were retrieved from Pfam database. Next, each protein was represented by a feature vector, called signature profile, indicating the presence or absence of known signatures in the amino acid sequence of that protein. Then, the similarity between signature profiles was calculated based on the Binary Similarity Function (BSF) on a pair-wise fashion. The BSF, scored the protein pairs and if the score was higher than a threshold the two proteins in the pair were predicted to be interacting. The threshold was specified in association with a confidence level. In cases where one protein in the pair has known function, the interacting partner was assigned a similar function. As proteins interact with more than one protein in cellular systems, the partner with the highest BSF score was considered as the most likely, and the function was inferred accordingly. Based on this methodology, 1024 new interactions were identified in C. elegans genome, as an example, and the unknown proteins involved in these interactions were categorized into 25 general functions. | en |
language | English | |
title | Function inference for interacting proteins through protein signatures | en |
type | Conference Paper | |
contenttype | External Fulltext | |
subject keywords | Protein signatures function inference | en |
identifier link | https://profdoc.um.ac.ir/paper-abstract-1007125.html | |
conference title | The 57th Canadian Chemical Engineering Conference | en |
conference location | Edmonton | fa |
identifier articleid | 1007125 |