Clever Geek Handbook
📜 ⬆️ ⬇️

Superfamily of proteins

The superfamily of proteins is the largest unit of protein grouping for which a common evolutionary ancestor can be assumed (see homology (biology) ). Usually the superfamily is united according to the principle of similarity of secondary and tertiary structures ("structural similarity") of the proteins included in it [1] and according to the principle of similarity of the mechanism of action of proteins ("mechanistic similarity" or "functional similarity"), despite the fact that amino acid similarities sequences within the superfamily may not be observed and most often not observed [2] . Superfamilies usually contain several protein families that exhibit similar amino acid sequences within families, but not between families. The term “protein clan” is often used to refer to protease superfamilies based on the MEROPS classification [2] .

Content

Superfamily Identification

 
Conservative secondary structures of 80 members of the PA protease superfamily. H is an α-helix , E is a β-barrel , L is a loop. The conservatism of the primary structures for the same alignment is shown below. The arrows represent the residues in the domain of the catalytic triad. Alignment based on secondary structures is taken from the DALI database.
 
Structural homology in the PA protease family. The double β-barrel, characteristic of the superfamily, is highlighted in red. The corresponding characteristic (representative for the corresponding families) structures of some members of the families belonging to the PA protease superfamily are shown. Note that for some members of the superfamily, these characteristic structures are partially modified (that is, the structural similarity is incomplete). Chymotrypsin A, a tobacco virus protease, calcivirin, West Nile fever virus protease, Staphylococcus aureus exfoliative, HtrA protease, plasminogen activator from rattlesnake , chloroplast protease, horse arteritis virus protease are shown.

Sequence Homology

Members of the superfamily of proteins belonging to different families usually do not show any significant or easily detectable homology of amino acid sequences, nevertheless having similar secondary and tertiary structures and often possessing some kind of functional similarity. Often it is almost impossible or at least very difficult to align them with amino acid sequences, due to frequently observed insertions and deletions . For example, in the PA protease superfamily, no amino acid residue is conserved throughout the superfamily, including even those amino acid residues that make up the catalytic triads of these proteins.

Conversely, members of individual families that make up the superfamily are determined precisely by the principle of homology of amino acid sequences (primary structures) of proteins that make up the family. For example, in the PA protease superfamily there is a family of C04 proteases.

Structural homology

The secondary and tertiary structures of proteins that directly affect their functionality (for example, the ability of a receptor to recognize ligands — agonists and antagonists , or the ability of an enzyme to catalyze certain chemical reactions ) are much more evolutionarily conservative than the amino acid sequence. An example of this is the PA protease superfamily. In it, very few amino acid residues of proteins exhibit a significant degree of evolutionary conservatism, while the characteristic elements of the secondary structure, as well as their spatial ordering in tertiary structures, are very highly conservative.

Structural alignment and alignment programs, such as the DALI program, can use the three-dimensional secondary and tertiary structures of the proteins of interest to the researcher with the available database of protein structures in order to find proteins with similar folding . A comparison of three-dimensional secondary and tertiary structures of proteins helps to identify many cases of evolutionarily related (having a common evolutionary ancestor) proteins, the similarity of which would not be revealed by a simple analysis and comparison of their primary structures (amino acid sequences).

Mechanism and functional similarities

The catalytic mechanism of enzymes that are members of the superfamily is usually more or less conservative on the scale of the entire superfamily, however, their substrate specificity can vary significantly on the scale of the superfamily. As a result, the final functional purpose of proteins in the body can significantly vary, depending on what their substrate specificity is. That is, there may not be any external functional similarity.

Also (though not necessarily and not always) the catalytic domain of enzymes that are members of the superfamily tends to a high degree of conservatism of the amino acid sequence within this particular domain (and even more so - the secondary and tertiary structure of the catalytic domain). However, even if the amino acid sequence of the catalytic domain is not preserved, the catalytic mechanism itself and / or the secondary and tertiary structures of the catalytic domain can be preserved.

An example of the above conservation of the catalytic mechanism on the scale of the superfamily is, again, the PA protease superfamily. It contains proteins of very different final functional purpose (that is, there is no external functional similarity) - among them there is the digestive enzyme chymotrypsin , thrombolytic enzyme - plasminogen activator , toxins of bacteria and snakes and viral proteases that ensure the assembly of viral particles. As mentioned above, not a single amino acid residue is evolutionarily conserved on the scale of this superfamily, even in the catalytic domain. Nevertheless, all members of the superfamily have significant similarities between the specific catalytic elements of their three-dimensional structures. Moreover, although members of different PA protease superfamily families use different nucleophiles , they all produce covalent nucleophilic catalytic proteolysis of proteins or peptides by a common catalytic mechanism.

Evolutionary Importance

Superfamilies of proteins reflect the limitations of our current ability to identify a common ancestor [3] . Superfamily of proteins is the largest evolutionarily significant association of similar proteins and genes that can be done at the moment, based on direct evidence of similarity (in the case of superfamilies - mainly structural, and partly by mechanism). Therefore, the discovery of three-dimensional structural similarities of functionally and amino acid different proteins in living creatures that are very far away on the evolutionary ladder is one of the proofs of very ancient (possibly the most ancient among the generally studied) evolutionary events.

Some protein superfamilies contain proteins that are present in one form or another in all living creatures studied (for example, potassium channels ) in all kingdoms (including animals , plants , fungi , bacteria, and other prokaryotes ). This may indicate that the common ancestor of the proteins of these superfamilies was the proteins present in the universal common ancestor of all living beings [4] .

Superfamily members can be found in different species of animals, plants, unicellular creatures ( orthologous proteins). Moreover, the most evolutionarily ancient are, obviously, those members of the superfamily who belong to beings located at the very lowest steps of the evolutionary ladder. The common ancestor for these proteins, obviously, is the protein of some ancient extinct creature, which was a common ancestor for all those species in which proteins representing the superfamily are found. Thus, the study of orthologous proteins within the superfamilies of proteins can allow more accurate drawing of the species evolution tree.

In addition, several different representatives of a certain superfamily of proteins (including those performing different functions) can be found in the same organism of an animal, plant, or unicellular creature of the same species. Such proteins are called paralogous . These paralogous proteins can also come from one common protein ancestor, which at some stage of evolution was duplicated in the genome of this species. The study of paralogous proteins, their similarities and differences, helps to shed light on the path of the functional evolution of proteins.

Examples

  • The PA protease superfamily is the chymotrypsin-like protease superfamily. Superfamily members have a similar structure in the form of a double β-barrel and similar proteolysis mechanisms, however, the degree of amino acid sequence homology is less than 10%. The superfamily contains both cysteine and serine proteases (that is, it uses different nucleophiles ). However, family members have a similar catalytic mechanism and a similar three-dimensional spatial structure of the catalytic domain. [2] [5]
  • Superfamily α / β hydrolases . Members of the superfamily contain a similar structure in the form of an α / β-sheet containing 8 β-strands connected by α-helices , while the amino acid residues of the catalytic domain of all members of the superfamily are in the same place in the same order [6 ] although among the members of the superfamily there are proteases, lipases , peroxidases, esterases , epoxide hydrolases and dehalogenases. [7]
  • Superfamily of Ras-like proteins. Superfamily members share a common catalytic G domain.
  • Superfamily of TIM-barrels. Superfamily members have a common large α 8 β 8 barrel-shaped structure. This is one of the most common foldings among proteins, and therefore the legality of combining them all into one family (the monophilicity of these proteins) is still disputed. [8] [9]
  • Superfamily of alkaline phosphatases . Superfamily members have a common structure resembling an αβα-sandwich, [10] and in addition, they catalyze common phosphatase reactions by a common mechanism. [eleven]

Superfamily protein resources

Several biological databases document protein superfamilies, in particular:

  • Pfam
  • PROSITE
  • Interpro
  • Pass2
  • SUPERFAMILY
  • SCOP and CATH

See also

  • Homology (biology)

Notes

  1. ↑ Holm, L; Rosenström, P. Dali server: conservation mapping in 3D. (Eng.) // Nucleic Acids Research : journal. - 2010. - July ( vol. 38 , no. Web Server issue ). - P. W545-9 . - DOI : 10.1093 / nar / gkq366 . - PMID 20457744 .
  2. ↑ 1 2 3 Rawlings, ND; Barrett, AJ; Bateman, A. MEROPS: the database of proteolytic enzymes, their substrates and inhibitors. (Eng.) // Nucleic Acids Research : journal. - 2012 .-- January ( vol. 40 , no. Database issue ). - P. D343-50 . - DOI : 10.1093 / nar / gkr987 . - PMID 22086950 .
  3. ↑ Shakhnovich, BE; Deeds, E; Delisi, C; Shakhnovich, E. Protein structure and evolutionary history determine sequence space topology. (Eng.) // Genome Research : journal. - 2005 .-- March ( vol. 15 , no. 3 ). - P. 385-392 . - DOI : 10.1101 / gr . 313605 . - PMID 15741509 .
  4. ↑ Ranea, JA; Sillero, A; Thornton, JM; Orengo, CA Protein superfamily evolution and the last universal common ancestor (LUCA). (English) // Journal of molecular evolution : journal. - 2006 .-- October ( vol. 63 , no. 4 ). - P. 513-525 . - DOI : 10.1007 / s00239-005-0289-7 . - PMID 17021929 .
  5. ↑ Bazan, JF; Fletterick, RJ Viral cysteine ​​proteases are homologous to the trypsin-like family of serine proteases: structural and functional implications. (Eng.) // Proceedings of the National Academy of Sciences of the United States of America : journal. - 1988 .-- November ( vol. 85 , no. 21 ). - P. 7872-7876 . - DOI : 10.1073 / pnas.85.21.7872 . - PMID 3186696 .
  6. ↑ Carr PD, Ollis DL Alpha / beta hydrolase fold: an update (unopened) // Protein Pept. Lett .. - 2009. - T. 16 , No. 10 . - S. 1137-1148 . - PMID 19508187 .
  7. ↑ Nardini M., Dijkstra BW Alpha / beta hydrolase fold enzymes: the family keeps growing (English) // Curr. Opin. Struct. Biol. : journal. - 1999 .-- December ( vol. 9 , no. 6 ). - P. 732-737 . - DOI : 10.1016 / S0959-440X (99) 00037-8 . - PMID 10607665 .
  8. ↑ Nagano, N; Orengo, CA; Thornton, JM One fold with many functions: the evolutionary relationships between TIM barrel families based on their sequences, structures and functions. (Eng.) // Journal of Molecular Biology : journal. - 2002 .-- 30 August ( vol. 321 , no. 5 ). - P. 741-765 . - DOI : 10.1016 / s0022-2836 (02) 00649-6 . - PMID 12206759 .
  9. ↑ Farber, G. An α / β-barrel full of evolutionary trouble (neopr.) // Current Opinion in Structural Biology. - 1993. - T. 3 , No. 3 . - S. 409-412 . - DOI : 10.1016 / S0959-440X (05) 80114-9 .
  10. ↑ SCOP (unopened) . Date of treatment May 28, 2014.
  11. ↑ Mohamed, MF; Hollfelder, F. Efficient, crosswise catalytic promiscuity among enzymes that catalyze phosphoryl transfer. (English) // Biochimica et Biophysica Acta : journal. - 2013 .-- January ( vol. 1834 , no. 1 ). - P. 417-424 . - DOI : 10.1016 / j.bbapap.2012.07.07.015 . - PMID 22885024 .


Source - https://ru.wikipedia.org/w/index.php?title= Protein Superfamily&oldid = 101034460


More articles:

  • Rogel, Shpela
  • Baghdad (football club, Uzbekistan)
  • March for the Emperor
  • Kellah Tosah
  • Leonov, Boris Fedorovich
  • Mayachinsky Village Council
  • Meganeuvers
  • An-24 disaster near Szczecin
  • Woodville, Richard Keyton (Senior)
  • Temnothorax mpala

All articles

Clever Geek | 2019