Pathways & interactions
Short name: Retropepsins
Overlapping homologous superfamilies
- Aspartic peptidase domain superfamily (IPR021109)
This group of aspartic peptidases belong to the peptidase clan AA. The clan includes the single domain aspartic proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses) which are active as homodimers. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin [PMID: 1851433, PMID: 8841139].
Family A2 includes the peptidase (retropepsin, EC 126.96.36.199) from the human immunodeficiency virus and other retroviruses. In most retroviruses, the peptidase is encoded as a segment of a polyprotein (usually the pol polyprotein, which includes the peptidase, a reverse transcriptase, RNase H and an integrase, but occassionally the gag polyprotein) which it cleaves during viral maturation to release individual proteins. Some retrotransposon polyproteins also contain a homologous, retropepsin-like peptidase which is also a member of family A2.
Family A3 includes peptidases from the double-stranded DNA plant viruses known as badnaviruses or pararetroviruses. The viral genome includes genes (ORFs IV and V) that encodes polyproteins. The ORF V polyprotein contains the peptidase and a reverse transcriptase. The peptidase processes the ORF IV polyprotein, which includes the viral coat protein [PMID: 15831103].
Family A9 includes peptidases from spumaretroviruses, and the peptidase is a component of either the gag and pol polyprotein, which is processes [PMID: 9311808]. The structure has been solved for the peptidase from simian foamy virus and shows a retropepsin-like fold [PMID: 18597783].
Family A11 includes polyprotein-processing peptidases from retrotransposons such as the copia transposon from Drosophila melanogaster. No tertiary structure has been solved for any member of the family, and family A11 is included in clan AA on the basis of the similar motif around the active site Asp.
Family A28 includes the yeast DNA-damage inducible protein 1 which is a component of the DNA repair pathway. The tertiary structure shows a retropepsin-like fold [PMID: 17010377]. This peptidase is not a component of a polyprotein.
Family A32 includes the bacterial PerP peptidase which converts the transmembrane factor PodJ from a form that recruits proteins for pilus formation, to a truncated form that recruits proteins for stalk formation. This converts the bacterium from a motile form to the sessile form found in biofilms [PMID: 16395329].
Aspartic peptidases, also known as aspartyl proteases (EC:3.4.23.-), are widely distributed proteolytic enzymes [PMID: 6795036, PMID: 2194475, PMID: 1851433] known to exist in vertebrates, fungi, plants, protozoa, bacteria, archaea, retroviruses and some plant viruses. All known aspartic peptidases are endopeptidases. A water molecule, activated by two aspartic acid residues, acts as the nucleophile in catalysis. Aspartic peptidases can be grouped into five clans, each of which shows a unique structural fold [PMID: 8439290].
- Peptidases in clan AA are either bilobed (family A1 or the pepsin family) or are a homodimer (all other families in the clan, including retropepsin from HIV-1/AIDS) [PMID: 2682266]. Each lobe consists of a single domain with a closed beta-barrel and each lobe contributes one Asp to form the active site. Most peptidases in the clan are inhibited by the naturally occurring small-molecule inhibitor pepstatin [PMID: 4912600].
- Clan AC contains the single family A8: the signal peptidase 2 family. Members of the family are found in all bacteria. Signal peptidase 2 processes the premurein precursor, removing the signal peptide. The peptidase has four transmembrane domains and the active site is on the periplasmic side of the cell membrane. Cleavage occurs on the amino side of a cysteine where the thiol group has been substituted by a diacylglyceryl group. Site-directed mutagenesis has identified two essential aspartic acid residues which occur in the motifs GNXXDRX and FNXAD (where X is a hydrophobic residue) [PMID: 10497172]. No tertiary structures have been solved for any member of the family, but because of the intramembrane location, the structure is assumed not to be pepsin-like.
- Clan AD contains two families of transmembrane endopeptidases: A22 and A24. These are also known as "GXGD peptidases" because of a common GXGD motif which includes one of the pair of catalytic aspartic acid residues. Structures are known for members of both families and show a unique, common fold with up to nine transmembrane regions [PMID: 21765428]. The active site aspartic acids are located within a large cavity in the membrane into which water can gain access [PMID: 23254940].
- Clan AE contains two families, A25 and A31. Tertiary structures have been solved for members of both families and show a common fold consisting of an alpha-beta-alpha sandwich, in which the beta sheet is five stranded [PMID: 10331925, PMID: 10864493].
- Clan AF contains the single family A26. Members of the clan are membrane-proteins with a unique fold. Homologues are known only from bacteria. The structure of omptin (also known as OmpT) shows a cylindrical barrel containing ten beta strands inserted in the membrane with the active site residues on the outer surface [PMID: 11566868].
- There are two families of aspartic peptidases for which neither structure nor active site residues are known and these are not assigned to clans. Family A5 includes thermopsin, an endopeptidase found only in thermophilic archaea. Family A36 contains sporulation factor SpoIIGA, which is known to process and activate sigma factor E, one of the transcription factors that controls sporulation in bacteria [PMID: 21751400].
- PF00077 (RVP)